PREFACE


When one looks back over the history of science, the successes are 
likely to be stressed and the failures forgotten. Thus one tends to see 
science as starting with a sure sense of direction and progressing neatly 
to its present form. Or so it is for the older and well established 
branches of science; but not for psychology. Psychology has not one 
sure sense of direction but several quite unsure directions. Growth is
erratic and there is much casting about for the most crucial problems 
and the most powerful methods. These apparent differences between 
psychology and the older branches of science may result from the 
difficulty of developing a science of man; it is perhaps significant that 
many of the problems of psychology were not attacked by the methods 
of science until so late a date in history. Or the differences may be an 
illusion resulting from the much closer view we have of the beginning 
struggles to develop a science of psychology than we now have of the 
beginning efforts in the older sciences.

Certainly psychology has its problems, and they are not easy. 
Nevertheless, knowledge has grown rapidly in the short history of 
man’s efforts to develop a science of behavior, and the time seems ap- 
propriate for a major effort to examine the progress that has been 
made in attempting to find a way, or ways, to the attainment of the
explanatory power that we like to think of as characteristic of science. 
A growing body of empirical information, a serious concern over 
methodological issues, and a variety of efforts to bring a selected body 
of fact into the organizing framework of theory all emphasize the need 
for that line of questioning—always going on in science—which 
explores the shape of knowledge, the range and inner connections of 
the ideas through which it has been developed and organized, the 
changing substructures of empirical data, and their emerging relations 
to each other and to the findings of other sciences. The seven volumes 
of Psychology: A Study of a Science are a response to this need. 

The first three volumes, which bear the collective title Study I. 
Conceptual and Systematic, are concerned with many of the systematic 
formulations of recent and current influence which psychologists have 
developed to account for the phenomena in which they are interested. 


Each systematic position is analyzed by its originator, or a person con- 
nected with its development, in a way which gives attention to the 
problems it seeks to solve, the empirical basis on which it rests, its 
degree of success, and its relations to other formulations. 

A second set of three volumes, collectively called Study II. Empirical 
Substructure and Relations with Other Sciences, inquires, again through 
the efforts of creatively active investigators, into the organization of 
various fields of empirical knowledge, the relations of one to another, 
and to work going forward in other sciences. It also examines such 
problems in reverse through the participation of social and biological 
scientists who consider the relations of their own special fields to vari- 
ous parts of psychology. The three volumes of Study II, now in prepa- 
ration, will be published at a later date. 

Volume 7—Psychology and the Human Agent—will present the Study 
Director’s view of certain problems of psychological inquiry in the light 
of the findings of the project. 

Primary credit for the initiation of these studies goes to the Asso- 
ciation’s Policy and Planning Board, which decided in 1952 that the 
time had come for a thorough and critical examination of the status 
and development of psychology. The National Science Foundation 
agreed upon the desirability of such an undertaking and has gener- 
ously supported the effort. When funds from the National Science 
Foundation were found to be insufficient for all of the expenses of the 
studies, the American Psychological Association provided the sup- 
plementary funds necessary to complete the work. 

From the beginning, the study was divided into two parts. One part 
dealt with the education of psychologists and the factors conducive to 
research productivity in psychology. That part was directed by Profes- 
sor Kenneth Clark of the University of Minnesota, who has reported 
the findings in America’s Psychologists: A Survey of a Growing Profession, 
published by the American Psychological Association in 1957. 

The other part, the part with which the present series of volumes is 
concerned, has dealt with the substance of psychological thought and 
data. Professor Sigmund Koch of Duke University has been responsible 
for this part of the study. Working closely with him has been a panel of 
consultants consisting of Lyle H. Lanier, Howard H. Kendler, Conrad 
G. Mueller, and Karl E. Zener. These men, but chiefly Dr. Koch, have 
planned, organized, interpreted and edited the work, and successfully 
enlisted the cooperation of the approximately 80 authors whose origi- 
nal papers will constitute the basic material of the series. 

In the background, at a safe distance from the labors that have
sometimes engulfed Dr. Koch, his panel of consultants, and the primary
authors, has been a steering committee on which I had the pleasure
of serving as chairman, and having as colleagues Clarence H.
Graham, Lyle H. Lanier, Robert B. MacLeod, Eliot H. Rodnick, M. 
Brewster Smith, and Robert L. Thorndike. The steering committee 
helped to make administrative arrangements and helped to decide 
on the scope of the studies, but takes no credit for their successful 
completion. 

In the preface to America’s Psychologists we have already acknowl- 
edged our gratitude to Kenneth Clark and his collaborators who 
helped to produce that volume. It is our final pleasant duty to express 
our thanks to Duke University for making Dr. Koch’s time available; 
to the National Science Foundation for its necessary and generous 
financial support and for the counsel and support of John T. Wilson, 
Assistant Director for the Biological Sciences; to Lyle H. Lanier, 
Howard H. Kendler, Conrad G. Mueller, and Karl E. Zener for their 
critical and devoted help; to all of the authors whose names appear on 
the title pages for their original contributions; and—most of all—to 
Sigmund Koch for directing and driving through to completion what 
we hope will be an oft-consulted aid to the scholars and research 
workers who are striving to increase the rigor and further the develop- 
ment of scientific psychology. 

Dael Wolfle, CHAIRMAN 
STEERING COMMITTEE 
POLICY AND PLANNING BOARD 
INTRODUCTION TO VOLUME 2


Psychology: A Study of a Science is a report of inquiries into the 
status and tendency of psychological science. Some eighty distinguished 
authors have contributed sustained essays which consider: (Study I) 
major theoretical formulations of recent importance; and (Study II) the
structure, mutual interrelations, and associations with other sciences of 
the main empirical areas in which psychological research is pursued. The 
findings of Study I Conceptual and Systematic comprise the initial 
three volumes of the series; Study II Empirical Substructure and Rela- 
tions with Other Sciences is reported in the following three volumes. A 
final volume by the Study Director—Psychology and the Human Agent 
—includes commentary on the significance of the findings. 

The present volume is the second in the series and is part of Study I. 
Each of the twelve essays in this book is a self-contained presentation 
which may be read with profit independently of the others, or of the con- 
tents of other volumes. Yet the reader must bear in mind that the present 
volume is only a fragment of Study I, and that Study I is part of a 
larger enterprise having certain unifying values, aims, and methods. 
For a conception of these latter, the reader is referred to Dael Wolfle’s 
Preface in Volume 1, and to the two introductory sections—one for the 
series, the other specifically for Study I—also contained in that volume.

As an immediate aid to the reader, a resumé of the Study’s design is 
given. 

Study I Conceptual and Systematic. This study involved the in- 
tensive analysis of thirty-four “systematic formulations” of widely varying 
type and subject-matter reference and all of established influence in 
recent psychology. A “systematic formulation” was defined quite gen- 
erally as “any set of sentences formulated as a tool for ordering knowl- 
edge with respect to some specified domain of events, or furthering the 
discovery of such knowledge”: in applying this definition, care was taken 
that no formulation be precluded by nonconformity to standardized con- 
ceptions of the nature of “theory.” Since each systematic formulation is 
the end-product of a human effort to see and state order in a given do- 
main, each analysis was made either by the originator(s) of the formula- 
tion in question or (in a few cases) by individuals creatively associated 
with the development of formulations of which they were not the primary
authors. 

Each systematist was invited to approach his work with certain com- 
mon themes of analysis in mind. These were designed to invite a con- 
vergence of insight on those problems of systematization which had 
emerged from the practice of the past three decades, more or less. Some 
of the suggested problems had been conspicuous in previous “meta- 
systematic” discussion, but required in our opinion exposure to a wider 
range of systematically schooled sensibilities. Others were problems that 
seemed critically posed by recent systematic work, yet ones which had 
received little or no explicit attention. 

The dominating hope was for analyses that might illumine the rela- 
tions between the creative processes of systematizing and their publicly 
expressed products. It was thus hoped that the atmosphere of the study 
might encourage as much concern with background influences, orienting 
presuppositions, and working methods, as with conceptual content, re- 
search achievements, and prospects. It was felt that analysis of this order 
could itself have creative consequences; reflective scrutiny of the extent 
and depth envisaged means rethinking. The primary intent of the dis- 
cussion themes (and indeed the constant aim of all editorial effort) was 
to realize an atmosphere that might invite such emphases. Authors were 
requested to make explicit reference to the themes in their writing only 
to an extent they deemed appropriate or congenial. The use of the themes 
for facilitating the collation of findings was thus a secondary, if still im- 
portant, aim. As matters turned out, most authors adhered to them suffi- 
ciently to give the reader an excellent purchase for the detection of simi- 
larities and differences on key issues. 

The grounds for the selection of the thirty-four formulations included 
in Study I are given in Volume 1 (pp. 21-27). The aim was a reason- 
ably balanced diversification of formulations (as judged by many con- 
sultants) with respect to (a) subject-matter reference, and (b) con- 
ceptual and methodological “type.” Many significant formulations that 
we would have wished to represent in the original list were excluded by 
spatial and other arbitrary restrictions. Nor are all formulations originally 
chosen included in the present volumes. Though the proportion of in- 
clusions is remarkable, there were some individuals who could not partic- 
ipate. We do not, then, claim “representativeness” even in an informal 
and impressionistic sense. We do, however, claim sufficient diversity to 
extend markedly the range of formulations which in recent years have 
been given sustained analytic attention. 

Study II Empirical Substructure and Relations with Other Sci- 
ences. This study seeks increased understanding of the internal structure 
of psychological science and its place in the matrix of scientific activity.
Over forty contributors, having distinguished research backgrounds in 
psychology, or in related biological and social sciences, were invited to 
write papers which examine the organization of empirical knowledge 
within subareas of these disciplines, and which chart their cross con- 
nections. Psychologist contributors consider the relations between their 
own fields of special competence and the rest of psychology, and inquire 
also into relations with relevant segments of other sciences. Social and 
biological science contributors examine the relations between their own 
fields and psychology. 

All authors are individuals whose research interests have bridged con- 
ventionally discriminated fields of knowledge. Each was asked to place 
special emphasis on those “bridging problems” which had been central 
in his own research experience. As in the case of Study I, an attempt was 
made to encourage differentiated and stratified analysis, and to invite a 
convergence of visions on significant cross-cutting issues, by proposing 
certain common themes of analysis. The “themes” for Study II comprise 
a detailed breakdown of the senses in which questions of “mapping” 
subject-matter structure and exploring field interrelations might be 
entertained. 

Though the topography of a science is too vast and labile for com- 
prehensive or final mapping, this very fact makes it more important to 
assay the contours of knowledge as best we can. Study II exploits the 
only resource available for such problems—individual vision—but in a 
novel way. It assumes that a pooling of expert, specialized sensibilities 
can give insight into the emerging structure of a science of a sort not 
ordinarily available. 

A fuller statement of the plan for Study II appears in the General 
Introduction to the Series (Vol. I, pp. 1-18). An adequate account of 
working methods must await publication of the completed study. 

Psychology and the Human Agent. This volume is a postscript to the 
Study, representing certain views formed by the Director in its course. 
The book (a) records those attitudes towards a science and science which 
necessarily color the spirit of the Study, (b) constructs trends from the 
massive findings of the two group studies, and (c) considers, in the light 
of the Study’s premises and apparent trends, certain problems of psycho- 
logical inquiry suggested by the practice of the past several decades. 


In this day in which “self studies” and reductions of enigma by 
seminar are becoming commonplace in social science, it may be helpful 
to mention a few of the special features of Psychology: A Study of a 
Science. May we stress (in random order) the following points: 
1. Both group studies are “collaborative” but only in the special
sense that many creative men agreed to pursue, within the climate of the 
Study, individual tasks of vital interest to themselves. The Study is not 
collaborative or “groupish” in any sense implying an intention to re- 
linquish individuality or even idiosyncrasy to some prissy conception of 
the common weal. 

2. The Study aims for no grand “integration” of knowledge. If a 
“Summa Psychologica” or even a “Synopticon” had seemed even remotely
within reach, our inclination would have been to abstain on principle.
The Study seeks to reflect the diversity of thought that actually
exists and is premised on the values of widespread diversification of
approach as a condition to the health of a science. It conceives of its
contributor groups as pluralities of creative individuals who view those areas
which they know best through the screen of their own expert sensibilities.
The discussion outlines invite the play of individual sensibilities on com- 
mensurable themes, thus helping the reader to collate positions. 

3. The Study’s aims are neither legislative nor evaluative. It rejects 
all monolithic codes for the generation and processing of knowledge, or 
for virtuous scientific conduct. If there is a central bias, it is for the 
loosening of those constraints which can keep men from significant prob- 
lems or thoughts through fear of the unorthodox. 

The Study by no means devalues the insights of recent “science of 
science” but would wish them set in a perspective better adjusted to a 
field barely beginning to test established methods of science on an illimit- 
ably diverse and intricate collection of subject matters. Such a perspective 
can emerge by seeing the end-products of science as everywhere condi- 
tioned by human decision, value, creative option; by freeing from stale- 
ness that truism which holds scientific inquiry to be continuous with 
other human activities. Analysis in science, then, becomes more than a 
succession of routine tasks in the “logic” of science; it becomes an enter- 
prise which can uncover the significance of its objects only by holding in 
view the relations between creative process and sentential product, stra- 
tegic gamble and cognitive outcome. 


For those connected with psychological science, neither the con- 
tributors to the present volume nor the topics will require special intro- 
duction. Six of the formulations herein discussed—those of Lewin 
(treated by Cartwright), Tolman, Guthrie, Miller, Hull-Spence (treated 
by Logan), and Skinner—have long represented major systematic ref- 
erence points in fundamental psychology. Estes’ formulation is perhaps 
outstanding in the recent class of efforts towards a statistical systematiza- 
tion of learning. Harlow and Irion consider the systematic import of 
relatively limited domains of learning data, one of them (“learning sets”) 
a field of investigation only recently opened up, and the other (rote
learning) the oldest area of learning experimentation. Hinde presents what to
date must be a unique analytico-systematic appraisal of the work on ani- 
mal behavior by European ethologists. Finally, Frick and Ellson present 
general analyses of two of the mathematical formulations (information 
theory and linear frequency theory) which have been most heavily drawn 
upon in recent attempts to achieve “mathematical models” of psycho- 
logical and behavioral processes. Be it noted that the two latter formula- 
tions are of methodic import in that they are relational analyses appli- 
cable (at least in principle) to classes of possible event-systems; all other 
formulations considered herein have, in the first instance, substantive 
reference. 

The twelve formulations included have a loose family relationship, 
though one difficult to specify. It is possible to say that, by and large, they 
are “behavioral” formulations, that most give primary or strong emphasis
to problems of learning (and would be called instances of “learning
theory”), that most have explanatory intentions of some generality. But 
these are “by and larges” and “mosts.” It can assuredly be said that the 
formulations represented in this volume have (in their respective ways) 
powerfully determined the horizons of thought in what might be called 
“behavior theory” by some, “basic psychological theory” by others. 

Mention of certain editorial provisions is in order. Readers will find 
the complete statement of discussion themes, as sent to contributors, re- 
produced in the Appendix. There is, of course, variation in the extent to 
which the different presentations adhere to the discussion themes. As an 
aid to readers interested in the detection of key convergences and diver- 
gences of positions, index numbers corresponding to the principal thematic 
items have been inserted, where relevant, in the individual tables of con- 
tents appearing before each paper. The system of index numbers is ex- 
plained in the Appendix. 

A statement of a few trends of Study I, based on the essays in all 
three volumes, appears as an editorial epilogue in Volume 3. The state- 
ment is brief, being restricted to those generalizations which seem so 
obvious as to be almost self-defining. Further discussion of trends will be 
offered in the final volume of the series, Psychology and the Human 
Agent. There is, however, no standard “theory” of the meaning of this 
study. If there were, we would consider the main aim compromised. 
That aim has been to develop materials of unique comprehensiveness 
and depth in terms of which each reader may enrich his own view of 
systematic psychology. The thirty-four essays of Study I can reward ef- 
forts towards secondary analysis and synthesis—whether by student, 
specialized scholar, or general reader—for a long time to come. Let there 
be as many theories of this study as there are readers. 
Psychology: A Study of a Science is the result of a project sponsored
by the American Psychological Association and subsidized by the Na- 
tional Science Foundation. The project was known as “Project A” of the 
“APA Study of the Status and Development of Psychology.” The work 
profited from the counsel of an Advisory Committee consisting of Dael 
Wolfle, Chairman, and Clarence H. Graham, Lyle H. Lanier, Robert B. 
MacLeod, Eliot H. Rodnick, M. Brewster Smith, and Robert L. 
Thorndike. Howard H. Kendler, Lyle H. Lanier, Conrad G. Mueller, 
and Karl E. Zener composed a Panel of Consultants to the Director. The 
generous part played by the members of both groups is described in the 
introductory sections of Volume 1. 


LEWINIAN THEORY AS A CONTEMPORARY SYSTEMATIC FRAMEWORK


DORWIN CARTWRIGHT 
Research Center for Group Dynamics and De- 
partment of Psychology, University of Michigan 
which they follow. See Note on the Use of Discussion Topic Index Numbers in Appendix.
This paper examines the significance of Lewinian theory for
psychologists of today. The theory, as embodied in the publications of 
Lewin, was developed during a period of about thirty years which ended 
with his death in 1947. Its influence has been strong and pervasive, 
not confined to any single branch of psychology nor to those psychologists 
who were personally associated with him. For this reason, my task here 
is a difficult one. I cannot “speak for” those who have been influenced 
by Lewin’s work nor can I presume to guess how Lewin would have 
modified or extended his thought in the light of recent developments. 
This paper, therefore, must be a personal one in which I give my own 
diagnosis of the present condition of psychology and my own view of the 
significance of the general Lewinian approach for the tasks that lie 
ahead.* Since Lewin’s influence has permeated psychological thinking 
for more than a quarter-century and since his own thought was developed 
in a widely shared intellectual climate, it is natural that the point of 
view advocated here will contain many features acceptable to a great 
variety of psychologists. I shall try, however, to emphasize those aspects 
of Lewinian thought which in my opinion represent a distinctive 
orientation. 

It will be useful to begin with a general characterization of the nature 
of Lewin’s contribution to psychology. He referred to his work by various 
names, “dynamic theory,” “topological psychology,” “vector psychology,” 
and “field theory,” but his final preference seems to have been “field 
theory.” What, then, is field theory? Lewin [95, p. 45] answered this 
question most concisely when he asserted that “field theory is probably 
best characterized as a method: namely, a method of analyzing causal 
relations and of building scientific constructs.” In other words, it is a set 
of beliefs about the proper way to build empirical theories; it is, in 
current terminology, a metatheory. But this is not the entire story, for
Lewin’s impact on psychology derives from more than his metatheoretical 
contributions. His influence has been exerted also through certain con- 
structs which he introduced. The current language of psychologists 
contains many terms due to Lewin: life space, psychological force,
valence, tension, level of aspiration, time perspective, and others. In
this paper we shall be concerned with field theory both as a theory 


* The reader is referred to three other critical expositions of Lewinian theory.
The first, by Leeper [85], is a detailed analysis of Lewin’s topological and vector 
concepts. The second, by Deutsch [40], discusses the significance of Lewinian theory 
for social psychology. The third, by Escalona [44], reports especially certain recent 
developments in work with children. Although I cannot agree with every detail 
of these three treatments, I can recommend them highly to anyone wishing to 
understand the nature of Lewin’s contribution to psychology. Because these three 
discussions together provide a comprehensive treatment, this paper will necessarily 
cover many of the points contained in them. 
about theorizing and as a set of constructs for describing psychological
phenomena. 

Because of the relatively heavy emphasis in Lewinian theory on meta- 
theoretical and programmatic problems, I have not found it possible 
in this chapter to conform strictly to the outline provided by the editorial 
committee. Where it has seemed necessary to deviate, a footnote indicates 
the relation of the outline followed here to that recommended by the 
committee.


BACKGROUND FACTORS AND ORIENTING ATTITUDES 


What is the task of psychology? Most psychologists would agree that 
the task of psychology, stated in general terms, is to understand, explain, 
or predict human behavior. If one does not press too hard concerning 
the precise meaning of such terms as “behavior,” “understand,” or 
“predict,” there appears to be good agreement as to the goals of 
psychology. If, however, one examines how particular psychologists go 
about their work and attempts to infer from this what the actual goals 
are, then agreement is not so evident.

Contemporary psychological thought is heavily determined by the 
fact that psychology gained its independence from philosophy only in 
the last half of the nineteenth century. The right to independence, the 
justification for separate status, was the psychologist’s ability to employ 
experimentation and measurement. These two marks of “science” served 
as indicators of respectability among psychologists. “If you can’t manipu- 
late it experimentally or at least measure it, you shouldn’t talk about it” 
became the psychologist’s creed. Sensation, perception, learning (as a 
modification of observed behavior), and ability (defined as a test score), 
by meeting these requirements reasonably well, threatened for a time 
to become the field of psychology. Such a narrow designation did not
achieve full acceptance only because of the persistence of a “lunatic 
fringe” who insisted that human experience and behavior are “richer” 
than this list of topics would suggest, that man loves, hates, is driven 
by anxiety, holds doggedly to prejudices, fights and dies for abstract ideas, 
creates art, writes poetry, and responds deeply to religion. Although 
respectable psychologists have come more and more over the years to 
recognize the possibility of measuring such “fuzzy” phenomena and of
doing experiments upon them, conflict still exists between those who 
would be true to human experience and those who would be “scientific” 
above all else, and tension remains within those who attempt both. 
This tension is symptomatic of some of the most basic issues of present- 
day psychology concerning its task in the immediate future.

For Lewin, and those who accept his orientation, the most important 
task of psychology is to devise ways of treating the full empirical reality
of human experience and behavior in a scientific manner without doing 
violence to them. If “hope” or “frustration” cannot now be measured
precisely, they should not for this reason be exiled to the world of 
the novelist. Nor should they be tortuously “explained” in terms of 
some inappropriate theory simply because the theory is quantitative or 
formally elegant. Rather, research tools and concepts appropriate to the 
degree of available knowledge should be employed, and any step toward 
extending the legitimate domain of psychology should be cause for 
gratification. The major contributions of a Freud or a Lewin consist in 
making psychology better able to deal with such things as dreams, 
aspirations, frustration, social climates, and other undeniable, though 
“messy,” facts of life. 

This preference for what Robert MacLeod once called “sloppy 
validity over neat reliability” is not, of course, held only by field 
theorists, nor does it characterize the essential features of field theory, 
but it may help account for the complaint of Estes [45] that field 
theorists have contributed little to the limited domain of psychological 
activity referred to as “behavior theory.” Field theory, together with 
gestalt psychology more generally, developed as a revolt against the 
prevailing tendency to prejudge the nature of psychological phenomena 
by imposing a priori dicta concerning the properties they must have.
Neither commitment to a theoretical belief (e.g., that perception must 
be composed of elements of sensation) nor allegiance to a methodological 
principle (e.g., that all data must be scalable) should become so im- 
portant to the psychologist that he blinds himself to the proper subject 
matter of psychology. It is the full understanding of this rich, com- 
plicated, and often baffling subject matter that is the ultimate goal of 
psychology. Every theoretical formulation and every method of research 
must be evaluated in terms of its contribution toward achieving this goal. 
The ideal that psychology as a science should consist of precisely 
quantified data organized into a completely coherent and logically 
rigorous system of theory is indeed attractive. But one may seriously 
question whether this ideal will best be realized by insisting at every 
stage of research that rigor and precision be accorded the highest priority. 

Open-minded observation. Discussions of psychological theory tend to 
concentrate nowadays on the logical and semantic aspects of the symbols 
that constitute a theory. And, for some reason, the derivation of Boyle’s 
law from the molecular theory of gases inevitably enters as the ideal 
to which all psychologists should aspire. I do not wish to disparage con- 
cern for the logical properties of conceptual systems (field theorists are 
greatly interested in such matters) nor to depreciate the elegance or 
practical utility of the molecular theory of gases, but I am greatly afraid 
that the contemporary interest of psychologists in formal theory has
tended too much to minimize the important task of observation. Barker 
and Wright [13] have recently voiced this same fear when they assert 
that psychology is less interested than other sciences in the painstaking 
observation of phenomena in their natural state. 


Although we have daily records of the behavior of volcanoes, of the 
tides, of sun spots, and of rats and monkeys, there have been few scientific 
records of how a human mother cared for her young, how a particular 
teacher behaved in the classroom and how the children responded, what a 
family actually did and said during a mealtime, or how any boy lived 
his life from the time he awoke in the morning until he went to sleep 
at night [13, p. 2]. 

The approach to psychological research advocated here is hardly 
debatable in principle; everyone favors empirical data. And yet it is 
remarkable what low status is given, by American psychologists at 
least, to a “merely phenomenological analysis” and how great a pressure 
there is to reject “clinical insight” as a proper aid to scientific psychology.
The high status accorded formal rigor, experimental control, and precise 
measurement threatens to blind psychology to reality as it actually exists. 
The great activity of recent years, to cite one example, devoted to 
measuring “empathic ability” has produced indecisive conclusions not 
merely because of the logical flaws of measurement pointed out so ably 
by Gage and Cronbach [64] but perhaps more importantly because of 
the neglect of such careful phenomenological analyses as that of Scheler 
[121] concerning the nature of sympathy. One reason that so much 
of psychological theory is “impractical,” and thus one source of the un- 
fortunate gulf between pure and applied psychology, is the tendency 
of psychological theorists to select for investigation only those variables 
which lend themselves immediately to the niceties of formal and 
methodological manipulation. Psychological theorists must not forget 
the importance of maintaining an open-minded look at what people 
actually think, feel, and do and at the settings or situations in which 
events take place. Any theory will be judged ultimately by how well it 
copes with descriptive data of this sort. 

In order to gain a balanced view of Lewin’s basic orientation, it is 
important to recall not only his concern for mathematical models and 
formal systems in such works as Principles of Topological Psychology and 
The Conceptual Representation and Measurement of Psychological 
Forces but also the great value he placed upon the movies he took of a 
single child trying to sit on a stone or to walk up stairs for the first time 
and upon such richly descriptive studies as that of Dembo [37] on anger. 
It is significant that, although Lewin was disturbed by the dominance 
of “dust-bowl empiricism” in American psychology during much of his 
life, he felt constrained time after time, in presenting his topological
concepts, to point out that these were intended to be aids to description. 
“In presenting them we are not promulgating a new ‘system’ limited to 
a specific content, but rather we are describing a ‘tool,’ a set of concepts 
by means of which one can represent psychological reality” [91, p. 6]. 
In discussing how his objectives compared to those of Hull he developed 
this point further: “Psychology needs stressing the formalistic. Yet, it
would soon prove most unfortunate if one should lose sight of the fact 
that the main purpose of psychological theories is, after all, to explain 
reality. In psychology, for a long time to come, the richness and fruit- 
fulness of theories should not be judged by their formalistic perfection 
alone” [91, p. 22]. 

Description and explanation. One could easily caricature the position 
advanced here by saying that it would have psychologists act like a 
court reporter whose assignment is simply to get down on paper every- 
thing that is said or done. This would be a caricature of the intended 
point of view because the essential argument is not that the psychologist’s 
task consists of observation but that psychological theory should begin 
with observations of naturally occurring behavior and should be evaluated
in terms of how adequately it helps us understand such behavior. For
observation to be of any use to the psychologist it must be
converted into a description. And the essential difference between a
psychologist and a reporter or novelist lies in the kind of description 
provided. 

One of the basic paradoxes of science is that, whereas observation 
provides the ultimate test of a scientific theory, the language of 
description employed by the scientist influences his observations. The
very fact of having a descriptive concept available serves to sensitize 
the observer to certain phenomena; it helps him to see what is there. 
At the same time, an “incorrect” concept may lead him to give 
a distorted picture of actual events. For this reason, many advances in 
psychology consist of the creation of new concepts, and much of the 
difficulty in making progress stems from an uncritical acceptance of the 
language generally employed by psychologists. 

The psychologist, then, is properly concerned with the language of 
descriptions that he uses. To him any description should be “two-faced,” 
looking simultaneously to the world of data and to that of concepts. It 
is this double-directedness of description that allows concepts both to 
describe and to explain. A “mere description” consists of a set of 
concepts which refer primarily to data and only loosely to a system 
of concepts. If, however, it can be shown that certain events “must” 
occur because the corresponding concepts are related in a particular way, 
then these events have been explained. 
Lewin stated this point of view as follows:


The task of dynamic psychology is to find the psychological laws and 
to represent the situation in such a way that the actual events can be derived 
from it in a conceptually univocal manner. If the objection is raised that it 
is self-evident that the events follow from these representations of situations, 
one has to answer that this is exactly the purpose. What we are trying to do 
is to represent situations in such a way that the events follow from them 
“self-evidently,” namely as purely logical consequences. 

If one wants to call this “description,” it is not worth while to quibble 
over words. But if one considers conceptual derivation and the transition 
from phenomenal to dynamic facts as the characteristics of an explanation, 
then what we have here is in fact explanation [91, p. 82]. 


In this connection it may be helpful to consider briefly the accusa- 
tion that field theorists cannot predict but can only account for psy- 
chological events after they have happened. This criticism may have 
two different meanings. The criticism may mean that field theoretical 
explanations are “contained in” the description of particular situations. 
The answer then consists of maintaining that it is not necessary to in- 
voke causes outside the situation to explain it. An adequate description 
of a situation together with a knowledge of the relevant psychological 
laws permits one to derive the events immediately brought about by the 
situation. Thus, to give an oversimplified example, if one knows that the 
need tension of an individual is at a certain level and if one knows the 
law relating tension to tendencies to action, then it is possible to derive 
(in this sense, predict) that certain action will occur. The criticism may, 
on the other hand, imply that the laws of psychology are not sufficiently 
developed and the techniques of measurement and observation are not 
now adequate to allow absolutely certain derivations from the de- 
scription of one concrete situation to the properties of a subsequent one. 
The answer to this form of the criticism can only be that field theorists 
are in no different status, in principle, than other theorists. 

Language of description. Since the description of any concrete situ- 
ation consists of a symbolic representation of it, one must be concerned 
with the choice and manipulation of symbols. What language should 
psychologists employ? The answer to this question is a point at which 
field theorists part company with many other psychologists. To the field 
theorist it appears that psychology should describe events in a psycho- 
logical, rather than physical or physiological, language. Without deny- 
ing the possibility of an eventual unification of science so that a single 
language may be used, the field theorist asserts that insistence upon 
using the language of physics or physiology at this time imposes harm- 
ful restrictions upon accurate description of psychological events. 

Lewin believed that use of the language of physics or of physiology 
consists actually of employing an inappropriate model for representing
psychological events. His discussion of this problem may seem strange 
to those who have categorized Lewin as a “model builder.” 


We have intentionally avoided the use of any model of a physical or of a 
nonphysical nature for the explanation of psychological dynamics. 

Like an illustration the working out of a model can have a certain 
value. On the other hand it can, especially in psychology, involve serious 
dangers: a model usually contains much that is purely arbitrary. One usually 
uses it like an illustration only in so far as the analogy holds, i.e., really 
only as long as it is convenient. As soon as consequences ensue which do not 
agree with the real facts, one evades the difficulty by asserting that it is after 
all only a model or an illustration. . . . In this respect model and illustra- 
tion are sharply distinguished from the mathematical representation which 
we are trying to attain. If one decides to represent a real fact by a 
mathematical concept then one is forced to acknowledge all the consequences 
which are involved in this concept. . . . 

One must welcome every attempt to go beyond vague ideas to concrete 
formulations in psychological dynamics. Often the psychologists who take 
this task seriously have been driven to attempt to make “physiological” 
theories. I do not consider that tendency fortunate. Certainly one cannot
object in the least to applying physiological methods and to including in the 
theory material so obtained. . . . But the so-called physiological theories 
which are based on psychological facts have almost always the character of 
a physiological or rather physical model of which the same is to be said 
as of the other models. With a physiological model, too, the relationships 
with which one is concerned are not expressed directly in concepts but only 
indirectly in illustration, and these often include superfluous specializations. 
The task of scientific research is, however, to determine the dynamic 
characteristics of the facts themselves [91, pp. 77-80]. 


A much more recent analysis of hypothetical constructs by Roze- 
boom [119] places this point of view in the context of current dis- 
cussions of “mediation variables.” 


Neurophysiological content is not only at present inadequate for be- 
havioral HCs (hypothetical constructs), but also basically irrelevant. Pre- 
suming the switchboard theory of neural action to be true, we could in 
principle replace each neurone with a mechanical or electronic device with 
the same connections and input-output characteristics as the replaced cell 
without affecting the validity of any behavioral law. Since neurophysiological 
attributes are thus not essential for a physical behavior system, a set of HCs 
need possess no neurophysiological content in order to describe the necessary
properties of the causal mediators underlying behavior. . . . I suggest, 
therefore, (a) that despite the complexity of the central nervous system, 
those of its properties which are essentially involved in the causal deter- 
mination of the dependent variables of behavioral psychology may possibly 
be reflected by a relatively small number of HCs without neurophysiological 
reference; and (b) that the necessary ontological content of such HCs need
consist only of existential quantification (the logical operator “∃x”) and
relational predicates [119, p. 262]. 


Even in describing the environment of a person a physical descrip- 
tion is not sufficient. Although no one would deny that the environment 
is mediated by physical processes, there is real question as to the present 
possibility of describing this environment in the language of physics, 
employing only such terms as electromagnetic field, molecule, ion, or 
centimeter, gram, and second. Cassirer has stated the problem in the 
following terms: 


No longer in a merely physical universe, man lives in a symbolic universe. 
Language, myth, art, and religion are parts of this universe. They are the 
varied threads which weave the symbolic net, the tangled web of human 
experience. All human progress in thought and experience refines upon and 
strengthens this net. No longer can man confront reality immediately; he 
cannot see it, as it were, face to face. Physical reality seems to recede in 
proportion as man’s symbolic activity advances. Instead of dealing with 
things themselves man is in a sense constantly conversing with himself. He 
has so enveloped himself in linguistic forms, in artistic images, in mythical 
symbols or religious rites that he cannot see or know anything except by 
the inter-position of this artificial medium. His situation is the same in the 
theoretical as in the practical sphere. Even here man does not live in a 
world of hard facts, or according to immediate needs and desires. He lives 
rather in the midst of imaginary emotions, in hopes and fears, in illusions 
and disillusions, in his fantasies and dreams. “What disturbs and alarms
man,” said Epictetus, “are not the things, but his opinions and fancies about
the things” [31, p. 43]. 


A proper respect for the nature of psychological phenomena places 
upon psychologists the task of constructing a language adequate to 
represent these phenomena. Presuppositions about the ultimate unity 
of science or about the biological nature of man should not be allowed 
to force an arbitrary descriptive language upon psychology. 

Concreteness. One final orienting attitude of field theorists deserves 
careful examination here. An important value to be gained from in- 
sistence on a painstaking observation of natural events is the stress it 
continually places upon concreteness. As Lewin has pointed out: 


Only what is concrete can have effects. This proposition may seem 
obvious. But one often ignores it in explaining an event by development, by 
adaptation . . . by an abstract drive, and in treating these principles as 
concrete causes. . . . These fallacies arise in part from a confusion be- 
tween the law that governs the effects of certain concrete events and these 
events themselves. Effects can be produced only by what is “concrete,” i.e.,
by something that has the position of an individual fact which exists
at a certain moment [91, pp. 32-33]. 


Once this principle is accepted, Lewin believed, four important 
consequences follow. (1) Every specific instance of behavior must be 
viewed as the result of the interaction of several features of a given 
concrete situation. (2) The description of human behavior cannot, 
without peril, concentrate exclusively on cognition, or learning, or 
motivation, or personality, or social influences, or economics, or politics, 
or culture. (3) A distinction must be made between the concrete field,
or life space, in which behavior takes place and the phase space, or 
property space, which may be employed to represent quantitative rela- 
tions among properties. (4) Causation must be viewed as contemporary 
with the event caused; “since neither the past nor the future exists at 
the present moment it cannot have effects at the present” [91, p. 35]. 

1. Lewinian field theory and gestalt psychology both emphasize the 
importance of recognizing the interdependence of parts of any concrete 
situation. The classical demonstrations of gestalt phenomena in percep- 
tion illustrate the point adequately. A sensitive analysis of how a given 
child learns in a classroom will also reveal the interdependence of such 
aspects of the situation as state of tension, level of aspiration, time per- 
spective, relations with teacher and other students, social pressures 
from family, and even the family’s relations with other families within 
the community. Stating the point as an abstract principle, Lewin said, 
“only by the concrete whole which comprises the object and the situa- 
tion are the vectors which determine the dynamics of the event defined” 
[90, p. 29]. 

2. If behavior is determined by the concrete total situation in which 
it exists, then any theory which attempts to account for behavior solely 
in terms of learning, or motivation, or any other limited aspect of the 
total complex of determinants cannot be adequate. It is true that no com- 
prehensive theory of human behavior has yet been developed by field 
theorists (or anyone else), but field theorists attempt to formulate their 
theories in such a way that a representation of the total situation is 
possible in principle. 

Lewin’s own professional career illustrates how a commitment to 
represent faithfully the concrete situation requires one to cross the 
boundaries of the traditionally designated “fields of psychology” and 
even of the academic “disciplines.” Lewin’s early research dealt with 
learning and perception [87, 88, 89, 99] but this led him to see the in- 
evitable interplay between these processes and motivation, as illustrated 
for example in the Zeigarnik effect [142]. Investigation of these moti- 
vational influences led in turn to an analysis of reward and punishment 
[90, pp. 114-170] and of conflict [90, pp. 66-113], both of which required
inclusion of motivational forces “induced” by other people. Once
the importance of these social influences was recognized, it became ap- 
propriate to examine how one person, e.g., a group leader, uses his 
ability to induce forces on another person to help or restrict him. Closer 
scrutiny of these aspects of the situation then led to the studies of 
“types” of leadership and of social atmospheres [94, pp. 71-83, 98]. It 
led also to studies of the constellation of forces determined by group 
membership [94, pp. 84-102, 145-158], and by the values of groups 
[94, pp. 56-68]. Finally, it led to an examination of the restraints and 
compulsions exerted by technology, economics, laws, and social “channels”
and “gatekeepers” [95, pp. 170-187]. The important feature of
this whole development is that the desire to represent faithfully the 
full complexity of concrete situations resulted in a continual crossing 
of the traditional boundaries of the social sciences rather than a pro- 
gressive narrowing of attention to a limited number of variables. This 
insistence upon the complexity of the determinants of any concrete 
instance of behavior makes the task of building a true “behavior the- 
ory” extremely difficult. The field theorist believes, however, that the- 
ories which do not take these complexities into account are bound to 
be not merely limited, or miniature systems, but actually incorrect. 

3. The third consequence of the principle that “only what is con- 
crete can have effects” is that a distinction must be made between “life 
space” and “phase space.” Although this distinction was never exten- 
sively discussed by Lewin and the full ramifications of it for psychology 
have never been worked out, I believe that in it lies a clarification of 
the field theorist’s treatment of interdependence. Lewin’s brief discussion 
of the problem may be illuminating. 


Success in a certain sport may depend upon a combination of muscular 
strength, velocity of movement, ability to make quick decisions, and precise 
perception of direction and distance. A change in any one of these five 
variables might alter the result to a certain degree. One can represent these 
variables as five dimensions of a diagram. The resultant of any possible 
constellation of these factors for the amount of success can be marked as a 
point in the diagram. The totality of these points then is a diagrammatic 
representation of this dependence, in other words, of an empirical law. 

Physics frequently makes use of such representation of a multitude of 
factors influencing an event. To each of certain properties, such as temperature,
pressure, time, spatial position, one dimension is coordinated. Such
a representation in physics is called “phase space.” Such a phase space 
may have twenty dimensions if twenty factors have to be considered. A 
phase space is something definitely different from that three-dimensional 
“physical space” within which physical objects are moving. In the same way 
the psychological space, the life space or psychological field, in which 
psychological locomotion or structural changes take place, is something different
from those diagrams where dimensions mean merely gradations of
properties [95, p. 44]. 


The life space is intended to represent the concrete situation whose 
interdependent properties determine a specific psychological event. A 
phase space may be constructed to express the laws governing the prop- 
erties of life spaces. Thus, in an experiment like that of Zeigarnik [142] 
a given act of recall is conceived as being determined by the resultant 
of the psychological forces acting on the person at a given time. The law 
suggested by Zeigarnik’s findings asserts that the magnitude of the psychological
force toward recalling a particular activity is some positive
function of the magnitude of inner-personal tension associated with the 
activity. This law may be represented by means of a two-dimensional 
phase space; the concrete situation in which a specific subject exists is 
represented by the subject's life space. The precise form of the law will 
depend upon the nature of interdependence among properties of concrete
life spaces. Field theorists are completely in favor of such laws
and of the use of phase spaces for their representation, but they believe
that the close dependence of such laws upon the properties of the life 
space has been insufficiently recognized. 
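
The distinction may be illustrated by a brief sketch in Python. It is offered only as an aid to the reader and forms no part of Lewin's own formulation: the class `LifeSpace`, the functions `recall_force` and `phase_space_points`, the linear form given to the law, and the constant `k` are all assumptions introduced here. Each life space records one concrete situation; the phase space merely plots two properties against each other across many such situations.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LifeSpace:
    """One concrete situation for one subject at one moment (hypothetical fields)."""
    subject: str
    activity: str
    tension: float  # inner-personal tension associated with the activity


def recall_force(tension: float, k: float = 1.0) -> float:
    """Assumed law: the force toward recall is a positive function of tension.

    The linear form k * tension is chosen purely for illustration.
    """
    if tension < 0:
        raise ValueError("tension must be non-negative")
    return k * tension


def phase_space_points(life_spaces):
    """Project concrete life spaces onto the two-dimensional (tension, force) plane."""
    return [(ls.tension, recall_force(ls.tension)) for ls in life_spaces]


# Two concrete situations for the same subject (invented values).
situations = [
    LifeSpace("subject A", "interrupted task", tension=2.0),
    LifeSpace("subject A", "completed task", tension=0.5),
]
points = phase_space_points(situations)
print(points)
```

The law lives in `recall_force`, while the facts of each situation live in the `LifeSpace` records; changing what a life space contains would change which points can appear in the phase space at all, which is the dependence the text emphasizes.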

4. The principle of contemporaneity of causation raises basic issues 
concerning the meaning of “cause” and the way in which “time” is 
to be treated in science. No attempt will be made here to present a 
comprehensive analysis of these thorny problems; the reader is referred 
to the extended discussion given by Lewin [95, pp. 43-59]. It is suffi- 
cient to note that field theorists are content, in attempting to account 
for the occurrence of a concrete event, to describe the “here and now” 
and to show how the occurrence of the event is required by the nature 
of the situation. Asked to account for “why” an individual does some- 
thing at a particular time, the field theorist describes the situation in 
which the individual exists at that time. He does not describe some state 
of affairs in the individual’s past. This is not to deny that “residues” of 
the past may exist at a later time and combine, together with other 
contemporary influences, to make up the complex of determinants of 
behavior. But it is these residues, rather than the original events, which 
exert influence on later behavior. 

One fact has persistently added confusion to the psychologist’s 
thinking about the proper treatment of time in psychological theory: 
human beings are aware of time. As Lewin noted, the cognitive struc- 
ture of an individual has a time dimension, extending from the “psycho- 
logical past” into the “psychological future.” An individual’s behavior 
is oriented to both the future and the past as they exist for him at any 
given time. He remembers, for example, that he failed at some undertaking
in the past and expects to succeed when he tries the next time.
The principle of contemporaneity of causation asserts that both the 
“expectation” and the “memory” exist at the moment they exert their 
influence on behavior and that the exertion of such an influence demonstrates
causation neither from the future nor from the past.

An illustration of the practical importance of this distinction between 
“ahistorical” and “genetic” explanations of behavior is provided by 
Ezriel [46] in his discussion of experimentation and psychoanalysis. It 
has commonly been assumed, he points out, that psychoanalysis employs 
only genetic explanations of behavior; Freud compared the analyst with 
an archaeologist who “digs up” the patient’s past. But Freud’s treatment 
of two basic phenomena, “psychical reality” and “transference,” actually 
requires use, as Ezriel shows, of ahistorical causation. 


By psychical reality Freud meant the fact that an apparent “memory” 
uncovered in psycho-analysis often turned out to be a phantasy, but one 
which was nevertheless psychologically as effective as if it had been a 
memory of a real event. This meant that what the analyst was uncovering 
was not a replica of actual events in the patient's past life, but unconscious 
structures active in the present though formed in the past out of both 
phantasies and correct or distorted memories of actual events [46, p. 32].


Ezriel’s discussion of transference develops the same general point. 


While patient and analyst had ostensibly started their work like two 
friendly archaeologists trying to dig up the patient’s past, they were in fact 
two human beings interacting with one another, and the patient attributed 
to the analyst various roles—friend or foe, victim or persecutor, and many 
other things—according to his temporarily dominant unconscious phantasies 
in the particular session. 

There is thus no difference between analysing the content of the ma- 
terial produced by the patient, and eliciting new material by analysing the 
content of the patient’s resistance against communicating it. Both consist 
in analysing his “here and now” relations with the analyst in which the 
patient’s unconscious structures (the precipitates of unresolved conflicts 
with persons of his childhood environment) manifest themselves [46, p. 32]. 


Ezriel concludes that the traditional adherence by psychoanalysts to 
genetic, rather than ahistorical, explanations of behavior has had a 
deleterious effect on research. 


I think the main reason that no systematic experimental work has been 
attempted in the psycho-analytic situation is that analysts have not yet 
accepted the full implications of transference and psychical reality. This 
seems to be largely due to a belief that only genetic propositions can explain 
differences between personalities and hence why various people respond
differently to the same external situation [46, pp. 34-35]. 


Summary. In this section I have tried to indicate the major basic 
assumptions which have guided the work of field theorists along the 
paths that give it a distinctive character. Although the differences
between field theorists and others are concerned largely with matters of
emphasis, priority, and tactics, these basic assumptions exert a
profound influence upon the ways in which the work of psychologists
proceeds.

For field theorists the most important task of psychology is to devise 
ways of treating the full empirical reality of human experience and be- 
havior in a scientific manner without doing violence to them. Accept- 
ance of this task places a responsibility on the psychologist to keep in 
constant touch with human behavior as it is found in natural situations.
Open-minded observation should guide theorizing and serve as a check 
upon theory. But observations must be converted into descriptions, and 
the development of an adequate language of description poses one of 
the most difficult problems for psychology. The concepts employed must 
have a double-directedness, to phenomena and to other concepts. An 
“explanation” consists of describing concrete situations in such a way 
that it can be shown from the system of concepts that certain events 
“must” occur. 

The language of psychologists should be psychological. Although 
there may ultimately develop a way to describe the physical world, 
physiological processes, and psychological events in a single language, 
psychology will be deterred in fulfilling its basic task if it assumes that 
it must describe human experience in physical or physiological terms. 

The principle that “only what is concrete can have effects” leads 
to four important consequences for the student of human behavior. (1) 
Every instance of behavior is the result of the interaction of several 
features of a given concrete situation. (2) The description of human 
behavior requires inclusion of phenomena traditionally classified into 
various fields of psychology and academic disciplines. (3) A clear 
distinction must be made between the concrete life space in which be- 
havior takes place and the phase space employed to represent quanti- 
tative relations among properties. (4) Causation must be viewed as 
contemporary with the event caused. 


STRUCTURE OF SYSTEM, AN OVERVIEW 


An adequate description of any concrete psychological situation re- 
quires use of a language of constructs having both clear 
operational definitions and unambiguous conceptual properties. Lewin
contributed much to the building of such a language for psychology. 
He believed that the difficult task of achieving satisfactory concepts 
lies at the very center of the scientific enterprise (as we shall see in the 
later discussion of the “method of construction”). He did not believe, 
however, that it is wise to create a single, formally elegant system of 
concepts apart from the slow accumulation of empirical data. In the 
earlier stages of a science, before the formal structure of its theory is 
highly developed, it is important to use as aids to description many 
“quasi constructs” having suggestive, rather than rigorous, relations to 
other concepts. As progress is made, these terms will be gradually re- 
placed by ones more tightly tied together into a formal system. Lewin 
both introduced quasi constructs and attempted to reduce the need 
for them by evolving a coherent system of constructs. 

In this section we shall consider some of the more fundamental of 
these constructs, examine briefly how they are used by field theorists, 
and explore some of their interrelations. This treatment must
necessarily be only an overview.

Life space. The most fundamental, as well as the most widely
known, of the Lewinian concepts is that of life space. It refers to the
totality of facts which determine the behavior of an individual at a
certain moment. The essential features of this concept are described by
Lewin in the following statements.


One can say that behavior and development depend on the state of 
the person and his environment, B = F(P, E). In this equation the person
(P) and his environment (E) have to be viewed as variables which are
mutually dependent upon each other. In other words, to understand or
to predict behavior, the person and his environment have to be
considered as one constellation of interdependent factors. We call the totality
of these factors the life space (LSp) of that individual, and write
B = F(P, E) = F(LSp). The life space, therefore, includes both the person
and his psychological environment. The task of explaining behavior then
becomes identical with (1) finding a scientific representation of the life
space (LSp) and (2) determining the function (F) which links the
behavior to the life space. The function (F) is what one usually calls a law
[95, pp. 239-240]. 


In this discussion it is important to note that the term “environ- 
ment” is not strictly an “independent” variable; it both influences the 
“person” and is influenced by it. It is “psychological” and represents 
the world as it exists for the individual in question. It is important, too, 
to realize that Lewin used the term “behavior” in a very broad way 
to refer to changes in the life space. Thus, behavior might be a change
of location of the person in his environment, a cognitive reorganization
of the environment, or a restructuring of the person.*

Two aspects of the life space need to be distinguished, though they 
are empirically interdependent. These refer to its structural and dy- 
namic properties. Structure is the arrangement of the parts of the life 
space. The basic concepts of structure are region and boundary. With 
these it is possible to build up more complex concepts to describe such 
structural properties as degree of differentiation, centrality, path, and 
distance. Dynamic concepts deal with the tendencies of the life space
to change, or to resist change. The principal dynamic concepts are 
interdependence, tension, force, field of forces, equilibrium, and power. 
Both the structural and the dynamic properties of the concrete situa- 
tion must be represented in order to understand any specific behavior. 

One further feature of the life space requires consideration, al- 
though its formal treatment was not extensively developed by Lewin. 
As noted above, the life space refers to the totality of facts which de- 
termine the behavior of an individual at a certain point in time. In one 
sense these facts are “real” and exist only at the specified time. It must 
be noted, however, that distinctions may be made among them accord- 
ing to their degree of psychological reality-irreality (for example, facts 
vs. phantasies or wishes vs. expectations) and according to their psy- 
chological time-location (one’s view of the future, present, and past). 
These properties of the life space are called “level of reality” and “time 
perspective.” Lewin suggested that they could ultimately be treated as 
dimensions of the life space. 

Several of the early empirical studies initiated by Lewin were con- 
cerned with levels of reality. From this work he drew the following 
generalizations: 


One of the most important dynamic differences between different degrees 
of reality is the greater fluidity of the more irreal levels. This greater fluidity 
shows itself in different facts: (1) Barriers in the environment offer
relatively little resistance (one can do what one wants to do in irreality); (2)
the boundaries of environmental regions can be shifted more easily and are 
less definitely determined . . . ; (3) a diffuse discharge of an inner-personal 
tension system occurs more quickly . . . ; (4) the boundaries between per- 
son and environment are less clear and the structure of the environment 
depends to a greater extent on the needs of the person [91, pp. 199-200]. 


*In this section I am attempting to present the major concepts as they are
generally used, without considering at this point any ambiguities or difficulties
they may have. A more critical evaluation of the concepts life space, person,
environment, and behavior is presented in the later section concerning the formal
organization of the system.

The way in which time perspective was treated by Lewin may be
indicated in the following quotation: 


The time perspective existing at a given time has been shown to be very 
important for many problems such as the level of aspiration, the mood, the 
constructiveness, and the initiative of the individual. Farber [48] has shown, 
for instance, that the amount of suffering of a prisoner depends more on his 
expectation in regard to his release, which may be five years ahead, than 
on the pleasantness or unpleasantness of his present occupation [95, pp. 
53-54]. 


Practically everyone of consequence in the history of humanity—in 
religion, politics, or science—has been dominated by a time perspective 
which has reached out far into future generations, and which frequently 
was based on an awareness of an equally long past. But a large time 
perspective is not peculiar to great men. A hundred and thirty billion dollars 
of life insurance in force in the United States offer an impressive bit of 
evidence for the degree to which a relatively distant psychological future, 
not connected with the well-being of one’s own person, affects the everyday 
life of the average citizen [94, p. 105]. 


Lewin believed that some of the most significant features of the de- 
velopment of an individual have to do with both time perspective and 
levels of reality. 


During development, an enlargement of the time perspective takes place. 
The small child lives in the present; his time perspective includes only the 
immediate past and the immediate future. This smallness of time perspec- 
tive is characteristic of what is usually called “primitive behavior.” The 
time dimension of the life space of the child grows with increasing age; 
more and more distant future and past events affect present behavior. 

Normal development brings with it, in addition, an increased differentia- 
tion in the reality-irreality dimensions of the life space. The young child 
does not clearly distinguish wishes from facts, hopes from expectations. The 
older person is said to be better able to distinguish between daydream wishes 
and reality, although wishful thinking is certainly very common in adults, 
too [95, p. 75]. 


Psychological environment. Region. The psychological environment
of the life space is composed of regions surrounded by boundaries.* If
two regions have a boundary arc in common, they are said to be
neighboring. If two regions have a common part, they are said to be
overlapping. The subparts or subregions of a region are regions lying entirely
within that region. The degree of differentiation of a region refers to the
number of its subparts. The position of a subregion is characterized by
the region which includes it.

* In this discussion I shall not attempt to give the formal topological definitions
of such terms as boundary and region. The “definitions” stated here are intended,
rather, to indicate the intuitive meaning of these concepts.

The concept of region is coordinated to (1) activity, such as “eat- 
ing,” “going to the movies,” “making a decision,” or “working on a 
task”; (2) passive state, such as “being fired from a job,” “being ad- 
mired,” or “being rewarded”; (3) social entities, including such groups 
as “family,” “church,” “company,” such subparts of groups as the role 
of “father,” “bishop,” or “foreman,” and individual people; (4) the 
person whose life space is being represented. This list is not exhaustive. 
For certain purposes Lewin spoke of the “quasi-physical environment” 
which corresponds to the individual’s perception of the physical world. 
This environment is composed of parts of the geographical world, such 
as “room,” “street,” or “town,” and objects, such as “toy,” “candy,” 
or “car.” It seems promising also to deal in a similar fashion with an 
individual’s concepts so that a region is coordinated to such things as 
“democracy,” “contemporary painting,” or “equal length” [22, 141]. 
However, the exact way in which the quasi-physical environment and 
an individual’s concepts are to be related to the other meanings of 
region has not been satisfactorily worked out. 

For each type of coordination it is possible to specify a meaning for 
the representation of the person as being located in a region. Thus, if
the person is located in the region “going to the movies,” this means 
that the person is engaging in this activity. Similarly, if he is located in 
a passive state region, this state characterizes him. And, if he is located 
in the region “church,” he has membership in that church, or if he is 
located in the region “bishop,” he occupies that office. Needless to say, 
a person may be located in more than one region at a time; he may 
go to the movies while being admired and occupying the office of 
bishop. In any given empirical situation it may be actually impossible, 
of course, to be located simultaneously in certain combinations of 
regions, such as occupying the office of bishop and being a member of 
a gang of thieves. 

Locomotion. Locomotion is said to occur if the person changes his 
location. It should be apparent that locomotion is not identical with 
physical movement of a biological organism; an individual may change 
his group memberships, for example, without changing his geographical 
location. 

Cognitive structure. The arrangement of the regions of an individual’s 
psychological environment is called his cognitive structure. If two re- 
gions are neighboring, it is possible for locomotion to take place so that 
the person goes from one to the other without passing through a third 
region. A change of structure takes place if the arrangement of regions 
changes or if the environment changes its degree of differentiation. Such
restructuring of the psychological environment occurs during learning
(e.g., when a new insight occurs) and during development or regression.

Path. A path between regions A and E is a collection of pairs of
neighboring regions of the form AB, BC, . . . , DE, where all the
regions A, B, C, . . . , D, E are different from each other.* If there is
a path from A to E in the environment of the life space and if the
person is located at A, then it is possible for the person to reach E
(assuming a proper constellation of forces), otherwise not. It is permissible, of
course, for there to be more than one path between any pair of regions. 
If the regions constituting a path are coordinated to activities, then the 
existence of a path indicates that it is possible to get from a starting 
activity to a terminal one by engaging in the specified sequence of inter- 
mediate activities. If an individual wants to engage in some activity but 
there is no path from his current location to that activity, then he can- 
not reach the desired activity without a restructuring of the environment. 

The length of a path may be specified as the number of boundaries 
which must be crossed in any locomotion from one end of the path to 
the other. The distance between two regions is the length of the shortest 
path between them. 
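
The definitions of path, length, and distance given above can be made concrete with a small sketch. It assumes, for illustration only, that the environment is recorded as a table listing the neighbors of each region; the region names and the function name `distance` are hypothetical and do not come from Lewin. Distance is computed as the least number of boundaries crossed, by a breadth-first search.

```python
from collections import deque

def distance(neighbors, start, goal):
    """Return the number of boundaries crossed on a shortest path from
    start to goal, or None if no path exists."""
    if start == goal:
        return 0
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        region, crossed = queue.popleft()
        for nxt in neighbors.get(region, ()):
            if nxt == goal:
                return crossed + 1
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, crossed + 1))
    return None  # unreachable without a restructuring of the environment

# Hypothetical environment: regions A to E, plus a region X with no neighbors.
neighbors = {
    "A": {"B", "C"},
    "B": {"A", "D"},
    "C": {"A", "D"},
    "D": {"B", "C", "E"},
    "E": {"D"},
    "X": set(),
}

print(distance(neighbors, "A", "E"))  # two paths of equal length exist
print(distance(neighbors, "A", "X"))  # no path
```

In this hypothetical environment there are two paths from A to E (through B or through C), both crossing three boundaries, while X cannot be reached from A at all.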

Force. The basic concept for describing the dynamic features of the 
life space is that of force. It is used to characterize the direction and 
strength of the tendency to change or to resist change. The conceptual 
properties of force are direction, strength, and point of application. The 
direction and strength of a force can be treated mathematically as a 
vector. The combination of the forces acting at a given point at the 
same time is the resultant force. The most fundamental proposition con- 
cerning force is: “whenever a resultant force (different from zero) 
exists, there is either a locomotion in the direction of that force or a 
change in cognitive structure equivalent to this locomotion” [95, p. 
256]. A major part of the work of field theorists has been concerned 
with the analysis of component forces, their determinants, and the 
effects of their combination. 

In this analysis it has proved useful to distinguish certain types of 
force. A driving force is coordinated to a tendency to change, usually to 
a locomotion of the person. A restraining force corresponds to a barrier 
or obstacle to locomotion and is effective only when some driving force 
opposes it. Both driving and restraining forces may arise from the needs
and abilities of an individual, from the actions of another individual, or
from impersonal aspects of the situation. These forces are called own,
induced, and impersonal, respectively.

* The definitions of path, length of path, and distance given here are not
identical with those provided by Lewin [92, p. 25]. They conform more closely to the
definitions given by graph theory [67]. The definitions which I have given here
conform, I believe, rather closely to the meanings intended by Lewin in his less
formalized discussions of these terms.

Force field. One must usually assume that many forces exist simul- 
taneously at different regions of the life space. A force field specifies for 
each region the strength and direction of the force which would act on 
the person if he were located at that region. A region is said to have a 
positive valence if it is attractive to the person and to have a negative 
valence if it is repulsive. These terms can be defined more precisely by 
stating that a positive valence corresponds to a field of driving forces
where all forces are directed toward the same region and that a negative 
valence corresponds to such a field where all forces are directed away 
from the same region. It should be noted that force fields may be com- 
posed of own, induced, or impersonal forces and of driving or restrain- 
ing forces. 

Conflict of forces. A conflict situation may be defined as one where 
forces acting on the person are opposite in direction and about equal in 
strength. Any of the different types of forces may enter into a conflict 
situation. Although all conflicts have certain properties in common, im- 
portant differences in behavior arise from different combinations of con- 
flicting forces. These differences have been analyzed extensively by 
Lewin [90, pp. 66-170; 92, pp. 175-200; 95, pp. 256-269]. It will be 
possible here only to suggest the major features of this analysis. 

Conflicts involving driving forces are the familiar “choice” situations. 
The simultaneous presence of two or more valences in the life space may 
generate conflicts of various sorts, depending upon the signs of the va- 
lences, the structure of the psychological environment, the gradients of the 
force fields as distributed spatially, and the locations of the person and 
the valences. It can be shown, for example, that if a person is located
between two mutually exclusive positive valences, he will tend to remain 
at a region between them which minimizes the distance to each. If, on 
the other hand, he is located between two negative valences, he will tend 
to try to increase the distance from both and will tend to “leave the 
field.” For this reason, the use of threat of punishment to get someone 
to do a disagreeable task can succeed only if all passable paths leading 
away from the area of conflict go through one of the negatively valent 
regions. Conflicts may also arise from the simultaneous presence of posi- 
tive and negative valences in the same direction from the person. When 
these two valences are located in the same region, we have the familiar 
cases of “ambivalence.” If the two valences are not in the same region 
as, for example, when a reward is promised for doing a disagreeable 
task, there will be a tendency to seek a path to the region of positive 
valence which does not pass through the one of negative valence. It is 
assumed that the magnitude of driving forces decreases with increasing
distance from the valence (the familiar concept of goal gradient), and
field theoretical derivations concerning the points of equilibrium in over- 
lapping force fields are essentially the same as those described by Miller 
[113]. 

Conflicts between driving and restraining forces produce “frustra- 
tion” situations. It often happens that a person is surrounded by a field 
of restraining forces (barrier) such that he cannot go beyond the area 
bounded by this field. The area within such a field has been termed
“space of free movement,” a concept of considerable use in Lewin’s 
analysis [95, pp. 130-154] of problems of adolescence and in Lippitt’s 
treatment [103] of the social-emotional consequences of different styles 
of leadership. An important difference exists between the situation in 
which the barrier surrounds the person with a positive valence outside 
and the one in which the person is outside with the valence within. In 
all cases of conflict between driving and restraining forces there arises a 
tendency to find a way to circumvent the barrier or to remove it. Under 
certain circumstances attempts to remove a frustrating barrier will take 
the form of “aggression,” thus producing phenomena of the sort de- 
scribed by Dembo [37] and by Dollard et al. [42]. Field theorists do not 
believe, however, that it is correct to assert that frustration always sets 
up tendencies toward aggression. 

The analysis of conflict is further complicated by the fact that each 
of the types of conflict discussed thus far can involve own, induced, or 
impersonal forces in all possible combinations. Of considerable interest 
to social psychology are the situations in which own and induced forces 
are in conflict. If another person O has power over the person P he 
can induce driving or restraining forces in P’s life space. Thus, if a 
person’s own needs make “going to a party” have positive valence and 
if the person’s boss asks him “to work at the office,” a conflict of driving 
forces is set up. Since O’s ability to set up induced forces in P’s life space 
depends upon O’s power over P, one resolution of such a conflict is for 
P to attempt to change O’s power over him or to get O to exercise his 
power in certain ways. Some of the most interesting problems of inter- 
personal relations arise from the conflict of own and induced forces [26]. 
Perhaps of equal interest are the situations in which the person is sub- 
jected to conflicting induced forces set up by different people. Meyers 
[112] has studied situations in which two adults give conflicting
commands to the same child. One of the classic problems of social
organization is how to minimize the conflicts which result when the same
individual has more than one boss. Through the concepts of power and
induced force, it is possible to treat within one set of concepts the effects 
of social organizations upon individual motivation. 

Overlapping situations. In order to describe the full complexity of
the determinants of a given instance of behavior, it is often necessary to
recognize that an individual may be involved simultaneously in more
than one situation. He may, for example, sit in a classroom and both 
listen to the teacher and plan a fishing expedition. Or, he may be in- 
fluenced at the same time by his membership in two different groups. 
The relative effect of each of several overlapping situations upon be- 
havior is indicated by the relative potency of each. In particular, the 
effect a force has on behavior is proportional to the potency of the 
related situation. 
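
The proportionality between potency and the effect of a force can be illustrated by a deliberately simplified sketch. It assumes that each overlapping situation contributes a single signed force along one direction and that relative potencies are expressed as weights summing to one; both assumptions, the numerical values, and the function name `weighted_resultant` are hypothetical and are not part of the theory as stated.

```python
def weighted_resultant(situations):
    """Sum signed forces, each scaled by the potency of its situation.

    situations: iterable of (potency, force) pairs.
    """
    return sum(potency * force for potency, force in situations)

# Hypothetical values for the classroom example:
# (relative potency, signed force along one direction).
classroom = [
    (0.7, 2.0),   # listening to the teacher
    (0.3, -3.0),  # planning the fishing expedition
]

resultant = weighted_resultant(classroom)
print(resultant > 0)  # the more potent situation prevails in this example
```

Although the force belonging to the fishing expedition is the stronger of the two in this example, its lower potency leaves the resultant directed toward attending to the teacher.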

The concepts of overlapping situations and potency have been used 
by Barker, Dembo, and Lewin [12] in their study of the constructiveness 
of primary and secondary play, by Cartwright and Festinger [22, 27] 
in their theory of decision making, by Lewin [95, pp. 143-44] in his 
discussion of social marginality as reflected in adolescence or in member- 
ship in minority groups, and by Watson and Lippitt [131] in their in- 
vestigation of the problems of maintaining identification with both the 
home culture and the host culture in programs of cross-cultural educa- 
tion. 

Person. Up to this point we have treated the person as an undif- 
ferentiated region which has a location within regions of the psycho- 
logical environment and whose locomotion is determined by forces act- 
ing upon it. In field theory, however, the person is also treated as a 
differentiated entity, having both structural and dynamic properties. 
The structural elements of the person are the same as those of the en- 
vironment, namely, region and boundary, but they are coordinated to 
different empirical data. The fundamental dynamic concept applicable 
to the person is that of interdependence of the states of regions. Lewin’s 
treatment of the person as a differentiated entity was mainly program- 
matic, except for the analysis of tension which was carried out in con- 
siderable detail. 

Simple and organizational interdependence. Lewin distinguished two 
basic types of interdependence which may exist among regions. 


1. One type of dependence, which has been called simple dependence, 
has the following characteristics. First, it is based on a process which has 
the character of “spreading” from one part to neighboring regions according 
to proximity. Second, the change of the dependent part usually occurs in the 
direction of equalizing its state and the state of the influencing part. For
instance, spreading of tension means that neighboring parts tend to change
so that a state of equal tension is approached in all parts. Third, the
dependence of part a on part b is essentially of the same type (although 
not necessarily of the same degree) as the dependence of part b on part a. 

2. The dependence which has been called organizational interdependence 
shows rather different characteristics. First, it is a type of dependence be- 
tween a and b similar to that between leader and led, or between someone 
using a tool and the tool. In such a case, the way a depends upon b is
obviously rather different from the way b depends on a. Second, the
organizational dependence usually does not work from neighbor to neighbor 
like the spreading of tension. It is a selective process: sometimes one part, 
sometimes another part of the system is used as a tool in a specific way. For 
instance, the same need may produce an organized activity in different parts 
of the muscular system. Third, the kind of change resulting from the 
organized interdependence of a and b usually does not tend to equalize the 
state of a and b. The subordinate part b (i.e., the part which is led, the tool) 
changes in a way which helps a (the leading part) to reach its objective, 
but it does not lead to greater final equality between the two [95, pp. 
117-118]. 


The concept of organizational interdependence is required for
handling problems of hierarchical ordering of needs, sentiments, values, those
problems associated with the phenomena of “integration,” and those 
involving the steering of behavior. Lewin was able merely to indicate 
in broad terms the way in which such an analysis might proceed, and 
a more thorough treatment of these problems is still awaited. The con- 
cept of a simple dependence has proved to be most useful in dealing 
with the spread of need tensions. 

Inner-personal tension systems. Tension is defined as a state of an 
inner-personal region which may vary in magnitude and which displays 
a tendency to spread. More specifically, if the magnitudes of tension of 
two neighboring regions are unequal, there is a tendency for them to 
become equal. The concept of tension is coordinated to the existence of 
a need: if a need or “quasi need” (i.e., an intention) is activated, then 
the magnitude of tension of a corresponding inner-personal region is 
greater than zero. The stronger the need, the greater is the magnitude 
of tension and, according to the principle of the spread of tension, the 
larger is the number of regions under tension. Consumption, or the 
realization of an intention, reduces the level of tension. Overconsump- 
tion (satiation) results in a state that may be thought of as “negative 
tension,” which also may spread. 

The boundary of a region may vary with respect to its permeability.
If the boundary between two neighboring regions is not completely 
permeable, some minimal difference in the magnitudes of tension of 
the two regions will be maintained. Generally speaking, the perme- 
ability of the boundaries of a person decreases with increasing age. 
Thus, a high state of tension in a region of a child has more widespread 
effects than does the same state of tension in a region of an adult. 
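
The spread of tension across boundaries of differing permeability may be sketched as a simple simulation. The model is an assumption made for illustration and is not Lewin's formalization: each boundary is given a hypothetical value `min_diff`, the minimal difference in tension it maintains (zero for a completely permeable boundary), and at each step tension flows across a boundary in proportion to the amount by which the difference exceeds that minimum.

```python
def spread_tension(tension, boundaries, rate=0.25, steps=200):
    """Let tension spread between neighboring regions.

    tension:    dict mapping region -> magnitude of tension
    boundaries: dict mapping (a, b) -> minimal difference the boundary
                maintains (0.0 means completely permeable)
    """
    t = dict(tension)
    for _ in range(steps):
        change = {region: 0.0 for region in t}
        for (a, b), min_diff in boundaries.items():
            diff = t[a] - t[b]
            excess = abs(diff) - min_diff
            if excess > 0:
                # Flow runs from the region of higher tension to the lower.
                flow = rate * excess * (1 if diff > 0 else -1)
                change[a] -= flow
                change[b] += flow
        for region in t:
            t[region] += change[region]
    return t

start = {"a": 6.0, "b": 0.0, "c": 0.0}

# Hypothetical "child": all boundaries completely permeable.
child = spread_tension(start, {("a", "b"): 0.0, ("b", "c"): 0.0})

# Hypothetical "adult": the boundary between b and c is less permeable.
adult = spread_tension(start, {("a", "b"): 0.0, ("b", "c"): 1.0})

print(child)
print(adult)
```

In the first case the tension arising in region a comes to be shared equally by all three regions; in the second, region c remains at a lower level than its neighbor, so that the same initial tension has a less widespread effect.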

Structure of the person. One of the most important of the structural 
properties of the person is its degree of differentiation, or the number of 
distinguishable parts contained in it. Many factors influence the degree 
of differentiation of a person, but in general it may be said that there
is a tendency for the degree of differentiation to increase with the
development of the individual.

Consideration of the arrangement of regions within the person leads 
to two additional structural concepts: central-peripheral and inner-outer. 
The degree of centrality of a region is inversely related to the maximal 
distance of that region from any other region within the person. Thus, 
the more central a region, the more readily will it influence all other 
regions of the person through spread of tension (assuming a constant 
value for the permeability of all boundaries) and the more readily, on 
the average, will it be influenced by other regions of the person. The 
inner-outer distinction refers to how readily a region can be affected by 
influences spreading to it from outside the person. The outermost region 
of the person has a common boundary with the outer boundary of the 
person. The innermost region lies at the greatest distance from the
outside of the person.* Lewin showed that the most central region and the
innermost region are not necessarily the same, though in some of his
earlier writings he did not draw a sharp distinction between the two
concepts. They are used more or less interchangeably, for example, in his
insightful description [94, pp. 18-30] of differences between personality
types in Germany and the United States. He also used the concept of 
centrality in a more suggestive than formal way to account for certain 
differences in speed of satiation reported by Karsten [78]. Recently, 
Sanford [120] has employed the concepts central-peripheral, inner-outer, 
along with that of temporal stability, to bring about considerable clari- 
fication of the meaning of “surface” and “depth” as found in many 
discussions of personality theory. 
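
That the most central and the innermost regions need not coincide can be shown with a small sketch. It assumes a hypothetical person made up of five regions arranged in a chain, of which only the first adjoins the outer boundary of the person. Centrality is ranked by the maximal distance of a region from any other region, as defined above; the ranking of "innerness" by the number of boundaries separating a region from an outermost region is a simplifying assumption, and all names are illustrative.

```python
from collections import deque

def distances_from(neighbors, start):
    """Breadth-first distances (boundaries crossed) from start to every
    region that can be reached from it."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        region = queue.popleft()
        for nxt in neighbors[region]:
            if nxt not in dist:
                dist[nxt] = dist[region] + 1
                queue.append(nxt)
    return dist

def max_distance(neighbors, region):
    """Maximal distance from region to any other region; the smaller
    this value, the more central the region."""
    return max(distances_from(neighbors, region).values())

def depth(neighbors, region, outermost):
    """Distance from region to the nearest outermost region."""
    dist = distances_from(neighbors, region)
    return min(dist[o] for o in outermost)

# Hypothetical person: a chain of five inner-personal regions.
person = {
    "p1": {"p2"},
    "p2": {"p1", "p3"},
    "p3": {"p2", "p4"},
    "p4": {"p3", "p5"},
    "p5": {"p4"},
}
outermost = {"p1"}  # assumed: only p1 adjoins the outer boundary

most_central = min(person, key=lambda r: max_distance(person, r))
innermost = max(person, key=lambda r: depth(person, r, outermost))

print(most_central, innermost)
```

In this arrangement the middle region of the chain is the most central, whereas the region at the far end of the chain is the innermost.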

For completeness, a final distinction advanced by Lewin deserves 
brief mention here. In his various writings one encounters a number of 
references to the belief that the person should be conceived as being com- 
posed of “inner-personal” regions which have to do mainly with moti- 
vational tensions and a “motor-perceptual” stratum which, much like 
Freud’s ego, keeps the person in touch with the environment through 
perception and action. This distinction points to a basic problem in 
Lewinian field theory which I shall discuss at greater length in a subse- 
quent section (Formal organization of the system). 

* It should be noted that there is a technical difference between the definition
of inner region advanced by Lewin and that suggested by Bavelas [16]. Further
empirical work will be required in order to choose between the two.

Relations between person and environment. In order to comprehend
the essential nature of Lewinian theory it is necessary to understand that
properties of the person and of the psychological environment are
conceived as existing in one interdependent field. The state of the person
influences the properties of the environment, and vice versa. Specific
instances of this general proposition can be found in virtually all areas
of psychological investigation: motivational tensions influence cognitive 
structure and certain cognitive states may generate tensions; the activa- 
tion of a tension system produces tendencies toward locomotion of the 
person in his environment and obstacles to this locomotion may result 
in a dedifferentiation of the person—to mention but two examples. 
Lewin [92, pp. 95-108; 95, pp. 9-20, 87-128] has discussed in detail 
certain aspects of the interdependence of person and environment. I 
shall merely outline here some of the more important features of this 
treatment. 

Tension, valence, and force. The most basic assumption concerning 
relations between person and environment asserts a functional relation 
between tension in the person and valence (force field) in the environ- 
ment. When an inner-personal region is in a state of tension, an “ap- 
propriate” environmental region acquires positive valence (i.e., becomes 
the center of a force field). The environmental region “selected” de- 
pends upon the qualitative nature of the region under tension and the 
qualitative nature of the environmental region. The magnitude of val- 
ence is some positive function of tension. Lewin stated this assumption 
in the following formula: Va(G) = F(t,G). Thus, if a tension system 
corresponding to hunger is activated, an environmental region concerned 
with eating acquires positive valence. The exact way in which “objects” 
become selected has not been worked out within field theory. 

It follows from this assumption, from the definition of valence as a 
force field, and from the assumption of a goal gradient, that the magni- 
tude of force operating on the person toward some goal region is a posi- 
tive function of the magnitude of tension and a negative function of the 
distance of the person from the goal. Given a constant level of tension, 
as the person approaches a goal region the magnitude of force toward 
the goal increases. When the person actually reaches the goal, and “con- 
sumption” takes place, the corresponding tension system is released so 
that the level of tension approaches zero. 
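
The relation just derived specifies only the direction of dependence: force increases with tension and decreases with distance from the goal. The sketch below chooses one arbitrary functional form satisfying these two conditions in order to make the relation concrete. The particular formula and the function name `force_toward_goal` are assumptions made for illustration; field theory does not specify the form of the function.

```python
def force_toward_goal(tension, distance):
    """One hypothetical form: a positive function of tension and a
    negative function of distance (boundaries between person and goal)."""
    if tension < 0 or distance < 0:
        raise ValueError("tension and distance must be non-negative")
    return tension / (1.0 + distance)

# With tension held constant, force grows as the person approaches the goal.
for d in (3, 2, 1, 0):
    print(d, force_toward_goal(4.0, d))

# After consumption the tension approaches zero, and so does the force.
print(force_toward_goal(0.0, 0))
```

Any other function that rises with tension and falls with distance would serve the derivation equally well.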

Zeigarnik effect. A most fruitful program of experimentation to 
document this conception of tension and force became possible by as- 
suming that the intention to complete an activity generates a correspond- 
ing tension system. As Lewin sometimes stated it, an intention may be 
treated as a “quasi need.” The validity of this assumption was established 
by Ovsiankina [116] who found a marked tendency for people to resume 
an interrupted task when left in a “free” situation. Zeigarnik [142], by 
assuming that a tension system sets up not only forces to complete the 
activity but also to think about it, was able to test a number of
derivations from this theory by measuring tendencies to recall completed and
interrupted tasks.

The fundamental fact established by Zeigarnik is that interrupted
tasks are better recalled than completed ones. This follows immediately 
from the relation between tension and force and from the assumption 
that a tension is released upon completion of the task. It should be 
noted in this connection that the tendency to recall derives from the 
tension set up by the quasi need and not from the mere fact that the 
task is interrupted. Marrow [109] demonstrated that if interruption is 
made equivalent to reaching the goal and completion to not reaching it, 
then more completed than interrupted tasks are recalled. A closely re- 
lated derivation asserts that the difference in recall between interrupted 
and completed tasks should increase with increasing motivation to 
complete the tasks (greater motivation being reflected in a higher level 
of tension). This derivation was confirmed by Zeigarnik [142] and by 
Marrow [109]. 

More complicated derivations are possible if one assumes that ten- 
sion systems tend to equilibrate over time and that the rate of equili- 
bration increases with increasing fluidity of the boundaries separating 
regions under tension. The first assumption was tested by Zeigarnik 
[142] who showed that the difference between recall for interrupted 
and for completed tasks decreases if the test is made at increasing 
intervals after the initial experiment. She tested the second assumption 
by making the further assumption that fluidity of boundaries increases 
when an individual becomes more tired. Her experiment showed a 
small difference in recall for interrupted and completed tasks among 
subjects who were tired. A closely related finding is that if the tasks are
presented initially in such a way that each is seen as merely another 
instance of the same total activity, there is virtually no difference in 
recall between interrupted and completed tasks. This finding is con- 
sistent with the general theory if it is assumed that the boundaries 
between such tasks are highly fluid. 

Other derivations which can be made from the same general theory 
will not be discussed here. A summary is provided by Lewin [95, p. 18]. 
It should be added at this point that a large number of studies have 
been undertaken to replicate and to extend the original Zeigarnik work. 
When Zeigarnik’s original conditions have been exactly reproduced the 
same findings have been obtained. Certain reported failures to replicate 
the findings appear to be due to the fact that the experimenter did not 
actually create the conditions specified by the theory (e.g., the require- 
ment that the boundaries between the tension systems be of sufficient im- 
permeability). One important clarification has come from several studies 
which introduced “success” and “failure” as variables in addition to
those of “interruption” and “completion,” sometimes unintentionally 
and sometimes in a systematic fashion. It should come as no surprise 
that if “interruption” has the meaning of “failure” to the subject, the
tendency to recall is different from when it means “success” or has no 
relation to the ego. Lewis [100] and Lewis and Franklin [101] have pro- 
vided a way of reconciling many of the apparently conflicting findings 
through their systematic comparison of ego-orientation and task-orienta- 
tion. More recently Atkinson [8] has shown how individual measures 
of n-achievement can be used to account for differences among subjects 
in reactions to the possibility of failure. All of these developments are 
consistent, in my opinion, with the Lewinian theory, though they indicate 
needed extensions and elaborations (in particular, they point to the 
desirability of integrating the theories of tension and level of aspiration). 
These developments illustrate a basic tenet of field theory that “objec- 
tively” the same stimulus conditions will produce widely different be- 
haviors when brought to bear on differently constituted life spaces. 

Spread of tension and valence. One can derive certain interesting 
consequences from the theory of tension and valence outlined here by 
examining more closely the tendency of tension to spread. It should be 
evident that the exact course of the spread of tension (how far it spreads, 
to what regions, and how rapidly) depends upon the level of tension, the 
arrangement of regions, and the permeability of boundaries. Since it is 
also assumed that whenever a region is in a state of tension there will 
be a positive valence in a corresponding region of the environment
(provided that an “appropriate” one exists), the spread of tension will
increase the number of regions having positive valence. Moreover, the
structural and dynamic properties of the person will affect the way in 
which new valences arise. 

A consequence of this theory is that the stronger the state of need, the 
wider, on the average, will be the realm of environmental regions having 
positive valence. This conclusion appears to be consistent with commonly 
observed facts. An individual long deprived of food, for example, finds 
many things attractive which he would ordinarily not eat, and an in- 
dividual long isolated from social interaction may be pleased to spend 
time in the company of a boor. 

The theory also provides a basis for dealing with the phenomena of 
substitution. If a barrier in the environment prevents the person from 
reaching a region of positive valence, then by means of the spread of 
tension new environmental regions will acquire positive valence. Exactly 
which regions acquire substitute valence will depend, of course, on the 
structure of the person. It has been suggested that one of the deter- 
minants of the arrangement of inner-personal regions is the relation of 
similarity among the activities or objects to which the regions refer.
Cartwright [23] obtained ratings of the attractiveness of a number of
activities before and after interrupting the subject on one of them. Following
interruption, tasks similar to the interrupted one gained in attractiveness
much more than did dissimilar ones. The attractiveness of similar activi- 
ties was also more affected by completion and failure. In this experiment, 
at least, it appears that similarity does affect the distance between re- 
gions of the person and thus the interdependence of valences of environ- 
mental regions. 

It would be expected from the same line of reasoning that complet- 
ing an activity similar to the one originally having positive valence should 
reduce the tension in the regions corresponding to both activities. In 
other words, the similar activity should have substitute value for the 
original one. While experimental evidence provided by Lissner [106],
Mahler [108], Sliosberg [123], and Adler [2] supports this general
approach, these studies make it clear that many factors in addition to similarity
affect the interdependence among levels of tension in personal regions. 
It is clear, for example, that an activity may acquire substitute valence 
but provide little substitute value. In such a case it seems that the 
boundary between the two inner-personal regions is dynamically uni- 
directional, permitting an increase of tension in the first to raise the 
tension of the second but not permitting a reduction of the tension of 
the second to affect the level of the first. Further work on the inter- 
dependence of tension systems should lead rather easily to a clarification 
of these problems. 

Social relations. For an adequate description of human behavior it 
is essential, of course, to recognize that man lives primarily in a social 
world. In constructing the life space of a given individual this social 
world is treated in the same way as the physical world: those aspects 
which influence the individual at a given time are included in the life 
space and are represented in the form that they have for the individual. 
If an individual believes, for example, that he is a member of a group, 
this fact is contained in his life space regardless of the opinions of other 
members of the group. The task of relating different life spaces to one 
another (i.e., of constructing a social field) is important, but it is too 
complex and underdeveloped for discussion here. 

Group membership. In discussing the kinds of empirical entities that 
may be coordinated to regions of the life space, we saw that groups and 
subparts of groups may exist as regions of the environment of a person.
All the structural and dynamic properties of environmental regions 
discussed above may be attributes of these social regions. Thus, a group 
or subpart of a group may have positive or negative valence, or both, and 
a person may experience conflict because of the operation of such force 
fields. Similarly, all the phenomena of frustration and substitution can be
observed with respect to a person’s position in social regions. 

The valence, or attractiveness, of membership in a group may have 
profound effects upon an individual. The special case of enforced
membership in a group with negative valence has been analyzed in great
detail by Lewin [94, pp. 145-158, 186-200]. The more general implica- 
tions of attraction-to-group for the theory of group dynamics are dis- 
cussed by Cartwright and Zander [29, pp. 73-91], and various tech- 
niques of measurement are described by Libo [102]. 

As noted above, forces which originate in groups or other people 
rather than in the needs of the person are called induced forces. The 
ability of a group, or individual, to induce forces in the person’s life 
space reflects the power of these social entities over the person. Ordi- 
narily, membership within a group means that the group and representa- 
tive members of the group have power to induce forces on the person. 
Such forces usually steer the locomotion of the person into activities which 
are instrumental to the achievement of group goals or are “required” by 
the norms and values of the group. If an individual “accepts” (or in- 
ternalizes) the norms or goals of a group, these set up own forces and 
tension systems within the person. Horwitz [74] has documented this 
last proposition in an experiment in which he employed the Zeigarnik 
technique with group, rather than individual, activities. 

Power. The concept of power (or power field) entered into Lewin’s 
writing almost from the beginning, although he did not give it precise 
definition until much later. He noted in his early discussion of reward 
and punishment that such situations must necessarily involve the ability 
of one individual or group to set up driving and restraining forces in 
the life space of another individual. He also employed the concept of 
power in analyzing such diverse phenomena as the reactions of children 
to a strange adult, the resistance of children to eating certain foods, the 
developmental stage of “negativity,” and the nature of various styles of 
group leadership. As field theorists have turned their efforts to developing 
a theory of group dynamics the concept of power has become of central 
importance—see, for example, Cartwright et al. [26]. 

Lewin’s formal definition of power was stated as a relationship be- 
tween two entities, a and b, which were conceived of quite generally so 
as to include the possibility of dealing with power relations between 
individuals, groups, and subparts of the life space. He defined the power 
of b over a “as the quotient of the maximum force which b can induce on 
a, and the maximum resistance which a can offer” [95, p. 336].° It 
should be noted, in addition, that in the elaboration of this definition
he made it clear that the power of b over a must always be specified with 
respect to the regions of the life space in which the power exists. Indi- 
vidual a may be subjected to strong induced forces from b in certain 
regions but not others, and he may be able to set up stronger resisting 
forces in some than others. The distribution of power throughout the 
life space is one of its most important features. 
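
Lewin’s verbal definition can be put in symbols. The notation below is an illustrative assumption and is not taken from Lewin’s text: f denotes the maximum force which b can induce on a in a region r of the life space, and g denotes the maximum resistance which a can offer in that region.

```latex
% Hypothetical notation for the quotient definition quoted above.
% r indexes the region of the life space in which the power is specified.
\[
  \mathrm{Power}(b/a;\, r) \;=\;
  \frac{f^{\max}_{b \to a}(r)}{g^{\max}_{a}(r)}
\]
```

On this reading, a quotient greater than 1 in region r means that b can induce more force there than a can resist, and the same pair a, b may show quite different quotients in different regions.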

A considerable amount of research has been conducted to discover 
the determinants of power. One of the best established generalizations 
was originally proposed by Festinger [49] on the basis of the findings of 
Festinger, Schachter, and Back [53] and Back [9]. Festinger states “the 
maximum force which the group can successfully induce on a member 
counter to his own forces cannot be greater than the sum of the forces 
acting on that member to remain in the group. The greater the resultant 
force to remain in the group, the more effective will be the attempts to 
influence the member” [49, p. 277]. In other words, the more an indi- 
vidual is attracted to group membership or to another individual the 
greater the power that group or individual will have. It is also clear 
that there are many other “bases” of power. These are discussed in some 
detail by French and Raven [61]. Finally, it should be noted that, given 
a constant social relationship (such as attraction or institutional author- 
ity), the magnitude of induced forces which b can set up on a will de- 
pend upon b’s skills of exerting influence. A pioneering study of this 
problem was conducted by Frank [58]. 
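
Festinger’s statement quoted above can be sketched as an inequality. The symbols are assumptions introduced only for illustration: G is the group, m the member, and the sum runs over the separate forces acting on m to remain in the group.

```latex
% Hypothetical notation for Festinger's bound on induced force.
\[
  f^{\max}_{G \to m} \;\le\; \sum_{i} f^{\,\mathrm{remain}}_{i}(m)
\]
```

The left side is the maximum force the group can successfully induce on the member counter to his own forces. The right side is the total of the forces holding him in the group, so that stronger attraction to the group raises the ceiling on its influence.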

Some of the most important consequences of the power relations among 
people have to do with emotional security. The fact that b has great 
power over a means that b is able to influence a’s satisfaction of his 
needs: power gives ability to help or hinder. Arsenian [6], Wright [138], 
and Cartwright [25] have suggested that the security of an individual 
might be defined as a ratio in whose numerator is put the person’s per- 
ception of the magnitude of his own power plus all friendly or supportive 
power he can count upon from other sources, and in whose denominator 
is put the person’s perception of the magnitude of all hostile power that 
may be mobilized against him. Empirical documentation of the many 
symptoms of insecurity deriving from power relations has been presented
by Cartwright et al. [26], Pepitone [117], Hurwitz, Zander, and Hymo- 
vitch [75], and Zander, Cohen, and Stotland [140]. 
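
The suggested ratio can be written compactly. The symbols are assumptions introduced for illustration only. Each term stands for the person’s own perception of the magnitude of the power in question, not an objective measure.

```latex
% Hypothetical notation for the proposed security ratio.
\[
  S \;=\; \frac{P_{\mathrm{own}} + P_{\mathrm{friendly}}}{P_{\mathrm{hostile}}}
\]
```

Read this way, security rises as the person perceives more power of his own or more supportive power to count upon, and falls as he perceives more hostile power that may be mobilized against him.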

force and the maximum resisting force. These revisions would lead to a definition
of the power of b over a (with respect to a given matter) as the maximum
difference that can occur between the induced force on a set up by b and the force
which a can mobilize in opposition to it.

Summary. The above discussion of Lewin’s system of constructs has
been confined to those constructs which have both received formal
treatment and been subjected to empirical testing. There are many additional
“quasi constructs” which have played an important role in Lewinian 
work (e.g., level of aspiration, social atmosphere, social channels, values, 
morale, and interdependence of fate). Space has not permitted an exam- 
ination of these. As field theorists have extended their research activities 
to new phenomena they have usually employed such “quasi constructs” 
rather than attempting to extend the formal system immediately. Never- 
theless, it is the belief of field theorists that as work proceeds it will be 
increasingly possible to replace the “quasi constructs” with other con- 
structs, mainly those enumerated above, which will fit rigorously into a 
formal system. A more detailed discussion of how the transition from 
“quasi constructs” to a formal system of genuine constructs should pro- 
ceed is given in subsequent sections. 


INITIAL EVIDENTIAL GROUNDS FOR ASSUMPTIONS 
OF SYSTEM’ 


Throughout Lewin’s professional career there was a continuous inter- 
action between empirical research and formal theorizing. It would be 
entirely incorrect to say either that he first developed a “system” and 
then submitted it to empirical testing or that he took a body of em- 
pirical data and built a “system” to fit. A more correct statement would 
be that, although he adopted a theoretical orientation very early in his 
career and maintained it throughout his life, he viewed the construction 
of a formal system as a gradual process to be carried out over the years 
in the most intimate association with empirical work. In this section I
shall attempt to extract the empirical aspects from the theoretical and to 
describe in the “language of data” the principal kinds of empirical phe- 
nomena which have been investigated by field theorists. It will be nec- 
essary to limit the discussion of specific studies to those which were 
originated rather directly under Lewin’s influence. The main purpose of
this section is to convey some impression of the great range of empirical
content deemed to be relevant to field theory.

Since Lewinian field theory does not consist of a “completed” set of
constructs all of which are now related to one another in a rigorously systematic
fashion, I find it difficult to make the distinctions, suggested by the editorial
committee, between this section on “Initial evidential grounds for assumptions of
system” and the later one on “Evidence for the system.” For this reason, I have
attempted to cover both topics in this section. Because of the impossibility of making
any indisputable distinction between field theorists and many other psychologists
whose work is congenial to the field theoretical approach and because of the
tremendous volume of such work, I have found it necessary to restrict the
bibliographic references to a sampling of those investigations which were closely
associated with Lewin or more recently with the Research Center for Group Dynamics.
This is grievously arbitrary and results in a most limited picture of the empirical
work related to field theory. I can only apologize to the reader and to those whose
work is not specifically cited.

The research described here may be put in proper historical perspec- 
tive by noting that Lewin’s doctoral dissertation [87] was published in 
1917 and that his career was terminated by his death in 1947. For ap- 
proximately the first half of this period of thirty years, his work was 
centered in Berlin and was carried out in close association with those 
who were then developing the gestalt point of view. The most important 
publications of this era appeared in the Psychologische Forschung. Dur- 
ing the second half of his career, Lewin’s research was done in the 
United States, primarily at Cornell University, the State University of 
Iowa, and the Massachusetts Institute of Technology, where he
established the Research Center for Group Dynamics. After his death, this
center was moved to the University of Michigan. Lewin’s theoretical 
orientation has continued to influence the research of the Michigan 
group. 

Associative bonds and dynamic processes. The earliest work of 
Lewin [87, 88, 89] consisted of an experimental and theoretical critique 
of earlier research of Ach [1] in which it had been assumed that “asso- 
ciative bonds” set up dynamic tendencies to reproduce a second item 
when a first item is stimulated. The general conclusions drawn from this 
critique were that mere connections are never causes of events. Energy 
must be set free and this derives from mental systems under tension 
(originated by needs and intentions). In these experiments the major 
variables manipulated were frequency of association, intention to recall, 
and expectations created by the sequence of items presented in the test 
series. Lewin drew upon the more dynamic emphasis of the Würzburg
school and attempted to disentangle the structural and dynamic aspects 
of memory. 

Dynamic studies of memory. The natural development of this work 
led to the investigations, cited above, of the influences of tension systems 
upon recall. The principal consequence of this research was to place 
heavy emphasis upon motivational variables, and thus also structural 
properties of the person, as determinants of memory and thinking. These
studies helped to shape the theoretical constructs of need tension, force,
and interdependence among parts of the person and demonstrated the
feasibility of conducting laboratory experiments upon motivational
variables.

Interruption and resumption. One of the most useful of the new 
techniques was to set up in the subject a “quasi need” or intention to 
complete some task and then to interrupt him before completion.
Ovsiankina [116] demonstrated a marked tendency for subjects to
resume the interrupted activity if left free. She also examined how the
tendency to resume is affected by the particular nature of the task and
the point of interruption in the sequence of work. Katz [80] later re- 
peated Ovsiankina’s experiment, introducing several controls and meth- 
odological refinements, and confirmed the earlier findings except for 
those concerning the effect of the point of interruption upon resumption. 
Katz found the frequency of resumption to decline steadily as the task 
was interrupted closer to completion, while Ovsiankina observed an in- 
crease when the task was interrupted very close to completion. Since the 
technique of interruption has been used mainly as a means of testing 
broad theoretical propositions, little research has been directed toward 
reconciling such detailed quantitative contradictions as have emerged
from this line of work. As the theory of tension becomes stated more 
quantitatively, more detailed research will be required. 

There has arisen one important theoretical issue concerning the 
origin of tendencies to resume. Ovsiankina originally raised the question 
whether these tendencies derive from a tension system set up when the
subject begins to work on the task or from the mere perception of an
incompleted task. She undertook an experiment in which a subject could
complete either a task which he himself had started or another one 
which had been partially completed by someone else. Ovsiankina found 
an overwhelming preference to complete the one which the subject him- 
self had started. Adler and Kounin [3] later reached the same conclusion 
from an experiment which avoided some methodological shortcomings 
of the earlier work. Recently, however, the issue has been opened again 
by Henle and Aull [72]. Although they obtained the same quantitative 
results as Adler and Kounin, they conclude from a careful analysis of 
the subjects’ reports that the meanings of the two tasks are so different 
to the subjects that a perceptual explanation of the results is still pos- 
sible. They correctly point out that Lewin maintained that behavior is 
a function of both the person and the psychological environment and 
that the person should not be viewed as operating on a passive environ- 
ment. The issue, then, should be stated as a problem of determining 
under what conditions a perceptual constellation does set up a tendency 
toward action. It is quite possible, of course, that such a tendency is also 
accompanied by a tension system. 

Substitution. The research on resumption of interrupted tasks led
directly to a related question: can the completion of a task different 
from the interrupted one reduce the tendency to resume the interrupted 
one? In other words, can one task have substitute value for another? 
Lissner [106] conducted an experiment in which subjects completed a 
second task after being interrupted on a first and measured tendencies 
to resume the first. She found that the more similar the second to the 
first and the more difficult the second, the less the tendency to resume the 
first. Mahler [108] investigated whether an “irreal” substitute action can
lead to satisfaction of the need. She also used the technique of inter- 
ruption but forced the subject to complete the original task in one of 
four ways: (1) by an action different from the natural one, (2) by talking 
rather than doing, (3) by thinking out the completion, and (4) by an 
“act of magic.” She found, in general, that less real actions have less 
substitute value, but she also found many exceptions to this generaliza- 
tion. The critical question seems to be whether the intrinsic goal of the 
task has been reached. Thus, a “problem” task can be solved by talking 
or thinking just as well as by writing. If, however, the success of such 
a task requires telling the answer to the experimenter, thinking has little 
substitute value. Magic solutions have no substitute value in a realistic
setting, but they do have substitute value if the whole setting is magical. 
Adler [2] varied experimentally the attitude of the subject toward work- 
ing on the task, creating in one group of subjects a “concrete” and in 
one group an “abstract” attitude. In the former situation, subjects were 
“building a particular house” while in the latter they were “building 
houses.” When the concrete attitude was established, little substitute 
value resulted from completion of a second house. When the setting 
called for an abstract attitude, there was considerable substitute value, 
provided that the subjects were old enough to maintain an abstract
attitude. 

A different approach to substitution was initiated by Dembo [37] 
who found substitute actions arising spontaneously in a frustrating situ- 
ation. She observed, however, that such substitutes usually have little 
lasting substitute value. Sliosberg [123] systematically attempted to get 
young children to accept make-believe substitutes (for example, a piece 
of cardboard candy). When the total situation was “serious” these sub- 
stitutes were rejected, but if the situation was one of make-believe the 
substitutes were more readily accepted. And Cartwright [23] attempted
to measure the spread of valence to similar activities when the original 
task was interrupted. 

All of these studies, it should be noted, are dealing in an experimental 
fashion with a concept originally advanced by Freud. The earliest ex- 
perimental investigations of Freudian concepts were undertaken by field 
theorists.

Satiation. What happens if the same activity is repeated over and 
over again? Karsten [78] investigated this question in an experiment
involving the extensive repetition of such tasks as drawing figures, reading
a poem, or writing letters. As repetition is continued there usually occur 
first small variations in the manner of executing the task, then much 
larger variations, then a breaking up of the activity into smaller units of
action with a concomitant reduction of “meaning,” then the making of 
mistakes, and finally fatigue and the appearance of “bodily” symptoms.
Interestingly enough, if the meaning of the same activity is modified by 
embedding it in a different context, the bodily symptoms and fatigue 
often disappear. Karsten found that the speed of satiation depends upon 
the “size” of the activity being satiated (whether, for example, the task 
is to “draw a cat” or “to draw pictures”), the “centrality” or “ego- 
involvement” of the activity, and the state of the person (whether gen- 
erally tired, for instance). Pursuing this last point further, Freund [63] 
found an increase in the speed of satiation during menstruation. Sea- 
shore and Bavelas [122] report results generally similar to those of 
Karsten. 

Kounin [84] made use of a technique of “co-satiation” in which the 
effects of satiating one activity upon the speed of satiating a second 
activity are measured. In experiments on satiation and co-satiation with 
individuals of the same mental age but varying chronological age he 
found that both the speed of satiation and the degree of co-satiation 
decrease with increasing chronological age. These results are interpreted 
in terms of the degree of differentiation of inner-personal regions and 
the permeability of boundaries. Kounin advanced the concept of rigidity 
to account for these and other results.

Frustration. One of the important earlier studies at Berlin was that 
of Dembo [37] on the effects of frustration. She gave subjects impossible
tasks to perform and carefully observed the various symptoms of emo- 
tional tension that emerged. These included such things as anger, ag- 
gression, regression, substitution, and flights into irreality. A carefully 
controlled and quantitative study of frustration and regression was later 
conducted by Barker, Dembo, and Lewin [12] who found marked re- 
ductions in the constructiveness of play among children in a situation of 
frustration. 

The interaction of social factors and frustration was experimentally 
investigated by Wright [138, 139] who found, among other things, that 
when pairs of strong friends are subjected to frustration they display less 
reduction in constructiveness of play, less negative emotionality, more 
cooperation between themselves, and more aggression against the ex- 
perimenter than do pairs of weak friends. French [59] subjected groups 
to frustration by giving them insoluble group problems. He compared 
previously organized groups to groups created in the laboratory and 
found a greater expression of interpersonal aggression and more tenden- 
cies to leave the field in the organized groups but a greater tendency to 
group disintegration in the newly created groups. 

Level of aspiration. Among all the research topics investigated by the 
Lewinian group in Berlin, level of aspiration seems to have generated 
the largest amount of research in America. The term, level of aspiration, 
was introduced by Dembo [37]. Hoppe [73] conducted the first
experimental investigation of the phenomena. This was followed by a study
carried out by Jucknat [76]. The important facts produced by this early 
work were that the experience of success or failure does not depend 
simply upon some “objective” standard of performance but upon the 
aspiration that the subject sets for himself. Performance falling too far
above or below this level of aspiration does not result in feelings of suc- 
cess or failure. Furthermore, an experience of success tends to raise the 
level of aspiration for future performance, and failure tends to lower it.
The results of this line of work also raised important questions concerning 
any simple application of the hedonistic principle because people tend 
not to set a level of aspiration which will “guarantee” success. 

In 1935 there appeared in English three articles by Frank [54, 55, 
56] which served to introduce the topic to American psychologists. In 
the years immediately following, a whole flood of studies appeared. These
were reviewed by Frank [57] in 1941. The most comprehensive survey 
of research findings and theoretical exposition of the topic was presented 
by Lewin, Dembo, Festinger, and Sears [97]. It is not possible here to
describe this large body of literature. We merely note that it includes
investigations of specific situational factors, personality traits,
socioeconomic influences, reference groups, and processes of development.
Festinger [50] has advanced a theory of social comparison processes in 
which aspiration, self-evaluation, and social communication are treated 
together. Much of the more recent research on discrepancies between 
ideal self and actual self has been influenced by the earlier work on level 
of aspiration. 

Decision. The emphasis in field theory upon psychological forces and
their interaction has stimulated various investigations of conflict, choice, 
and decision. Despite the obvious differences in terminology much of this 
work has dealt with the same phenomena as those reported by Miller 
[113] in his discussion of types of conflict and those treated by Tolman 
[128] under the label of vicarious trial and error. Several studies have 
been directed toward investigating the determinants of decision-time. 
Escalona [43] tested certain theoretical derivations concerning decision- 
making among manic-depressive patients. Wright [137] found that chil- 
dren who were inconsistently altruistic or egoistic took longer to make 
decisions involving these values. Jucknat [76] and Barker [10, 11]
demonstrated greater indecision the stronger the valences of the
alternatives, the greater the reality level of the choice, and in choices between
negative (as compared to positive) alternatives. Cartwright [22] and
Cartwright and Festinger [27] showed how decision-time depends upon
the differentiation of the cognitive field and upon an attitude of
cautiousness.

The term, group decision, was employed by Lewin [96] to describe
the situation where individuals make a decision in a group situation con- 
cerning their own future behavior. In a number of experiments it was 
shown that making such a decision appears to “freeze” the intention 
so that the behavior is more likely to occur than if no such decision 
takes place. Bennett [18] has recently investigated some of the many 
variables simultaneously manipulated in a “group decision” and con- 
cludes that the act of making a decision and the perception that a ma- 
jority of other people are also making the same decision account for the 
effect. 

Social perception. The term, social perception, has come to refer to 
two rather different sorts of phenomena: the influence of social variables 
upon perception and the perception of social objects. Although field the- 
orists have not conducted much research upon the social determinants of 
perception, the basic proposition that the person and psychological en- 
vironment constitute an interdependent field has predisposed them to ac- 
cept readily the so-called new look in perception advanced by Bruner 
and others [19]. Lewin’s various discussions of the determinants of the 
“meaning” of perceived objects and events always stressed the impor- 
tance of social variables. 

Considerably more research has been done on the perception of social 
objects. This work has been greatly influenced by Heider [70, 71] and by 
Asch [7]. Kelley [81], for instance, demonstrated how an expectation 
that an individual will be “warm” or “cold” influences the actual per- 
ception of the individual and the readiness to interact with him. Pepitone 
[117] constructed an experiment to examine some of the determinants 
of the perception of social power and approval. Thibaut and Riecken 
[126] studied a situation in which the subject is confronted with a person 
having relatively high or relatively low power who complies to an in- 
fluence attempt made by the subject. They found that the perceived 
causal locus for compliance is “internal” to high-power and “external” 
to low-power individuals. And Festinger, Pepitone, and Newcomb [52] 
investigated a group phenomenon which they call deindividuation in 
which members of a group do not pay attention to other individuals 
qua individuals. Under such conditions individuals do not relate events 
taking place in the group to specific individuals. 

Values. When Lewin moved from Germany to the United States 
one of the things that impressed him most was the extreme difference 
between the two countries in certain of the values held by residents of 
each. Lewin [94, pp. 3-68] wrote several articles outlining the nature of 
these differences and developing a theory of how to change cultural
values. The problem of developing techniques for doing quantitative
research on cultural values has proved to be most difficult. Bavelas [14]
and Kalhorn [77] devised an ingenious technique for discovering the
content of values and the social surrogates among school children. White 
[133, 134] developed a scheme for analyzing the content of documents 
in order to reveal the values expressed in them. And Wright [137] 
studied the ontogenetic development of the value of “fairness” by means 
of an experiment with children in which the allocation of desirable toys 
between themselves and others had to be made. 

Phenomena related to the concepts of group standard or group norm 
have been studied rather intensively. Lewin [95, pp. 188-237] showed 
how a wide variety of such phenomena could be analyzed as quasi- 
stationary equilibria. Coch and French [35] employed this approach in 
a field experiment in which group standards concerning work output in 
a factory were changed. A large number of studies were stimulated by a 
field study of Festinger, Schachter, and Back [53] in which the develop- 
ment of group standards in a housing project was documented. Sum- 
maries of this work may be found in Festinger [49] and Kelley and 
Thibaut [83, pp. 765-770]. 

Authority and social influence. Among the many differences which 
Lewin noted between Germany and the United States he was partic- 
ularly interested in styles of leadership and the social-emotional atmos- 
phere of groups. In a characteristic fashion this interest was converted 
into experiments contrasting the effects of different styles of leadership. 
The results, reported by Lewin, Lippitt, and White [98] and Lippitt 
[103], have had a deep influence both upon social psychological research 
and upon practices in education, social group work, and administration. 
One of the first repercussions of this research was increased acceptance 
of the view that leadership consists of behaviors that may be developed 
and taught. Bavelas [15] reported dramatic improvements in the func- 
tioning of groups as a result of training leaders in the skills of democratic 
leadership. Interest in leadership training rapidly developed in many 
directions. Of these I can only mention the experimental workshop 
established for training community leaders which was evaluated by 
Lippitt [104]. The techniques of training and the philosophy of com- 
bining training with research which characterized this workshop were 
then employed in establishing the National Training Laboratories in 
Group Development which holds annual programs at Bethel, Maine. 

Research on the determinants of leadership in children’s groups has 
been carried out in summer camps, classrooms, and the laboratory by 
Lippitt, Polansky, Redl, and Rosen [105] and Grosser, Polansky, and 
Lippitt [66]. This work has been able to distinguish acts of leadership 
which are deliberate attempts to influence others from those which exert 
their effects through a process of emotional contagion. 

Leadership and social influence more generally usually occur in 
some social setting possessing a structure. Individuals have positions
within such structures and much of their behavior is influenced by the
relations among these positions. Several studies have been conducted
by field theorists to examine how these “sociological” variables interact 
with more traditional “psychological” ones in determining the actions 
and feelings of individuals. French and Snyder [62] explored the ways 
in which the opinions and attitudes of enlisted men affect the ability 
of noncommissioned officers to influence their behavior. Zander, Cohen, 
and Stotland [140] studied the working relations among psychiatrists, 
clinical psychologists, and psychiatric social workers in order to discover 
how legal and institutional relations among the professions affect the 
attitudes and behavior of individual members of these groups. Cohen 
[36] conducted an experiment in a business office in which the super- 
visor’s behavior toward subordinates was systematically varied so as to 
provide a well-structured or poorly structured environment for the sub- 
ordinates. The dependent variables in this study were many symptoms 
which may be taken as reactions to threat. Kelley [82] created a status 
hierarchy in the laboratory and investigated the ways in which this 
influences the direction of the flow and the content of communication. 
And Bavelas [17] stimulated a large number of experiments in which the 
communication channels within a group are systematically controlled 
and varied while various aspects of group functioning are observed. 

Cooperation and competition. As field theorists have turned their 
attention to interpersonal relations they have become interested in the 
ways in which various institutional arrangements influence the motiva- 
tional interdependence of people. Deutsch [39, 38] compared the conse- 
quences of organizing college classes so that the students were coopera- 
tively or competitively related to one another. He found marked differ- 
ences in such things as interpersonal liking, readiness to be influenced, 
coordination of efforts, productivity, and comprehension of communications.
Lewin, in discussing the consequences of membership in minority
groups, often pointed out that group membership can result in the mem- 
bers’ sharing a common fate. Thibaut [125] set up an experiment in 
which some of the consequences of such a state of affairs could be 
recorded. 

Intergroup relations. The rise of Hitler and the existence of prejudice 
and discrimination against minority groups in the United States stim- 
ulated Lewin to become deeply concerned with such phenomena. He 
wrote extensively on the problem [94] and took steps which led to the 
establishment of the Commission on Community Interrelations of the 
American Jewish Congress. This research organization conducted a 
number of basic studies of the dynamics of prejudice and techniques for 
improving intergroup relations. One of the most imaginative approaches
to these problems was a series of experiments, conducted by Citron, 
Chein, and Harding [33] and by Citron and Harding [34], in which 
techniques were developed for handling antiminority remarks and for 
training citizens to use these techniques. This organization also made 
extensive use of the technique known as “action research,” one of the 
best illustrations being that of the community self-survey as described by 
Wormser and Selltiz [136]. When the Research Center for Human Rela- 
tions was established at New York University, this line of work was 
continued there. Among the many important studies conducted by this 
group special mention should be made of the investigations of attitudes 
in interracial housing reported by Deutsch and Collins [41] and Wilner, 
Walkley, and Cook [135]. 

Food habits. As field theorists have broadened their interests in at- 
tempting to account for the total field of determinants of behavior, their 
research has become increasingly interdisciplinary. A good example of 
this trend may be seen in the research of Lewin [93] on the question of 
why people eat what they do. On the basis of this work he constructed 
a theory of social channels and gatekeepers in which he analyzed the 
ecological processes by which food gets onto the table. From this analysis 
it was possible to designate certain individuals as gatekeepers whose 
decisions are crucial in determining what other people eat. A number of 
cultural, economic, and technological variables were shown to influence 
these decisions. 

Summary. From this sketchy survey of research conducted by Lewin 
and those closely associated with him, it should be apparent that field 
theorists aspire to deal with a very large range of psychological phe- 
nomena. The settings in which research occurs and the kinds of people 
studied are, moreover, extremely heterogeneous. Such a breadth of
research activity reflects the field theorist’s commitment to attempt to
understand the total life space or field of determinants of behavior and 
an unwillingness to restrict his interests to some limited kind of behavior. 
He cannot accept in principle any view of psychology that would create 
a categorical separation, or discontinuity, between theories of learning, 
perception, motivation, personality, childhood, adolescence, maturity,
social relations, or any other classification of human behavior and
experience. He believes that psychological theory should resist the
development of any such separations and should contribute to the understanding
of people of all ages and in all possible states and circumstances. 

A minimal prerequisite for the achievement of such a theory is the 
creation of a language of description applicable to the entire domain of 
psychology.

METHOD OF CONSTRUCTION⁸


How is psychology to acquire an adequate language? The proposal 
that psychological theory be stated in the language of physics or physi- 
ology has been rejected by field theorists, as noted above. Psychology, 
then, confronts the critical task of building for itself an adequate set of 
concepts, and psychologists are deeply divided on how to proceed. Lewin, 
in seeking guides for his own development of concepts, relied heavily 
upon the conclusions reached by Cassirer [30] from his extensive analysis 
of the history of science. The implications of this analysis for psychology 
were elaborated by Lewin in his paper on the conflict between Aristo- 
telian and Galileian modes of thought in psychology [90, pp. 1-42]. 

Cassirer maintains that the crucial feature of the advance of natural 
science has been a change in the kind of concepts used. Scientific prog- 
ress has consisted, in large measure, of moving from “thing-concepts” to 
“relation-concepts,” or from the method of “abstraction” to that of “con- 
struction.” This diagnosis of the source of scientific progress has had a 
fundamental influence on the thinking of psychological field theorists. 

Abstraction and construction. The essence of the distinction ad- 
vanced by Cassirer can be stated most concisely by a quotation from 
Weyl’s discussion of the philosophy of mathematics and natural science. 


Aristotle ascends from the single object to the concept by isolating in- 
dividual features of the object and by “abstracting” from everything else. 
Thus every other object which exhibits those same features falls under the 
same concept, or into the same class. In this procedure (as in descriptive 
botany or zoology) only the really existing objects are concerned, and classes 
are formed preferably in such manner that, according to the testimony 
of experience, the concepts entail as many “connotations” as possible. In the 
mathematical-physical or “functional” formation of concepts, on the other 
hand, no abstraction takes place, but we make certain individual features 
variable that are capable of continuous gradation, and the concept does 
not extend to all actual, but all possible objects thus obtainable . . . [132, 
p. 150]. 


The method of abstraction, as Cassirer points out, has appealed to 
those who are empirically oriented because it has the apparent advantage 
of making very few assumptions about empirical reality. 


⁸ At this point I find it necessary to depart rather markedly from the outline
provided by the editorial committee. The material presented here is most closely 
related, I believe, to the committee’s section on “Construction of function forms.” 
Before functional relations among variables can be treated productively, it is nec- 
essary to achieve a satisfactory designation of the variables. Since field theory is 
greatly interested in the prior question of how to construct concepts, I have taken 
the liberty of substituting a discussion of the construction of concepts in place of 
one concerning the construction of functions. 
Nothing is presupposed save the existence of things in their inexhaustible
multiplicity, and the power of the mind to select from this wealth of 
particular existences those features that are common to several of them. 
When we thus collect objects characterized by possession of some common 
property into classes, and when we repeat this process upon higher levels, 
there gradually arises an ever firmer order and division of being, according 
to the series of factual similarities running through the particular things. 
The essential functions of thought, in this connection, are merely those of 
comparing and differentiating a sensuously given manifold [30, pp. 4-5]. 


Although this policy of nonintervention on the part of the scientist has 
undoubtedly made the method of abstraction attractive to those who 
highly regard empirical data, it entails certain unfortunate consequences. 
We quote Cassirer again. 


If we merely follow the traditional rule for passing from the particular 
to the universal, we reach the paradoxical result that thought, in so far as it
mounts from lower to higher and more inclusive concepts, moves in mere 
negations. The essential act here presupposed is that we drop certain 
determinations, which we had hitherto held; that we abstract from them 
and exclude them from consideration as irrelevant. What enables the mind 
to form concepts is just its fortunate gift of forgetfulness, its inability to grasp 
the individual differences everywhere present in the particular cases. . . . 
If we adhere strictly to this conception, we reach the strange result that all 
the logical labor which we apply to a given sensuous intuition serves only to 
separate us more and more from it. Instead of reaching a deeper
comprehension of its import and structure, we reach only a superficial schema from
which all peculiar traits of the particular case have vanished [30, pp. 18-19]. 


The basic difficulty with the method of abstraction, Cassirer con- 
cludes, lies in an error it makes in its very attempt to stay close to em- 
pirical data. 


It is only owing to the fact that science abandons the attempt to give 
a direct, sensuous copy of reality, that science is able to represent this reality 
as a necessary connection of grounds and consequents [30, p. 164]. 


As soon as we take one step beyond the first naive observation of isolated 
facts, as soon as we ask about the connection and law of the real, we have 
transcended the strict limits prescribed by the positivistic demand. In order 
even to indicate this connection and law clearly and adequately, we must 
go back to a system that develops only universal hypothetical connections
of grounds and consequences, and which renounces in principle the “reality” 
of its elements [30, p. 117]. 


Inertia is for Galileo—as empty space was for Democritus—a postulate 
that we cannot do without in the scientific exposition of phenomena, but
which is not itself a concrete sensible process of external reality. It denotes
an idea, conceived for the purpose of ordering the phenomena, yet not
standing on the same plane methodologically with these phenomena [30,
p. 169].

A fundamental issue, then, lies at the basis of the distinction between 
abstraction and construction. The elements of abstraction are phenomena 
or objects of the scientist’s experience; the elements of construction are 
ideas. The method of construction rests on an apparent paradox: in 
order to develop concepts adequate to describe empirical reality we must 
construct them by an act of thought. The ideal concepts of natural sci- 
ence “go beyond the given, in order to grasp more sharply the systematic 
structural relations of the given” [30, p. 128]. Viewed from this perspec- 
tive, the so-called conceptual properties of constructs carry an essential 
rather than any “surplus” meaning. Operational definitions serve to 
coordinate constructs to specific empirical data, but conceptual defini- 
tions place them in a system of ideas so that they may be manipulated 
in accordance with the laws of logic and so that the necessary connec- 
tions of grounds and consequents displayed in the empirical data can 
be grasped. The operational and conceptual definitions of a construct, 
moreover, cannot be successfully developed independently of each other 
because the operations, or measurements, used in any particular defini- 
tion must fit the requirements of the conceptual properties of the con- 
struct. Only when both operational and conceptual definitions are given 
is it possible for a construct to represent the particularities of individual 
cases and to show the necessity of the occurrence of these particularities. 
A test score on an intelligence test, for instance, may provide some 
actuarial information about an individual through knowledge of em- 
pirical correlations between test scores and other characteristics of people, 
but unless the test score can be related to a construct (or set of con- 
structs) with conceptual properties and thus to a logical system it is not 
possible to understand why variations in intelligence must be related to 
other characteristics of the individual. 

Two important consequences flow from an adequate understanding 
of the utility of conceptual definitions. Both have to do with the place 
of statistics in the psychologist’s methodological armament. It is more 
than an interesting historical fact that Galileo and Newton did not pro- 
ceed according to approved statistical procedures; they employed neither 
a representative sample nor a factor analysis. Indeed, it may be argued 
that it is fortunate that modern statistics was not available to them, for
the classical branches of natural science have rarely used statistics except 
as a tool for estimating “errors of measurement.” The first big advance 
in physics came when it was recognized that laws do not “summarize” 
the distribution of observed events but state necessary relations among 
all “possible” events. Applied to the contemporary psychological scene
this experience shows, I believe, that one can “account for” variance
among data in only a limited sense so long as one employs exclusively 
operational definitions. If one wants to “explain” variance, he must use 
terms having conceptual as well as operational definitions. I believe, too,
that it is at least an intuitive awareness of the difference between 
“frequency of association” and “law” (and not merely pragmatic con- 
siderations) that has kept even the most “scientific” psychologists from 
adopting representative sampling as a sine qua non of experimentation.⁹
Once a law has been proposed, the psychologist is interested in testing 
it against all conditions included under the law whether these occur 
frequently or not. 

In recent years psychologists have discussed with considerable heat 
the distinction advanced by Allport [5] between ideographic and nomo- 
thetic approaches [47] or between clinical and statistical prediction 
[111]. To the field theorist the issue is not properly stated. He agrees 
with the ideographic or clinical approach in stressing the necessity for 
representing the full complexity of individual cases and accepts the 
proposition that no two people are exactly alike. He agrees, too, that the
frequent occurrence of an event is a poor guide to predicting its occur- 
rence in the future. But he does not accept the view that unique or 
infrequent events are unlawful. He believes that by employing concepts 
defined through the method of construction and by establishing func- 
tional relations as laws it is possible to demonstrate the necessity of 
“unique” events. Lewin stated the situation in the following way: 


If one represents behavior or any kind of mental event by B and the 
whole situation including the person by S, then B may be treated as a
function of S: B = f(S). In this equation the function f, or better its general
form, represents what one ordinarily calls a law. If one substitutes for the 
variables in this formula the constants which are characteristic for the indi- 
vidual case one gets the application to the concrete situation. 

The determination of the laws is therefore only one side of the task of 
explaining mental life. The other side, which is of equal importance and 
inseparably connected with the determination of the laws, involves the task 
of representing concrete situations in such a way that the actual event can 
be derived from them according to the principles which are given in the 
general laws [91, p. 11].
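
The relation between a law and an individual case that Lewin describes can be loosely illustrated in code. In the sketch below the law is the general form of a function, and a concrete situation is a set of constants substituted into it. The names `Situation` and `law`, the choice of variables, and the functional form are all assumptions made purely for illustration; the text specifies none of them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Situation:
    """A concrete situation S: hypothetical constants for one individual case."""
    valence: float   # assumed attractiveness of a goal region
    distance: float  # assumed psychological distance to that region


def law(s: Situation) -> float:
    """Hypothetical general form f: strength of the tendency to approach.

    The form (valence divided by distance) is an illustrative assumption,
    not a law stated in the text.
    """
    return s.valence / s.distance


# The same law applied to two different individual cases.
case_a = Situation(valence=6.0, distance=2.0)
case_b = Situation(valence=6.0, distance=3.0)
behavior_a = law(case_a)  # B = f(S) for case A
behavior_b = law(case_b)  # B = f(S) for case B
```

The point of the sketch is only that one general form covers every case, frequent or rare; what differs between cases is the constants that represent the concrete situation.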


Field theorists hold, then, that the development of adequate concepts
in psychology will be accomplished by use of the method of construction.
A limited number of “elements of construction” are needed
which, together with laws governing their relations, can serve to represent
all of the individual phenomena of interest to the psychologist.
This is the task to which Lewin devoted himself and which remains to
be completed.

⁹ This comment about representative sampling does not, of course, reflect any
lack of appreciation for the importance of proper sampling as a technique for
estimating the incidence of properties or events in a parent population. What is
at issue is the proper use of information about the distribution of data in the
population.


In psychology, one can use psychological “position,” psychological 
“forces,” and similar concepts as elements. The general laws of psychology 
are statements of the empirical relations between these constructive elements 
or certain properties of them. It is possible to construct an infinite number 
of constellations in line with those laws; each of these constellations cor- 
responds to an individual case at a given time. In this way, the gap between 
generalities and specificities, between laws and individual differences, can
be bridged [95, p. 61].


A good example of the method advocated here is Lewin’s classical 
analysis of “types of conflict” in which he was able to show how the 
various types could be constructed from certain constellations of psycho- 
logical forces [90, pp. 114-170]. Miller’s treatment of the same problem, 
although using different elements of construction, also illustrates the 
procedure [113]. 

In this connection, it may be useful to note that the “elements of 
construction” are essentially the “primitives” of a mathematical system. 
It must be added, however, that a term like “psychological force” may 
serve as an element of construction, or primitive, in constructing higher- 
order concepts like those of “equilibrium” or “central force field,” 
whereas from another point of view it may itself be constructed from 
other primitives like “magnitude,” “direction,” and “point of applica- 
tion.” The ideal of the formal theorist is to achieve a small number of 
primitives from which it is possible to build up a large system of higher- 
order constructs. In the construction of a system of concepts to be used 
in empirical research, however, it is perfectly proper to employ as 
elements of construction terms which may at a later stage of develop- 
ment be constructed from other elements. 
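
A minimal sketch may make the layering of primitives and higher-order constructs concrete. Here a force is built from the three lower-order primitives named above, and "equilibrium" is then built from forces. The class and function names, the two-dimensional geometry, and the numerical tolerance are assumptions introduced for illustration only.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Force:
    """A force constructed from three lower-order primitives."""
    magnitude: float             # strength of the force
    direction: float             # angle in radians
    point_of_application: tuple  # (x, y) location where the force acts


def resultant(forces):
    """Vector sum of the forces, returned as an (x, y) pair."""
    x = sum(f.magnitude * math.cos(f.direction) for f in forces)
    y = sum(f.magnitude * math.sin(f.direction) for f in forces)
    return x, y


def in_equilibrium(forces, tol=1e-9):
    """Higher-order construct: forces at one point whose resultant is zero."""
    points = {f.point_of_application for f in forces}
    if len(points) != 1:
        raise ValueError("all forces must share one point of application")
    x, y = resultant(forces)
    return math.hypot(x, y) <= tol


origin = (0.0, 0.0)
balanced = [Force(2.0, 0.0, origin), Force(2.0, math.pi, origin)]
unbalanced = [Force(2.0, 0.0, origin), Force(3.0, math.pi, origin)]
```

In this sketch `Force` serves as a primitive for `in_equilibrium` while itself being composed of simpler terms, which is the two-sided status the paragraph above describes.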

Commensurability. A major advantage of employing the method 
of construction is that it provides, in principle, a commensurate system 
of concepts. The empirical psychologist is overwhelmed by the multi- 
plicity of psychological phenomena and by their apparent qualitative 
diversity. The theorist, moreover, is confronted with a heterogeneous 
collection of descriptive terms which have no clear relationship to one 
another. Field theorists hope through the use of proper procedures in 
the building of constructs to reduce this chaos and to achieve a com- 
mensurate system of concepts. Leeper has stated this point clearly in his 
critique of Lewin’s work. 


It is easy to say that we need always to recognize that behavior is a
product not only of motives and habits, but also of the complex structure
of the particular present situation. But it is a difficult thing, practically, to
conceive this total assortment of causal factors in such a fashion that we can 
really relate behavior to all of these different elements. We have meant to 
do so. But we have lacked the intellectual tools required to secure effective 
command over such complex materials. Part of the difficulty comes because 
the various factors in motivational situations seem to be qualitatively dif- 
ferent, incommensurable in nature. It is as if we were asked to add together 
three quarts of milk, six pounds of rice, four yards of silk, and ten amperes 
of electric current. It is the contention of Lewin, however, that the factors 
are not really incommensurable. There is actually an interaction between 
them in a human life. He has tried, therefore, to develop constructs that 
would put all the factors into commensurate terms so that we might effec- 
tively take into account the situational elements as well as the factors of 
motive, habit, and response [85, pp. 29-30]. 


The achievement of a coherent system of concepts requires, field 
theorists believe, utilization of the method of construction. Concepts 
must be constructed from a limited number of undefined terms (primi- 
tives) and a few postulates concerning their “properties.” The concepts 
are to be defined in terms of the primitives. By allowing only such
constructs in the language of theory, one achieves a coherent conceptual
system in which it is clear how each concept relates logically to each 
other one and of what “conceptual type” each concept is. Until this is 
achieved there can be little hope of obtaining quantitative laws. It has 
always been remarkable to me how many apparently different phenom- 
ena Lewin was able to describe by means of a limited number of 
elements of construction. Some impression of how this feat was accom- 
plished may be gained from his interesting discussion of the conceptual 
dimensions of his various constructs.


1. Position is a “spatial relation of regions. . . .” Examples of
psychological concepts which have the conceptual dimension of position are: group
belongingness of an individual, his occupational position, involvement in an
activity.

2. Locomotion has a different dimension from position. It refers to a
“relation of positions at different times.” Any psychological phenomenon
that can be represented as a locomotion—and that holds for most “behavior”—
would have the same conceptual dimension.

3. Cognitive structure might be regarded as having the same dimension
as position because it refers to the relative position of different parts of a
field. Structure does not refer, however, to the position of one point but to
the position of a multitude of points or regions.

4. Force or “tendency to locomotion” has conceptually a different
character from actual locomotion, although locomotion is one of the symptoms
(operational definition) for a constellation of forces where the resultant
force is greater than zero. . . .

5. A goal has the conceptual dimension of a “force field”—that is, of a
distribution of forces in space. Goal (or in field-theoretical terminology, a
positive valence) is a force field of a special structure, namely, a force 
field where all forces point toward the same region. To conceive of a goal 
in this way gives it a definite place within the totality of possible patterns of 
force fields. The counterpart of a distribution of forces toward one region 
is the distribution away from one region. This is equivalent to the concept 
of “aversion.” Other types of force fields are equivalent to what is called 
a “difficulty” or “barrier.” The transformation of such everyday concepts as 
goals, difficulties, aversions into force fields of different types makes it pos- 
sible to link these qualitatively very different entities in a way which lays 
open their functional similarity and differences. 

6. Conflict refers not to one force field but to the overlapping of at least 
two force fields. “Frustration” has the same dimension as conflict. A system- 
atic survey of the possible types of frustration or conflict should, therefore, 
inquire how force fields can overlap in such a way that equally strong but 
opposite forces result at some points of the field. Such analysis permits a 
systematic treatment of the conditions and the effects of conflicts. The con- 
cept equilibrium has the same dimension as conflict; it refers to certain 
constellations of overlapping force fields. 

7. Fear may seem to have the same dimension as aversion. However, in 
most cases fear is related to the psychological future. It has to deal with 
some aspect of “time perspective.” In this respect it is similar to concepts 
like hope, plan, expectation. Expectation refers to the psychological structure 
and the distribution of forces on the reality level of the psychological future. 
Hope refers to a relation between the structure of the reality level and of 
the wish level of the psychological future. Guilt refers to the relation be- 
tween the structure of the reality and wish level of the psychological past. 

8. The concept of power refers to a “possibility of inducing forces” of 
a certain magnitude on another person. The concept of power field, there- 
fore, does not have the same conceptual dimension as that of a force field. 
In using concepts like attack, defense, aggression, friendship, one has to be 
aware of the different dimensions of the concepts: power field, force field, 
force and behavior. 

9. Values influence behavior but do not have the character of a goal 
(that is, of a force field). For example, the individual does not try to “reach” 
the value of fairness but fairness is “guiding” his behavior. . . . Values are 
not force fields but they “induce” force fields. That means values are con- 
structs which have the same psychological dimension as power fields. It is 
interesting to consider from this point of view the psychoanalytical theory 
that values are “internalized” parents. Independent of whether this state- 
ment in regard to the genesis of values is or is not correct we can at least 
say that values and persons are equivalent in so far as both can be repre- 
sented by power fields [95, pp. 39-41]. 
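
The distinction drawn in the passage above between a goal, an aversion, and a conflict can be made concrete with a toy model. What follows is a minimal sketch, not Lewin's formalism: it assumes a one-dimensional row of regions numbered 0 to 10, and it assumes, purely for illustration, that the magnitude of a force falls off as valence / (1 + distance). The names `force`, `resultant`, and `conflict_points` are hypothetical.

```python
def force(valence, center, x):
    """Signed force at position x from one field centered on `center`.

    A positive result pushes toward higher positions, a negative one
    toward lower positions. valence > 0 models a goal (forces point
    toward `center`); valence < 0 models an aversion (forces point away).
    The 1 / (1 + distance) falloff is an illustrative assumption.
    """
    if x == center:
        return 0.0
    toward = 1 if center > x else -1
    return valence * toward / (1 + abs(center - x))


def resultant(fields, x):
    """Sum of the forces acting at x from all overlapping fields."""
    return sum(force(v, c, x) for v, c in fields)


def conflict_points(fields, positions):
    """Positions where at least two nonzero forces cancel one another."""
    points = []
    for x in positions:
        active = [f for f in (force(v, c, x) for v, c in fields) if f != 0.0]
        if len(active) >= 2 and abs(sum(active)) < 1e-12:
            points.append(x)
    return points


goal = [(1.0, 4)]                   # one positive valence at region 4
aversion = [(-1.0, 4)]              # one negative valence at region 4
two_goals = [(1.0, 0), (1.0, 10)]   # two equally strong, overlapping goal fields
```

In this sketch a single field never yields a conflict point; only the overlap of two fields does, which is the sense in which conflict is said to have a different dimension from a goal.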


Constructs and quasi constructs. At this point a question of tactics 
is in order. It is very well, one may say, to assert the need for a system 
of concepts, defined in terms of a limited number of elements of construction
and clearly ordered according to conceptual type, but how does
psychology go about achieving this? Is it not necessary to engage in a 
great deal of pure description before we know enough to invent the 
correct system of constructs? The Lewinian answer is a qualified “yes.” 
Even though Lewin felt confident of the value of the system of concepts 
he built up from such elements of construction as force and region, he 
found it necessary to employ a number of quasi constructs, like level of 
aspiration, social atmosphere, or gatekeeper, which could not be defined, 
when introduced, in terms commensurate with the more formal system.*
But it is clear that he considered these only as starting points. His discus- 
sion of the “frustration-aggression” research of Dollard et al. [42] is 
illuminating in this respect. 


In SR theory “frustration” is treated as a “concept,” as an “element of
construction.” The attempt is made to define this concept operationally and
to proceed from there to a quantitative theory, for instance, about the
relation between frustration and aggression. When the psychologist who follows
field-theoretical lines speaks about frustration, learning, hope, friendship,
aggression he is conscious of the fact that he is using “popular terms.”
These terms are quite helpful, even necessary, in the beginning. However,
they are not considered, within field theory, as psychological concepts in the
sense of scientific “elements of construction.” The reason for this is that a
term like “frustration” (a) lacks a conceptual definition through
coordination to mathematical concepts, (b) refers in a vague way to a multitude
of different settings rather than to one conceptually definable type of
situation.

If this is correct, it would be scientifically meaningless to attempt, for
instance, to link the intensity of frustration lawfully with any specific effect
(such as aggression); for one would have to know the type of frustration
and the detailed setting in order to make any definite derivations. Indeed,
the experiments show that it is as correct to say “frustration leads to
increased friendship and nonaggression” as it is to say “frustration leads to
aggression.” It is correct to say that frustration leads to increased as well as
to decreased productivity, that it leads to new efforts as well as to passivity.

The field-theoretical approach in this point is more radical. Its higher
demands on concepts can be formulated in the following way: Psychology
should be as much concerned with the question of what frustration “is”
psychologically, as with the effect of frustration. In fact, field theory
considers it impossible to investigate the laws of frustration, hope, friendship,
or autocracy without investigating at the same time what frustration, hope,
friendship, or autocracy “is” psychologically.

I am well aware that questions about the “nature” of objects or events
have been much abused and have been asked in a scientifically meaningless,
metaphysical way. . . . There is one meaning behind the question about
the “nature” of things which is as essential for psychology as it is for science
in general. If a chemist finds a certain material he may be able to define it
operationally by pointing out where it can be found and by specifying its
color and weight. In studying this material, the first question will be: “What
is this material chemically?” He might find that it is an element or a
compound, or he might find that the chemical constitution of this material
varied from piece to piece (as much as the psychological nature of
frustration varied from occasion to occasion). In this case, the material, in spite
of its being well-defined operationally, does not represent “one type” from
the point of view of the chemist. The criterion for this oneness is the
possibility of representing it by one chemical formula, through one
combination of “elements of conceptual construction” (such as ions, atoms). What
an object is is now determined by the possibility of characterizing it by one
combination of conceptual constructs [95, pp. 34-36].


* It is interesting in this connection to recall that when Lewin first introduced
the term “valence” it had reference only to the “demand character” of objects and
activities. Only after several years did he define valence in terms of forces (i.e.,
as a central force field) and thus place it unambiguously in a system of concepts.


At any given stage, then, the complete body of psychological theory 
will necessarily contain quasi constructs which do not yet fit rigorously 
into the more formally developed parts of the theory. The admissibility 
of such terms is essential if psychology is to avoid the danger, referred 
to above, of allowing formal systems to define the field of psychology. It 
should be recognized, however, that such quasi constructs serve as pro- 
grammatic guides for next steps in conceptualizing. Quasi constructs, 
like “need,” “ego,” “attitude,” “ability,” or “learning,” should be wel- 
comed by psychological theorists, for they are important way stations 
along the route to rigorous theory, but they should also provide a chal- 
lenge because they cannot be included in rigorous derivations. 

Method of successive approximation. The psychologist is not in- 
terested only in conceptually rigorous constructs; he is also concerned 
with the empirical relations among them. In implementing this concern 
he may proceed, however, in various ways. One line of procedure is to 
select two or three properties, hold constant or randomize all others, and 
then concentrate on determining precisely the equation specifying their 
functional relations. The emphasis placed on exact quantification makes 
this procedure attractive to many “scientifically” minded psychologists, 
but the procedure contains some difficulties which make the field theorist 
doubt its desirability. The crux of the problem lies in the selection of 
properties. Which ones are to be chosen? Surely they are not selected at 
random. A general map is needed to show the full array of interde- 
pendencies before undertaking a precise quantitative determination of 
the relations among a limited number of properties. Research should 
proceed by a series of successive approximations, beginning with a gross 
determination of “what goes with what” and moving progressively to a
more and more exact specification of the form of the functional relations.
The first approximation is a simplification of the empirical situation
in that it ignores the quantitative details, but it is acceptable in that
“the representation given in the first approximation will not be destroyed 
but only made more articulated by the second approximation since the 
whole situation is taken into account from the beginning” [91, p. 17]. 
Lewin points out that the early experiments on tension systems deliber- 
ately attempted such a first approximation and he credits this procedure 
for the fact that more precisely quantitative experiments carried out 
subsequently have not forced any basic change of the original theory. 


Either one tries to determine the actual percentage of resumption as 
accurately as possible, or one is mainly interested in the question whether 
the effect of an intention can be adequately understood as the creation of 
a tension system. For the latter question it is at present of minor importance 
whether the percentage of resumption is 75, 80, or 85, because any of these 
figures would be in line with the general assumption. To prove or disprove 
the theory of tension systems, it seems much more important to find a 
variety of derivations from this theory which should be as different as pos- 
sible from each other, and to test as many as possible of these derivations, 
even if this test should be rather crude quantitatively at the beginning [95, 
pp. 8-9]. 


The fact that it is possible to employ this method of successive
approximation while using concepts which are built up rigorously from
elements of construction sheds some light on the distinction between
intervening variables and hypothetical constructs advanced by
MacCorquodale and Meehl [107]. Without entering into a detailed
discussion of this problem, I should like to suggest that Lewinian concepts
are more properly classed as hypothetical constructs than as intervening 
variables. The existence of “tension,” for example, is assumed and its 
general properties specified long before quantitative equations can be 
determined. The elaboration of this point of view has been developed 
by Rozeboom, who reaches the following conclusion: 


Once we realize, in fact, that the structure of a system of HC (hypothetical
constructs) is in itself a predicate sufficient to carry a set of existential
operators, we become aware that it is not even necessary for the hypothesis
to specify in quantitative detail the functional relations by which
the variables are connected. It is sufficient that the hypothesis merely
designates for each construct the other variables of which it is an immediate
function and those which, in turn, the construct immediately helps
determine. . . . Recognition that the structure of causal linkages among a
set of mediation variables is independent of the specific quantitative nature
of the relations frees us from preoccupation with excessive quantification of
empirical relations during the early phases of a science, enabling us to
identify the major immanent determinants of behavior prior to discovery of
their exact relations to their causal antecedents and observable consequences
[119, p. 263].


Summary. In order to understand the essential approach of field
theory it is necessary to understand what has been called here the
method of construction. This method, originated in the field of
mathematics, has been employed throughout recent centuries in the natural
sciences. It is currently being used in psychology by field theorists and
by many theorists who differ from Lewinians in other respects. The 
method of construction replaces the method of abstraction and puts 
upon the scientist the responsibility to create a system of constructs 
which will relate to one another in a logical system (through conceptual 
definitions) and will refer unambiguously to empirical data (through 
operational definitions). The successful construction of such a system 
will produce concepts which are commensurate and which can be lo- 
cated according to their “type” or “dimension.” The tactical problem of 
how to achieve such a system requires at any given stage of develop- 
ment the use of quasi constructs and the method of approximation. 


MENSURATIONAL AND QUANTIFICATIONAL PROCEDURES 


Use of mathematics. One of the most significant features of Lewin’s 
work lies in the use he made of mathematics. Since his approach to 
mathematics stressed its nonquantitative aspects, one may question 
whether it should be discussed under the heading of “mensuration and 
quantification.” Lewin rejected the implicit assumption made by psy- 
chologists of his day that the use of mathematics is equivalent to treat- 
ing things quantitatively. For this reason, it seems to me to be appro- 
priate to discuss here the broader question of the role of mathematics in 
psychological theory. 

Mathematics has, of course, been employed in psychology ever since
its earliest attempts to become an empirical science. The famous “laws” 
of psychophysics, advanced by men like Weber or Fechner, were stated 
as mathematical equations, and the whole movement of psychological 
testing was based upon statistical techniques. Mathematical statistics 
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as an aid in constructing psychological theory. He was impressed by the 
fact that the natural sciences had employed mathematics not merely to 
process data but as a means of representing essential properties of 
physical events. The fundamental concept of “physical space,” for ex- 
ample, was defined in terms of geometry, and all theoretical manipula- 
tions of the concept were carried out according to the requirements of 
geometry as a logical-mathematical system. Geometry was an indispen- 
sable part of physical theory. Psychological theory too, Lewin believed, 
will have to contain concepts whose elements of construction are mathe- 
matically described empirical entities (such as psychological space and 
psychological forces). 

Topology. The idea that psychology will have to make use of the 
concept of space occurred to Lewin early in his career. In 1912, while 
still a student, he defended the thesis that “psychology, dealing with 
manifolds of coexisting facts, would be finally forced to use not only the 
concept of time but that of space, too” [91, p. vii]. Thus from the very
beginning of his theoretical work the concept of “life space” was
fundamental, and a search for an appropriate mathematics for representing
this space occupied a large portion of his energy for many years. In
order to understand how Lewin arrived at the conclusion that topology
can best represent the life space, it is important to remember that the
scientific world at that time was in a ferment, just beginning to realize
the full implications of the view that physical space need not necessarily
be represented in terms of euclidean geometry. The freedom thus given
to physicists to seek a better mathematical system for representing physi- 
cal space encouraged Lewin to believe that psychological space, too, 
might be represented in terms of a noneuclidean geometry. 

After considering carefully the empirical properties of psychological 
space, Lewin concluded that topology is best suited for the task. It is a
branch of geometry dealing with the most general properties of space,
being concerned with transformations which preserve certain properties
and invariants. Thus, for example, a circle, triangle, and a polygon with
any number of sides are topologically equivalent, being Jordan curves
(i.e., closed curves which do not intersect themselves). Topology is a
rigorous mathematical system, permitting logical derivations, and yet
it does not require the making of assumptions about quantitative
properties of psychological phenomena, assumptions which do not seem to
be justified by the nature of these phenomena. It provides, moreover,
the promise of allowing “first approximations” which will not have to be
abandoned later when more precise quantification becomes possible.*

* This feature of topology makes it quite different from many of the current
“mathematical models” which make quantitative assumptions of questionable
empirical validity.

A full discussion of the topological concepts employed by Lewin
would not be appropriate here; the reader is referred instead to the 
primary sources [91; 92; 95, pp. 305-338]. But in order to clarify some 
of the issues involved in any use of mathematics for theory, it may be 
well at this point to consider one criticism often raised against Lewin’s 
use of topology. This criticism holds that Lewin did not employ any 
theorems of topology to derive psychological events and, therefore, that 
he did not use topology as a mathematical model. It is correct, I believe, 
that Lewin did not explicitly employ any topological theorems in his 
theoretical derivations, but I would question the conclusion often drawn 
from this fact: that Lewin’s interest in topology led him up a blind 
alley. Certain considerations point to the contrary conclusion. 

1. The use of topology made it possible to take seriously the notion of 
psychological space. The elementary concepts of topology, such as 
region, boundary, and path, allow the representation of such basic psy- 
chological phenomena as “approach” or “avoidance” without implying 
that these terms have meaning in a physical space (for example, it is 
possible to represent the fact that a man is approaching “occupancy of 
the White House” or “marriage” without specifying his geographical 
location). The use of topological concepts for this purpose was intended 
to be more than “mere pictures”; if any theorems of topology led to in- 
correct psychological derivations, then the use of topology would have 
to be abandoned or modified. A field theorist would certainly accept the 
proposition that when the primitives of topology are mapped onto 
psychological phenomena all the theorems of topology must produce true 
statements about psychological properties. It is in this sense that field
theorists take topology seriously. It should be noted, however, that this 
does not imply that theorems of topology, by themselves, will necessarily 
produce particularly interesting psychological derivations. In fact it was 
in part a conviction that topological concepts are insufficient to repre- 
sent the full richness of psychological phenomena that diverted Lewin’s 
attention away from the theorems of topology as a potential source of 
significant psychological derivations. 

2. Topological concepts, although essential for representing
psychological space and the structural properties of person and environment,
do not in themselves permit the representation of the dynamic
aspects of psychological phenomena. Lewin was quite explicit about
this matter:


Among psychological constructs one may distinguish geometrical and 
dynamical constructs. Examples of dynamical constructs in psychology are 
instinct, libido, force, tension. Dynamical constructs deal with causation, 
with the conditions of change. They are the main objects of interest in
psychology. Examples of geometrical constructs are direction, distance, and
position.

The geometrical constructs might be called “constructs of the first order” 
as compared with dynamical concepts as “constructs of the second order.” 
The dynamical constructs are based partly upon the geometrical ones [92, 
p. 20]. 


Lewin’s concept of “psychological force” illustrates the interrelations 
between geometrical and dynamic constructs. Its purpose is to represent 
“tendencies to change.” Such tendencies, having location, direction, 
and magnitude, can be represented by the formal properties of force, 
which are direction, magnitude, and point of application. A psychologi- 
cal force can be expressed mathematically as a vector whose direction 
and point of application are stated in terms of the geometrical concepts 
of position and direction. 

A complete formal system which will provide theorems of real in- 
terest to psychology will consist not only of geometrical (topological) 
concepts but also of dynamic ones. 

3. Lewin believed, however, that topology, particularly as presented 
by Veblen [129], was in a sense too general and that certain develop- 
ments of it would be required before it would be entirely satisfactory for 
representing the geometrical aspects of psychological phenomena. He 
was especially concerned about the inability to deal with concepts of
“distance” and “direction.” For this reason, he devoted considerable 
energy to the task of inventing a “hodological” space which could better 
deal with these. Rather than assuming that topology was fully developed 
in those aspects of importance to psychology and testing the applicability 
of all its formal properties, he attempted to develop those aspects which 
together with dynamic concepts would provide theorems of value to 
psychology. This extension of topology would, presumably, not violate 
any formal properties of the already developed system. 
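
The way “distance” and “direction” can be grounded in regions and boundaries, rather than in metric coordinates, can be sketched in a few lines. This is an illustrative reading only: it assumes, as a simplification, that the distinguished path between two regions is one that crosses the fewest boundaries, that distance is the number of boundaries crossed, and that direction is the first step of that path. The region names and the functions `shortest_path`, `distance`, and `direction` are hypothetical.

```python
from collections import deque

# Hypothetical life-space regions; each entry lists the regions that
# share a boundary with it (no geographical location is implied).
REGIONS = {
    "student": ["intern"],
    "intern": ["student", "physician"],
    "physician": ["intern"],
}


def shortest_path(regions, start, goal):
    """Breadth-first search; returns the regions along a shortest path, or None."""
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        for nxt in regions[path[-1]]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None


def distance(regions, start, goal):
    """Number of boundaries crossed along a shortest path (None if unreachable)."""
    path = shortest_path(regions, start, goal)
    return None if path is None else len(path) - 1


def direction(regions, start, goal):
    """First step of a shortest path from start toward goal, as a pair of regions."""
    path = shortest_path(regions, start, goal)
    if path is None or len(path) < 2:
        return None
    return (path[0], path[1])
```

Under these assumptions a force on the person in region "student" toward the goal "physician" has the direction ("student", "intern"), even though no physical coordinates appear anywhere in the representation.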

Linear graphs. In recent years the University of Michigan’s Research 
Center for Group Dynamics has had a program of work involving the 
collaboration of mathematicians and psychologists and devoted to
exploring further the possibilities of employing mathematics in the
construction of psychological theory. From this work certain conclusions
can be drawn. 

1. Although Lewin discussed the possibility of representing the life 
space by a multidimensional topological space, his diagrams were almost
always two-dimensional planar maps. As Harary and Norman [67] 
show, if one ignores the points within regions of this map, it is possible 
to construct a linear graph which will correspond exactly to any planar 
map. The correspondence between planar maps and certain linear 
graphs is accomplished by mapping each region of the planar map onto
a point of the graph and each boundary between regions onto a line
joining these points. An illustration of this procedure is given in Fig. 1.
The planar map in Fig. 1a consists of four regions, each of which has a
common boundary arc with each other region. The linear graph in Fig.
1b corresponds to this map since each region of the map has a
corresponding point in the graph and each boundary has a corresponding
line. Bavelas [16] was the first psychologist to attempt this “translation” 
in his restatement (in terms of points and lines) of such concepts as 
“centrality” and “distance” (which Lewin had treated by regions and 
boundaries). I believe that eventually it will be possible to express the 
major properties of Lewin’s hodological space in terms of linear graphs 
since graph theory makes explicit use of the concept “path.” A graphical
representation of psychological phenomena has several advantages,
though it must be noted that linear graphs do not permit one to
distinguish parts within points (like points within regions of the planar
map).

Fig. 1a. A topological planar map consisting of four regions each of which has a common boundary arc with each other.

Fig. 1b. A linear graph corresponding to the planar map in which each point corresponds to a region of the map and each line to a common boundary arc.
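
The mapping from a planar map to a linear graph described above can be sketched as follows. This is a minimal illustration under stated assumptions: the map is encoded by listing, for each region, the boundary arcs on its border, and two regions are joined by a line when they share at least one arc. The region names, arc labels, and the function `map_to_graph` are invented for the example and do not come from Harary and Norman.

```python
from itertools import combinations

# Hypothetical encoding of a map like Fig. 1a: four regions, every pair
# of which shares exactly one boundary arc (arc labels are invented).
PLANAR_MAP = {
    "R1": {"a12", "a13", "a14"},
    "R2": {"a12", "a23", "a24"},
    "R3": {"a13", "a23", "a34"},
    "R4": {"a14", "a24", "a34"},
}


def map_to_graph(planar_map):
    """Map each region onto a point and each shared boundary onto a line."""
    points = sorted(planar_map)
    lines = {
        frozenset((p, q))
        for p, q in combinations(points, 2)
        if planar_map[p] & planar_map[q]  # the two regions share an arc
    }
    return points, lines


points, lines = map_to_graph(PLANAR_MAP)
```

For this four-region map the result has four points and six lines, one for each pair of regions. Note that the encoding discards everything inside a region, which is the limitation mentioned in the text.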

2. One advantage of linear graphs over planar maps derives from 
the famous “four color” property of such maps. Although topologists 
have never been able to construct a formal proof, they are agreed that 
it is possible to color every map drawn on a sphere or a plane with no 
more than four colors in such a way that no two regions, having a com- 
mon boundary arc, have the same color. Stated in terms more directly 
relevant to psychology, this property of maps makes it impossible to add 
a region to the map in Fig. 1a in such a way that all five regions will 
have a common boundary arc with one another. It is obvious, however, 
that one can add a point to the graph of Fig. 1b and join it by a line to
each of the other points by the simple procedure of allowing lines to
cross. Since it is almost certain that more than four “parts” of the life
space may all be directly touching one another, linear graphs would
appear to be more appropriate than planar maps for representing the
life space.*
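
The impossibility of five mutually adjacent regions corresponds to the fact that the graph joining every pair of five points cannot be drawn in a plane without crossings. A short check, offered only as a sketch, uses the standard bound that follows from Euler's formula: a graph with v points (v at least 3), no loops, and no repeated lines can be drawn without crossings only if it has at most 3v - 6 lines. The function names are hypothetical. It may also be noted that the four-color property itself was later established, with computer assistance, by Appel and Haken in 1976.

```python
from itertools import combinations


def complete_graph_lines(n):
    """All lines of the graph in which every pair of n points is joined."""
    return list(combinations(range(n), 2))


def satisfies_planar_bound(num_points, num_lines):
    """Necessary (not sufficient) condition for a crossing-free drawing."""
    if num_points < 3:
        return True
    return num_lines <= 3 * num_points - 6


four_ok = satisfies_planar_bound(4, len(complete_graph_lines(4)))   # 6 <= 6
five_ok = satisfies_planar_bound(5, len(complete_graph_lines(5)))   # 10 <= 9 fails
```

Passing the bound does not prove that a crossing-free drawing exists; failing it, as the five-point case does, proves that none exists.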

3. Leeper [85, pp. 109-110] has pointed out that Lewin did not 
entirely succeed in representing the type of situation that contains an 
“irrevocable path,” in which a person can move freely from A to B but 
once having reached B cannot return to A. It could be noted also that 
in Lewin’s treatment of inner-personal regions the boundaries were “not 
directed” in the sense that influences could pass equally in either direc- 
tion. This inability to treat asymmetrical relations imposes a serious 
limitation on Lewin’s use of topology. It now appears, however, that the 
mathematical theory of linear graphs is ideally equipped to handle such 
situations by use of the general concept of “directed graph.” As shown
in Harary, Norman, and Cartwright [68], it is possible to treat in a
completely rigorous fashion structures made up of points and directed
lines. Figure 2 illustrates a situation in which it is possible to retrace one’s
steps along the path from A to B, not possible to retrace along the path
from A to C, but possible to return to A after reaching D without
retracing. Other more complicated situations could be represented with
the same concepts. Furthermore, it is possible to characterize rigorously
such properties of total structures as “degree of connectedness.” French
[60] has recently employed certain structural properties of directed
graphs together with certain dynamic properties to derive several features
of social influence processes.

Fig. 2. A linear graph showing “revocable” and “irrevocable” paths. The path from A to B can be retraced after each step; the path from A to C cannot be retraced; and one can return to A after reaching D but without retracing.
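
The three kinds of path described for Fig. 2 can be distinguished mechanically once the lines are directed. The sketch below is an assumption-laden illustration: the figure itself is not reproduced here, so the set of directed lines is invented to match its verbal description, and the extra point E as well as the functions `successors`, `reachable`, and `can_retrace` are hypothetical.

```python
from collections import deque

# Hypothetical directed lines chosen to fit the description of Fig. 2.
DIRECTED_LINES = {
    ("A", "B"), ("B", "A"),               # revocable: the step can be retraced
    ("A", "C"),                           # irrevocable: no way back from C
    ("A", "D"), ("D", "E"), ("E", "A"),   # return possible, but not by retracing
}


def successors(lines, point):
    """Points that can be reached from `point` in one directed step."""
    return {v for u, v in lines if u == point}


def reachable(lines, start, goal):
    """True if some directed path leads from start to goal."""
    seen, queue = {start}, deque([start])
    while queue:
        u = queue.popleft()
        if u == goal:
            return True
        for v in successors(lines, u):
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return False


def can_retrace(lines, u, v):
    """True if the step from u to v can be undone by the reverse step."""
    return (u, v) in lines and (v, u) in lines
```

With these lines, `can_retrace` separates the A-B step from the A-C and A-D steps, while `reachable` separates C (no return to A) from D (return by a different route).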

* It should be noted that there are other ways in which this difficulty can be
overcome. For example, it can be shown that one can color a map drawn on a
torus with seven or fewer colors. Or, if one allows regions of a planar map to have
a boundary point (rather than arc) in common, then an infinite number of regions
can touch one another. Neither of these solutions, however, seems as satisfactory
as employing linear graphs.

4. A recent development in the utilization of linear graphs for representing
psychological situations has been to designate the “sign” of each
line of a graph as either plus or minus. Thus, the fact that A likes B is 
represented by a directed line from A to B of positive sign, that A dis- 
likes C by a directed line from A to C of negative sign, and that A is 
indifferent to D by the absence of any line from A to D. Although the 
full ramifications of signed graphs for representing psychological phe- 
nomena have not yet been explored, it does seem that by designating the 
sign of lines one will be better able to handle such polarities as like- 
dislike or facilitation-inhibition. Cartwright and Harary [28] have re- 
cently been able to state Heider’s theory of balance [71] in terms of 
signed graphs and to show how the theory can thereby be generalized to 
situations not originally encompassed in the theory. 
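
A signed graph is commonly called balanced when every cycle in it contains an even number of negative lines. A standard equivalent test, used in the sketch below, is to ask whether the points can be split into two camps so that every positive line stays within a camp and every negative line runs between camps. The code is an illustration under simplifying assumptions: lines are treated as undirected (the directed case is not handled), and the names `is_balanced` and `triangle` are hypothetical rather than taken from Cartwright and Harary.

```python
from collections import deque


def is_balanced(points, signed_lines):
    """signed_lines maps frozenset({u, v}) to +1 (like) or -1 (dislike)."""
    neighbors = {p: [] for p in points}
    for line, sign in signed_lines.items():
        u, v = tuple(line)
        neighbors[u].append((v, sign))
        neighbors[v].append((u, sign))

    camp = {}  # point -> 0 or 1
    for root in points:
        if root in camp:
            continue
        camp[root] = 0
        queue = deque([root])
        while queue:
            u = queue.popleft()
            for v, sign in neighbors[u]:
                # A positive line keeps v in u's camp; a negative line flips it.
                expected = camp[u] if sign > 0 else 1 - camp[u]
                if v not in camp:
                    camp[v] = expected
                    queue.append(v)
                elif camp[v] != expected:
                    return False  # some cycle has an odd number of negative lines
    return True


def triangle(ab, bc, ca):
    """Signed triangle on the points A, B, C."""
    return {frozenset("AB"): ab, frozenset("BC"): bc, frozenset("CA"): ca}
```

For three persons this reproduces the familiar cases: three positive relations, or one positive and two negative, are balanced; two positive and one negative, or three negative, are not. The same function applies unchanged to structures of any size, which is one way of seeing how a graph-theoretic statement generalizes beyond triads.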

5. In discussing how to determine whether two environmental re- 
gions of the life space have a common boundary, Lewin employed the 
criterion of “possibility of direct locomotion from one to the other.” If 
it is possible to go directly from A to B, then regions A and B have a 
common boundary (or, in graph theory, points A and B are joined by a 
line). If it is impossible to go directly from the one to the other, then 
there is no common boundary. It would appear, in the light of current 
psychological thought, that a better conception might envisage a con- 
tinuous gradation from the “impossible” to the “certainly possible.” The 
connectedness of two parts of the psychological environment would not 
then be viewed dichotomously but as a continuous variable which could 
be stated in terms of probability. The graph theoretical concept of 
“strength,” outlined by Harary and Norman [67, pp. 31-32], would 
seem to lend itself to representing such probabilities. A complete assess- 
ment of this approach, however, has not yet been completed. 
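
The contrast between a dichotomous and a graded view of connectedness can be illustrated with a small sketch. It should be read with caution: it does not reproduce Harary and Norman's definition of “strength,” which is not spelled out here. It simply assumes that each directed step between regions carries a probability of direct locomotion and, as a further assumption, that successive steps are independent. All values and names are hypothetical.

```python
import math

# Hypothetical probabilities of direct locomotion between regions.
LOCOMOTION_PROB = {
    ("A", "B"): 0.9,
    ("B", "C"): 0.5,
    ("A", "C"): 0.0,   # the "impossible" case: no common boundary
}


def has_common_boundary(prob, u, v, threshold=0.0):
    """Dichotomous reading: a boundary exists if locomotion is at all possible."""
    return prob.get((u, v), 0.0) > threshold


def path_probability(prob, path):
    """Probability of completing every step of a path, assuming independence."""
    return math.prod(prob.get(step, 0.0) for step in zip(path, path[1:]))
```

Setting `threshold` to zero recovers the all-or-none criterion of “possibility of direct locomotion,” while the probabilities themselves keep the gradation that the dichotomy discards.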

Summary. The field theorist believes that the question of how to 
design techniques of measurement and quantification must be answered 
in accordance with the nature of the theoretical concepts employed. In 
particular, it should not be assumed on a priori grounds that all
psychological phenomena can be coordinated to continuous variables. Lewin
maintained that many constructs in psychology would have to be non- 
quantitative and that their corresponding operational indicators would 
also be nonquantitative. This view does not hold that psychology should 
not employ mathematics in dealing with such phenomena. On the con- 
trary, Lewin was one of the first to show that psychologists might fruit- 
fully use branches of mathematics other than the calculus, algebra, and 


cepts of topology, and more recently those of linear graphs, have been 
employed in constructing the basic concepts of field theory. These, it
should be noted, do not preclude the use of quantitative concepts, like
those of distance or strength of force, in the same system. In fact, both
kinds should be used but they should be carefully distinguished, especially
in designing operational indicators of the concepts. Scientific rigor will be
more readily achieved in this way than by attempting to force all data 
to fit such assumptions as those of infinite divisibility and continuous 
gradation. 


FORMAL ORGANIZATION OF THE SYSTEM 


Since Lewinian field theory does not consist of a single formalized 
system of concepts and hypotheses, it will not be possible in this section
to discuss the formal organization of field theory as if it were such a
system. It should be noted that several “miniature” formalized systems
have been developed by field theorists to deal with some limited set of 
phenomena. Among the more explicitly formulated of these are the 
derivations of Lewin [95, pp. 9-20] concerning the concept of tension, 
the quantitative theory of decision-time proposed by Cartwright and 
Festinger [27], the formulation of Heider’s concept of balance advanced 
by Cartwright and Harary [28], and the formal theory of social power 
developed by French [60]. In addition there are, of course, many less 
formalized theoretical treatments of specific empirical data which em- 
ploy Lewinian concepts. Although it would be instructive to examine 
the formal organization of any of these systems, no one of them can be 
taken as “the” statement of field theory. And while it is likely that these 
various “miniature” systems can eventually be combined into a more 
inclusive one, such an integration has not yet been accomplished.

For these reasons I have decided in this section to discuss certain 
problems involved in a rigorous formalization of Lewin’s most basic constructs. 
In doing so I shall concentrate on the constructs: life space, 
person, environment, and behavior. 

The concept of life space. To the historian of psychology Lewin’s use 
of the concept, life space, should pose some interesting problems concern- 
ing the relations between rigorous and creative thinking. The idea of life 
space, as noted above, was fundamental to all of Lewin’s theorizing; 
without it the other parts of his conceptual system would be virtually 
meaningless. By using it he was able to proceed in a most creative way to 
conduct a psychological analysis of a vast array of phenomena which had 
previously defied any sort of effective analysis. Even his most intuitive, 
or least formal, thinking proceeded with the life space as a basic 
assumption. But in spite of the central significance of this concept, a 
close scrutiny of its properties reveals many ambiguities. These are, as 
might be expected, more pronounced in Lewin’s earlier discussions but 
they were never completely eliminated. In a sense the life space was a 
“programmatic” concept; it served to indicate an approach, to designate 
a way of thinking about psychological phenomena. 

Although the term, life space, appears throughout Lewin’s writing, 
it is rarely given explicit definition. The essential idea can be grasped, 
however, from the following concise statements. 


Life space. Totality of facts which determine the behavior (B) of an 
individual at a certain moment. The life space (L) represents the totality 
of possible events. The life space includes the person (P) and the environment 
(E). B = f(L) = f(P, E). It can be represented by a finitely structured 
space [91, p. 216]. 


The critical terms in grasping the meaning of life space, then, are 
person, environment, and behavior. And much of the difficulty in under- 
standing Lewin’s use of life space lies in the meaning to be attributed 
to these concepts. An examination of Lewin’s formal definitions will 
reveal that each is defined in terms of the others. This procedure is 
essential for the construction of a coherent, logical system, but it leaves 
unsettled the question of how these concepts are related to empirical data. 
Can one, for instance, coordinate “person” to “biological organism,” 
“environment” to “patterns of physical stimuli,” and “behavior” to 
“patterns of motor responses”? The answer is obviously “no” in each 
of these cases, but there is difficulty in arriving at a satisfactory positive 
answer. 

Person. Terms like “person,” “ego,” or “self” have been among the 
most troublesome for psychologists. Lewin, like others, did not always 
maintain a single meaning for the word “person” in his various writings 
over the years. Sometimes he appeared to speak of the person as the 
entity whose behavior is to be studied. In the quotation cited immediately 
above, the word “individual” was used in this sense. Some such term 
is needed to denote the specific point of reference being used in constructing 
any given life space, to indicate whose life space is under consideration. 
There can be little doubt, however, that Lewin intended to 
restrict the term “person” to refer to certain parts of the life space, i.e., 
to certain parts of the totality of facts which determine the behavior of a 
given individual. For some purposes it is convenient to represent the 
person as an undifferentiated point and merely to deal with its position 
in the environment of the life space. For other purposes, it is necessary 
to treat the person as differentiated into parts. In either case, the person 
and environment are in interaction, together determining events in the 
life space. As noted in an earlier section, the state of tension associated 
with particular regions within the person and certain properties of 
regions of the environment jointly determine the distribution of forces 
in the environment, and distance of the person from the goal region 
partially determines the magnitude of forces acting on the person. 

Environment. It is clear that Lewin did not intend for the term, 
environment, to refer to the physical world in which an organism is 
situated nor even to the proximal stimuli which may impinge upon the 
organism’s receptors. The environment, or the psychological environment 
as Lewin sometimes called it, refers to the world as it exists for the in- 
dividual being studied. 


Objectivity in psychology demands representing the field correctly as it 
exists for the individual in question at that particular time. For this field 
the child’s friendships, conscious and “unconscious” goals, dreams, ideals, 
and fears are at least as essential as any physical setting. Since this field is 
different for every age and for every individual, the situation as character- 
ized by physics or sociology, which is the same for everybody, cannot be 
substituted for it [95, p. 240]. 

The food that lies behind doors at the end of a maze so that neither 
smell nor sight can reach it is not part of the life space of the animal. If 
the individual knows that food lies there this knowledge, of course, has to 
be represented in his life space, because this knowledge affects behavior. It 
is also necessary to take into account the subjective probability with which 
the individual views the present or future state of affairs because the degree 
of certainty of expectation also influences his behavior. 

The principle of representing within the life space all that affects be- 
havior at that time, but nothing else, prevents the inclusion of physical 
food which is not perceived. This food cannot possibly influence his be- 
havior at that time under the conditions mentioned. Indeed, the individual 
will start his journey if he thinks the food is there even if it is actually not 
there, and he will not move toward the food which actually is at the end of 
the maze if he does not know it is there [95, pp. 57-58]. 


The insistence that a person’s psychological environment contains 
only those facts which exist for him led Lewin sometimes to speak as if 
he equated the psychological environment with the phenomenal field, 
and there are those who have classified Lewin as a phenomenologist. 
Such an interpretation is given by F. H. Allport as shown in the following 
quotations: 


It was the original intent of Lewin, or so it seems to the present writer, 
to develop a field theory of a consistently phenomenological sort. . . . 

It must also be remembered that it is only the individual’s phenomeno- 
logical experience of objects and of himself among them that make up the 
content of the field. . . . The field can be described only in subjective 
terms [4, pp. 149-150]. 


A similar impression might be created by the discussion of Spence 
[124] in which he equates the approach of Lewin with those of Köhler, 
Koffka, and Snygg. More specifically, Spence makes the following statement. 


The methods of determining the structure and properties of these 
fields whether “brain field,” “behavioral environment” or “life space” are, 
however, essentially the same, and . . . involve extensive use of the phe- 
nomenological type of introspection [124, p. 54]. 


Such characterizations of Lewin’s point of view are, in my opinion, 
not strictly correct. Although Lewin held that phenomenal properties 
must be included in the psychological environment and although he 
favored use of subjects’ accounts of the world as they experience it, he 
did not equate the psychological environment with the phenomenal 
world. His basic criterion was whether any given fact had an effect on 
behavior, not whether it appeared in conscious awareness. He explicitly 
asserted that the life space should include within it “unconscious” de- 
terminants of behavior. Leeper [86] has presented a point of view entirely 
consistent with that of Lewin, it seems to me, in his excellent discussion 
of cognitive processes. 


The question is, therefore, whether we should define cognitive processes, 
perception, thinking, concept formation and the like in terms of conscious 
processes exclusively, or whether we should say that consciousness may be 
present or absent, as the case may be, and that all these processes are to be 
defined in terms of their other functional relations [86, p. 735]. 

It seems better, therefore, to avoid narrow definitions and to say that 
cognitive processes include all the means whereby the individual represents 
anything to himself or uses these representations as a means of guiding 
his behavior [86, p. 736]. 


Behavior. Just what empirical data Lewin intended to coordinate 
to the term, behavior, is more difficult to determine. The problem here 
is not a methodological one, for in the conduct of empirical research 
there is no doubt that Lewin employed all of the standard behavioral in- 
dicators. He “tested” his theories with such data as number of items 
recalled, reaction time, verbal statements of expectations, frequency of 
aggressive actions, speed of performance, and compliance with requests. 
And yet the exact way in which these indicators were related to the 
theoretical concept, behavior, was never entirely clear. At first glance, 
Lewin seems to provide a quite definite linkage by asserting that be- 
havior is coordinated to locomotion of the person in the life space. Thus, 
if in a decision situation the person is subjected to conflicting forces 
directed toward two different regions of the environment (e.g., “going 
to the movies” and “going to a concert”) and if the resultant force is 
in the direction of one of these, then the person will engage in the 
corresponding activity. But as Leeper [85, p. 116] has noted, since the 
position of the person in the psychological environment both before and 
after locomotion is defined “as it exists for the individual,” all that can 
be derived strictly from a locomotion in the psychological environment 
is a change of the situation as it exists for the individual. Whether or not 
there will be a particular motor performance is another matter. Since 
there is ordinarily a fairly close correspondence (at least in the typical 
laboratory experiment) between the subject’s cognitive representation of 
any action he takes and the experimenter’s description of it, Lewin’s 
failure to make an explicit logical step from locomotion to motor per- 
formance caused little practical difficulty. Nevertheless, Lewin acknowl- 
edged the cogency of Leeper’s criticism, believed it to be quite important, 
and made several attempts to modify his system (see below). 

With this interpretation of the component terms of the concept, life 
space, it may be instructive to consider the statement made by Brunswik 
[20] that the life space is “post-perceptual and pre-behavioral.” It seems 
to me that Brunswik pointed here to an important issue concerning 
Lewin’s use of the term, life space. This statement can be accepted, how- 
ever, only by ignoring certain features of Lewin’s theorizing, the modi- 
fications he suggested in response to Brunswik’s characterization, and the 
actual procedures he employed in conducting research. Nevertheless, I 
believe that Brunswik was correct in so far as he referred to the more 
explicitly developed parts of Lewin’s treatment. Considering only these 
parts, it appears that the life space is a resultant of certain processes 
(perceptual) and behavior is a change of the life space (rather than 
motor responses). The formula, B = f(L), relates changes in the life 
space to properties of the life space; B is not to be equated with motor 
response, and L is not the same as stimulus or pattern of stimuli. 

It is perfectly clear, however, that Lewin could not accept this 
characterization of the life space as a satisfactory one. It is clear, too, 
that contemporary field theorists reject it. Although a rigorous formula- 
tion of an alternative conceptualization of the life space has not been 
achieved, the outlines can be indicated. Even in Lewin’s earliest writings 
there are terms which served to indicate the problem, though they did 
little to solve it. Some of these were: alien factors, foreign hull of the life 
space, and motor-perceptual stratum. In responding to Brunswik, Lewin 
attempted to outline the way in which he would treat the life space, the 
physical and social world, and the relations between these. 


Within the realm of facts existing at a given time one can distinguish 
three areas in which changes are or might be of interest to psychology: 
1. The “life space”; i.e., the person and the psychological environment 
as it exists for him. We usually have this field in mind if we refer to needs, 
motivation, mood, goals, anxiety, ideals. 

2. A multitude of processes in the physical or social world, which do not 
affect the life space of the individual at that time. 

3. A “boundary zone” of the life space: certain parts of the physical or 
social world do affect the state of the life space at that time. The process 
of perception, for instance, is intimately linked with this boundary zone be- 
cause what is perceived is partly determined by the physical “stimuli”; i.e., 
that part of the physical world which affects the sensory organs at that time. 
Another process located in the boundary zone is the “execution” of an 
action [95, p. 57]. 


The boundary zone of the life space, then, is the locus of perceptual 
(psychophysical) processes and of motor performances. These are located 
in a boundary zone because both perception and motor performance are 
influenced both by the life space and by the physical (and social) world 
external to the life space, and because influences originating inside the 
life space must pass through this boundary to affect the outside, and vice 
versa. This conception is obviously only a first step toward stating the 
problems involved in relating the life space to the physical and social 
world, yet it does provide a basis for integrating the many diverse fields 
of investigation to which psychologists have addressed themselves. 

A suggested formulation. In my own thinking about these problems I 
have found it helpful to employ a schematic diagram of the various classes 
of facts and functional relations with which the psychologist is concerned. 
This diagram is presented in Fig. 3. Each circle stands for a given class 
of events, and each line stands for a given kind of interdependence. The 


environment; S indicates the sensorium, or that part of the boundary 


"Originally, Lewin placed the “motor-perceptual stratum” at the boundary of 
the person rather than th 


physical-social world which have no influence upon the life space at a 
given time. 

P and E (person and environment) make up the core, or the 
“classical parts” of the life space, but it is clear that the boundary zone, 
containing S and M (sensorium and motorium), should also be con- 
sidered a part of the life space. The solid lines connecting these circles 
to one another are intended to indicate that all these facts are interdependent 
and can be handled by a single system of commensurate concepts. 
Perceptual processes, for example, are located in S and are both influenced 
by and influence motivational states of P and cognitive states 
of E. 

The dotted lines connecting H (the hull) with S and M refer to 
those processes by which the physical-social world affects the sensorium 
of the life space and by which the motorium exerts effects upon the 
physical-social world. H and A (alien facts) together make up the 
“objective” world of physical and social facts, and the dashed line 
connecting them indicates that nonpsychological laws govern their relations. 
It should be apparent, however, that this representation of the 
“objective” world is an oversimplification in that it treats physical and social 
facts homogeneously. 


Fig. 3. A schematic diagram of the facts and relations involved 
in the life space, the boundary zone, and the physical and social 
world. For a detailed explanation of the symbols see text. 
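
As a reading aid, the relations described for Fig. 3 can be written out as a small labelled graph. The sketch below is an illustration only. It assumes, from the verbal description, that every pair among P, E, S, and M is joined by a solid line, that H is joined to S and to M by dotted lines, and that H is joined to A by a dashed line. The names `EDGES`, `neighbors`, and `line_type` are hypothetical.

```python
from itertools import combinations

# Classes of facts named in the text (symbols as in Fig. 3).
LIFE_SPACE = {"P", "E", "S", "M"}   # core (P, E) plus boundary zone (S, M)
OBJECTIVE_WORLD = {"H", "A"}        # foreign hull and alien facts

# Assumption: every pair inside the life space is joined by a solid line.
EDGES = {frozenset(pair): "solid"
         for pair in combinations(sorted(LIFE_SPACE), 2)}
EDGES[frozenset({"H", "S"})] = "dotted"  # physical-social world affects sensorium
EDGES[frozenset({"M", "H"})] = "dotted"  # motorium acts on physical-social world
EDGES[frozenset({"H", "A"})] = "dashed"  # governed by nonpsychological laws


def neighbors(node):
    """Return the classes of facts directly linked to `node`."""
    return {other for edge in EDGES if node in edge for other in edge - {node}}


def line_type(a, b):
    """Return 'solid', 'dotted', 'dashed', or None if no line joins a and b."""
    return EDGES.get(frozenset({a, b}))
```

On this reading, `neighbors("A")` contains only H, so alien facts have no direct line to any part of the life space, and H touches the life space only through S and M.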

In order to illustrate the way in which the diagram may be used, 
let us consider the theory of decision-time advanced by Cartwright and 
Festinger [27]. This theory attempts to account for certain observed 
relations between (1) the degree of similarity of pairs of stimuli, measured 
in physical terms, and (2) decision-time, defined as the number of 
milliseconds elapsing between presentation of the stimuli and the record- 
ing of a response. In terms of Fig. 3, the experimental manipulations 
and the measurement of responses occur in H. The sounds produced by 
the experimenter in reading the instructions also occur in H, but the 
processes of “listening to the instructions,” “understanding them,” and 
“abiding by them” take place in the parts of the diagram labeled S, M, 
P, and E. Such factors as a “set toward speed or accuracy” are also 
located in these parts of the diagram and are represented in the theory 
as forces acting in the environment upon the person. The theory asserts, 
in essence, that when a properly prepared subject confronts a given pair 
of stimuli a certain constellation of forces is set up in the environment, 
that the time required for the “resolution” of the conflicting forces 
depends upon the nature of the constellation of forces, and that an ob- 
servable “decision” will not occur until the conflict has been resolved. 
It is of particular interest here to note that the quantitative expression of 
this theory requires the specification of two constants, unique for each 
subject, one indicating the subject’s basic “reaction time” (a property of 
the system H, S, M) and the other providing a conversion from 
“psychological time” (a property of the system P, E) to “physical time” 
(a property of H). Although the “heart” of the theory is concerned with 
processes occurring in the life space, a complete account of the experi- 
ment must describe events occurring in the entire system consisting of 
H, S, P, E, and M. It is only by taking for granted a whole host of things 
—that the subject is not blind or deaf, that he can understand the in- 
structions, that he is cooperative, that he is capable of verbal responses, 
etc.—that the theory can account for the findings. 
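
The role of the two subject constants can be sketched numerically. The linear form used below is an assumption made only for illustration: the text states that one constant is a basic reaction time and that the other converts psychological time to physical time, but it does not give the equation. The function and parameter names are hypothetical.

```python
def observed_decision_time(psych_time, reaction_time_ms, ms_per_psych_unit):
    """Physical decision-time in milliseconds under an assumed linear conversion.

    psych_time        -- time for resolution of the conflict, in psychological
                         units (a property of the system P, E)
    reaction_time_ms  -- subject's basic reaction time (system H, S, M)
    ms_per_psych_unit -- subject's conversion from psychological to physical time
    """
    if psych_time < 0:
        raise ValueError("psychological time cannot be negative")
    return reaction_time_ms + ms_per_psych_unit * psych_time
```

Under these assumptions, two subjects facing the same constellation of forces (the same `psych_time`) can still produce different recorded times, because both constants are specific to the subject.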

The diagram of Fig. 3 can be employed in a similar fashion to reveal 
the conceptual locus of the variables contained in any of the field 
theoretical investigations. I am convinced that such an analysis helps 
greatly to reveal where conceptual clarification is needed and to point 
to fruitful problems for further investigations. Without attempting a 
detailed analysis of any single study, let us now examine some of the 
general problems which are suggested by this diagram. 

Psychological ecology. Toward the end of his life, Lewin was greatly 
interested in a set of problems which he referred to as “psychological 
ecology.” The task of psychological ecology is, according to Lewin, to 
discover “what part of the physical or social world will determine during 
a given period the ‘boundary zone’ of the life space” [95, p. 59]. In 
terms of our diagram this task is one of describing H. Lewin’s own work 
in this area was confined largely to his “channel theory of food habits.” 
This “theory” consists of an analysis of how a given item of food happens 
to arrive on the dinner table and involves consideration of such factors 
as agricultural technology, the economics of food, the culture of the 
population, and the behavior of “gatekeepers.” Lewin was forced to 
develop such a theory when he realized that the answer to the question, 
“Why do people eat what they eat?” is “Because that particular food is 
placed on the table in front of them.” We shall not consider the details 
of this theory here, but it is important to recognize that a proper 
description of H, the foreign hull of the life space, is an essential 
task of psychology. 

This task is a difficult one, without established procedures or concepts, 
and one to which psychologists are not particularly accustomed. It in- 
volves the thorniest problems of psychophysics, in so far as one is dealing 
with the physical world, as Brunswik [21] has so brilliantly shown. And, 
related to the social world, it raises the most perplexing problems of linking 
psychology to sociology, economics, and political science. The 
description of the physical world given by psychological ecology cannot 
be identical with that given by a physicist (even though this world 
obeys the laws of physics). Lewin’s analysis of “how food gets on the 
table” is obviously not that of a physicist. An ecological analysis must 
extract those features of the “objective” world which are significant as 
determiners of psychological events. (This is the meaning of the lines 
connecting H to S and M.) 

One suggestive approach toward solving the problem of how to 
describe the psychologically relevant aspects of the physical and social 
world has been advanced by Chein [32]. As a general starting point he 
suggests classifying these features under five headings: (1) stimuli— 
anything capable of initiating a change of activity, (2) goal objects and 
noxicants—anything which can provide satisfaction or produce pain, (3) 
supports and constraints—anything which makes particular behaviors 
easier or more difficult, (4) directors—anything which tends to induce 
particular directions of behavior, and (5) global features—such prop- 
erties as stability or degree of organization. 

The whole field of “social perception” is, of course, inextricably in- 
volved in this problem. How does one define the “stimulus properties” 
of a social object? Surely not in the language of physics. The most 
common technique is to compare test responses of the stimulus person 
with attributions made by the perceiver [64], but this methodological 
finesse hardly solves the problem. There have been several clever ex- 
perimental attempts to produce social variables in the laboratory and 
to treat these as “stimuli” for social perception. Notable among these are 
the studies of social power conducted by Pepitone [117] and Thibaut 
and Riecken [126]. Although these manipulations are intuitively convincing, 
they are not derived from any clear system of descriptive concepts. 

In many ways, the most impressive attack on the general problem of 
psychological ecology is that carried out by Barker and Wright [13] 
in their extensive study of the behavior of children in the town of Midwest. 
Basic to their analysis are two extra-individual, ecological behavior 
units: behavior setting and behavior objects. 


A behavior setting is a standing behavior pattern together with the 
context of this behavior, including the part of the milieu to which the behavior 
is attached and with which it has a synomorphic relationship . . . [i.e., one 


4. Comprehensiveness. Behavior settings blanket the community. We 
estimate that in Midwest 95 per cent of all behavior occurs within the 
behavior settings we have identified. 

5. Variety of attributes. Behavior settings have many discriminable 
behavioral and nonbehavioral characteristics. They are rich material for 


7. Theoretical position. Behavior settings are present empirical facts. 
They can be demonstrated to the hardest empiricist. However, they are 
theoretically promising. Their internal structure and dynamics and their 
external relations suggest that they can be profitably conceptualized and 
incorporated into productive theory [13, pp. 9-10]. 


Behavior objects are such things as dolls, books, ladders, and tooth- 
brushes. They are differentiated from behavior settings on two grounds. 


as he does in the case of a store and a picnic. . . . Second, behavior objects 
are, themselves, located within behavior settings [13, p. 10]. 


at targets. In this little-known study, Voigt had subjects shoot a light- 
pistol, without taking aim, at targets of various kinds and at varying 
distances. He found, among other things, that the angular precision of 
shooting is related to distance in a way indicating that steering is guided 
by perceived size of target (influenced by size constancy) rather than by 
retinal size of the image. He was also able to show that steering of the 
motorium is a joint function of the perceptual field and properties of the 
motorium. The conceptual locus of this investigation, in terms of our 
diagram, is in the area containing S, P, E, and M. More specifically, the 
problem under investigation is one of determining how the motorium is 
steered by the sensorium through the intervention of the person and environment. 
Though this investigation was concerned with the problem 
of how the individual produces events in the physical world, it may 
serve as a paradigm for stating the problem for any type of “overt” 
behavior. 

Lewin’s discussion of how the person and psychological environment 
influence the motorium was essentially preliminary and programmatic. 
The following quotation summarizes his general view of the nature of the 
problem. 


There seem to be at least two rather different relations between the 
motoric and the inner-personal region of the person. The one is characterized 
by the interrelations between the tensions of these regions (such as 
spreading of tension). This type of interrelation can be logically derived 
from the concept of tension. Besides, the motoric can have the position of a 
tool for the inner-personal systems. In other words, between the inner-personal 
region and the motoric there exists the relation of an inducing 
thing to an induced field, or of thing and medium. The second type of interrelation 
seems to increase in importance from infancy to adulthood [92, 
pp. 105-106]. 


This general statement of the relations between the motorium and the 
rest of the life space focuses attention on three theoretical problems requiring 
further investigation. 

1. What properties does the motorium have? Lewin's own thinking 
about this question was greatly influenced by the distinction made by 
Heider [69] between “thing” and “medium.” The motorium is a 
medium through which influences from the person or psychological 
environment are exerted upon the physical or social world. For certain 


The relations between M and H in this experiment were rather rigidly fixed 
by using a light-pistol, thus producing a one-to-one correspondence between 
the [aiming] of the pistol and the effect produced on the target. Had the experiment 
been concerned with something like dart-throwing or bombing, the results as measured 
at the target would also have been influenced by physical conditions intervening 
between “aiming” and “hitting.” 


purposes, in dealing with transactions with the physical world (walking, 
performing a motor task, driving a car, etc.), the motorium may be con- 
sidered to be the muscular apparatus of the organism. Even in such 
cases, however, it may prove to be useful to consider tools as a part of the 
motorium. The psychological significance of certain physical handicaps 
can be properly understood only by examining the way in which the 
motorium serves or fails to serve the individual in the execution of his 
needs. The problem of how to characterize the motorium when dealing 
with social actions is even more difficult. It is clear, however, that 
properties of the person and his psychological environment do somehow 
get converted into events in the social world and that individuals vary 
greatly in their social skills. An individual’s social behavior may be 
flexible or rigid in much the same sense that his physical behavior may. 

The formal properties of the motorium have not been systematically 
specified. Lewin, thinking of the motorium as a medium, suggested that 
it be considered as consisting of differentiated parts. To be effective 
the motorium needs to be highly differentiated into parts which can 
“act” independently of one another. The difference, for example, between 
a clumsy and a dexterous performance may lie partially in the 
capacity of fine parts of the motorium to be steered independently of one 
another. Other properties of the motorium, obviously, need to be 
specified. 

2. How do tensions in the person affect the motorium? It appears 
from research like that of Dembo [37] on anger and Barker, Dembo, 
in a “head-tool” relation to the motorium. An exploitation of this approach 
to the problem should illuminate such matters as the “integration” 
or “hierarchical organization” of overt behavior. 

Concentration on the relations among person, environment, and 
motorium brings certain psychological problems into greater prominence. 
In my own research [24] on the purchase of government bonds during 
World War II, for example, it became clear that a specification of the 
cognitions and motivations of people was not sufficient to predict with 
much success whether or not they would buy bonds during a given 
period of time. If, however, these data were considered in conjunction 
with two other kinds of information, rather good predictions could be 
made. The additional information needed concerned: (1) whether or 
not the individual was placed in a situation where he had to make 
a decision to buy or not to buy, and (2) how much money he possessed. 
The first factor, that of forcing a decision, served to place the motorium 
under the control of a motivational system (P and E). The second factor, 
possessing money, is more difficult to conceptualize, but it points to the 
fact that the motorium requires resources of particular kinds for the 
execution of particular overt actions. 

Problems of “skill” are also brought to the fore by considering the 
motorium. Schools of psychotherapy which concentrate exclusively upon 
problems of motivation seem to be overlooking an important aspect of 
the ways in which individuals cope with the physical and social world. 
Just as older notions of “retraining” were one-sided, so too are modern 
views that therapy consists of “straightening out motivation.” Freudian 
proponents of “ego” functions, like Redl [118], would seem to have a 
more balanced view of the total problem. Similar comments are appro- 
priate concerning the field of “human relations training.” Here such 
techniques of “skill training” as represented in certain uses of role play- 
ing [104] are just as important as efforts at modifying the motivation or 
values of those being trained. 

Circular processes involving the system: P, E-M-H-S-P, E. A careful 
examination of Lewin’s theories of motivation makes it clear that he 
could not escape the problem of how processes within the life space relate 
to the foreign hull of the life space. This conclusion is perhaps dramatized 
most vividly in his treatment of the distinction between substitute valence 
and substitute value which was forced by the experimental findings of 
Dembo [37], Lissner [106], and Mahler [108]. Here it was found that 
when the achievement of a goal is blocked certain “substitute activities” 
acquired valence. Not all of these substitutes, however, served to reduce 
the original tension system (i.e., to have substitute value). It would 
appear that the phenomenon of “substitute valence” can be handled 
adequately within the system made up of P and E. Ordinarily, however, 
a tension system will not be released unless certain events take place in 
M which, in turn, produce certain changes in H. Furthermore, tension 
will still not be released unless these changes in H produce effects in S 
and consequently in E. It is possible, of course, for there to be “autistic” 
need satisfaction, but such instances must be distinguished from “realistic” 
gratification. It is only through the operation of such a circular system 
involving the life space, the boundary zone, and the foreign hull that 
an individual can “keep in touch with reality.” 

Lewin’s concept of barrier, or restraining force, can be refined, too, 
by a careful analysis of all of the factors represented in Fig. 3. The formal 
definition of barrier as that which offers resistance to locomotion appears 
to refer exclusively to the psychological environment since these re- 
sistances exert influences opposite to the driving forces which clearly 
exist in E. But Leeper has pointed out certain ambiguities in Lewin’s 
various discussions of barriers, ambiguities incidentally which Lewin 
readily acknowledged. 


Lewin’s concept of barriers has been confusing because it has not been 
made to conform consistently to one or the other of two types of factors, 
both of which must be recognized but distinguished. His basic concept of 
barriers, as factors that give rise to restraining forces that operate only to 
impede actual locomotion through the region, is suited only for the de- 
scription of what might better be called objective obstructions (regardless 
of whether they are outside the person or obstructions of the type of im- 
perfect skills, poor vision, or other limitations of the motoric regions). Such 
objective obstructions operate regardless of whether the person psycholog- 
ically appreciates their existence. Their operation must be discounted before 
data on speed of locomotion, for example, may be taken as indicating the 
strengths of driving forces. But, in addition to this concept of objective 
obstructions we need also a concept of barriers as regions of the psycholog- 
ical environment. Barriers in this sense may operate by affecting valences, 
by giving rise to restraining forces that can operate at a distance, and by 
screening out the driving forces related to more remote regions. In determining 
the behavior of the moment, what chiefly must be considered is 
barrier-regions as defined in this second sense [85, pp. 107-108]. 

With the aid of the diagram of Fig. 3, it seems to me that the im- 
port of Leeper’s criticism can be better appreciated and that a slightly 
different solution to the problem may be advanced. It is clear that 
obstacles to the objective achievement of some goal may reside in S (e.g., 
perceptual insensitivities), in M (e.g., inadequate skills), or in H (e.g., 
an object is too heavy to lift or a law prohibits a foreigner from becoming 
President of the United States). It is clear, too, that these obstacles may 
prevent an individual from reaching his goal even though he did not 
know that they were present (i.e., they were not present in the psychological 
environment). It is also clear that barriers may exist in E 
which have no valid counterpart in S, M, or H (e.g., the person believes 
erroneously that he cannot do something). But it is more usually the case 
that the individual makes use of the entire circular system composed of 
P, E, S, M, and H to keep in touch with reality with respect to barriers 
just as he does regarding sources of gratification. Barriers, it seems to me, 
can be best conceived as restraining forces, and the various distinctions 
required can best be represented by designating in each instance the 
“source” of the force and its point of application. Depending on the 
circumstances of the particular case, restraining forces may exist in E 
with sources in S, M, or H, and the way in which they can be overcome 
will depend upon the nature of the source. Restraining forces may also 
exist in S, M, or H and will have different effects in each instance. 

There are a host of social psychological theories which refer at least 
implicitly to the circular system made up of life space, boundary zone, 
and social world. Most of these theories invoke a general notion of “equi- 
librium” to refer to the fact that certain states of P and E are in some 
sense preferred and from this fact they derive various tendencies toward 
cognitive distortion, toward selective contact with the social world, and 
toward certain kinds of social communication. Thus, Newcomb [115] 
postulates a strain toward symmetry in the attitudes of two or more 
people in social interaction and derives from this various predictions con- 
cerning selective communication and pressures to change opinions. The 
theory of social communication advanced by Festinger [49] specifies how 
the need for “social reality” leads to pressures to change one’s own 
opinions or those of others. His forthcoming theory of “cognitive dissonance” 
will indicate further how certain cognitive properties lead an individual 
to seek out social interactions of a certain kind. Newcomb [114] 
has also developed a theory of “autistic hostility” to account for the 
tendency of people to avoid interaction with others toward whom they 
have hostile feelings, a tendency documented by Festinger and Kelley 
[51] and by others. All these theories are concerned, in one way or 
another, with the way in which properties of P and E influence M, the 
way in which M influences H, and the way in which changes produced 
in H influence S and eventually P and E. Most of these theories, however, 
“short-circuit” one or more of the steps involved by assuming one-to-one 
correspondence between each end of the link (e.g., by assuming 
veridicality of perception). Perhaps their future elaboration will examine 
more closely the processes involved at each step in the circular 
system. 

Lewin’s emphasis upon the person and psychological environment has 
helped to counteract a tendency among some theorists to speak as if be- 
havior were determined entirely by the system made up of H-S-M-H. 
Although it is apparent that some processes do involve primarily this 
system, such “reflexes,” which depend only slightly upon properties of 
P or E (i.e., upon motivational states and cognitive structures), con- 
stitute only a small part of the phenomena of interest to psychologists. 
Studies like those on the influence of needs and values on perception 
[19] and those pursuing such distinctions as that drawn by Gelb and 
Goldstein [65] between abstract and concrete “mindedness” are con- 
cerned with the intervention of properties of P and E in processes taking 
place in H-S-M-H. The task of research is, of course, to determine under 
what conditions P and E do or do not enter into the total process and 
to describe the nature of such interventions. 


CONCLUSIONS* 


This rather general account of Lewinian theory suggests certain 
conclusions about the needs of psychology. If other theoretical orienta- 
tions also point to the same needs, so much the better. It may be hoped 
that field theorists and theorists of other schools will work in future years 
to satisfy these needs. 

Empiricism and formal theory. Contemporary psychological theory 
is in danger of losing touch with empirical reality. The placing of too 
great a value upon formal elegance in the construction of theory may well 
create an insurmountable chasm between the theorist and the psychologist 
who is interested in the naturally occurring behavior of people. Unless 
future interest in “model building” is closely guided by an unbiased 
reference to empirical facts, formal elegance will be purchased at the 
cost of empirical applicability. Psychology requires a balanced respect for 
both empirical facts and rigorous theory. Open-minded description is 
needed to keep psychology continually concerned with its proper subject 
matter, but adequate description cannot be accomplished without formal 
theory. While the traditional emphasis of Lewinian psychologists upon the 
hypothetico-deductive method is still appropriate, one should not forget 
that formal theory is useful in an empirical science only in so far as 
it serves as an aid to description. 


* At this point I omit the following sections contained in the outline of the 
editorial committee: “Scope or range of application of system,” “History of system 
to date in mediating research,” “Evidence for the system,” and “Specific 
methods, concepts or principles of the system believed valuable outside the context 
of the system.” These topics have been included in the previous sections to such 
an extent that further discussion would be excessively repetitious. In this final section, 
I have tried to combine the two suggested topics: “Degree of programmaticity” 
and “Intermediate and long range strategy for the development of the 
system.” 

The need for a proper balance between naive empiricism and 
formalization has implications for the collection of data as well as for 
the building of theory. It might be well for psychologists to take another 
look at their use of the experimental method, which has long been one 
of psychology’s most powerful tools. An experiment, especially when 
coupled with the hypothetico-deductive method, seldom provides in- 
formation other than that “called for” by the experimenter, and its re- 
sults are rarely more significant than the theory which guides the experi- 
menter. Too much of current psychological research, I fear, is designed 
not so much to discover new facts as to confirm some derivation from a 
limited formal theory. It is in the formulation of research problems that 
the major advances of psychology take place. Careful observation, record- 
ing, and measurement of naturally occurring events and of “experiments 
of nature” will for a long time to come be the most important source 
of the significant problems of psychology. For this reason, it would seem 
wise for psychologists to avoid any premature judgment that painstaking 
observation is inferior or antithetical to rigorous theory. Methods of re- 
search, like those employed by such ethologists as Tinbergen [127], 
could well be developed and refined in other branches of psychology. 

Generality of theories. Psychology, like other disciplines, has in recent 
years been subjected to strong pressures toward specialization. Efforts 
to develop rigorous theory have often resulted in a sharp restriction of the 
range of empirical phenomena treated by a given theory. One result has 
been that the boundaries around various fields of psychology have tended 
to become impermeable barriers so that the theoretical constructs of one 
domain have no clear relation to those of another. Field theory, by 
focusing attention upon the concrete manifold of interdependent facts 
influencing behavior, attempts to resist this theoretical narrowness. It is 
the goal of field theorists to develop constructs which will be equally 
applicable to perception, learning, motivation, personality, development, 
social, normal, and abnormal psychology. While it is undoubtedly true 
that attention has been directed to some aspects more than others, field 
theorists still believe that it is possible, in principle, to employ a single 
system of commensurate constructs for dealing with all the facts of 
psychology. This is not to say that the task has been accomplished; 
efforts directed toward the integration of psychological theory are badly 
needed. 

The importance to field theory of the concept, life space, as the total 
situation in which individuals exist has tended progressively to broaden 
interest from problems of motivation and learning to those of social 
psychology. It has become increasingly clear that social relations must 
be taken into account in any open-minded description of the way in 
which behavior is generated and modified. The work of Lewin, Lippitt, 
and White [98] demonstrated that a personal characteristic like “aggressiveness” 
can be basically modified by the nature of groups to which the 
individual belongs. Lewis [100], Deutsch [39], and Horwitz [74] have 
shown how the quality of relations among individuals affects the satis- 
faction of needs and the release of tension. And many of the studies on 
level of aspiration reported by Lewin, Dembo, Festinger, and Sears [97] 
have made it clear how goal setting and the experience of success or 
failure depend upon the social groups to which individuals refer them- 
selves. Significant progress is also being made in putting to empirical test 
theoretical notions like those of Mead [110] concerning social influences 
on the formation of the self. These and other trends all indicate how 
essential it is that social psychology be encompassed in a single theory of 
human behavior. 

A theoretical problem of high priority, if psychology is to deal with 
social phenomena, is how to conceptualize the interdependence and interaction 
of several individuals. One might state this problem as one of 
determining how to “combine” individual life spaces, though I fear that 
this may be a misleading way of putting the matter. In one of his last 
publications Lewin [95, pp. 188-237] undertook to formulate the nature 
of the problem, but he did little more than indicate its major features. 
It is interesting that he employed such terms as “social field,” “social 
space,” and “group life space.” He suggested that such notions cannot 
be avoided and pointed out the necessity of developing concepts for linking 
properties and states of the individual to those of the social field. 
A fundamental difficulty, he noted, arises in going from the individual 
life space, which is constructed to represent the world as it exists for the 
individual, to the social field, which must encompass the “subjective” 
situations of many individuals. In my own opinion, the most promising 
approach to this problem will be one which focuses on ecology, that is, on 
the conceptualization of the hull of the life space. 

As soon as one goes out of the laboratory into “life” and attempts 
open-mindedly to observe what people do it becomes apparent that 


can be incorporated into a coherent psychological theory, and research 
on power [26] provides a clue as to the way in which variables tradi- 
person and psychological environment. It would appear that the important 
extensions of field theory in the foreseeable future will consist 
of working out a detailed way for treating the sensorium, motorium, and 
foreign hull of the life space with concepts commensurate to those now 
employed for dealing with the person and psychological environment. A 
good beginning has been made in the work of psychological ecology, 
where it is realized that the “objective” environment must be described 
in terms having psychological rather than physical meaning. If current 
efforts, such as those of Zajonc [141], to describe “messages” and 
“cognitive structure” in the same set of concepts prove successful, another 
bridge between the inner core of the life space and its foreign hull will be 
established. 

Demonstrations of the influence of needs and values upon perceptual 
processes serve to underscore the necessity for treating the person, psychological 
environment, and the sensorium as an interdependent system. 
Such demonstrations in and of themselves do not, however, contribute 
a great deal to the unified conceptualization of all these processes. 
Motivation, cognition, and perception have to be described in commensurate 
terms before a real understanding of these phenomena will be 
attained. Much the same difficulties exist in dealing with the motorium. 
A conceptual analysis of the motorium, of what skill “is,” of how steering 
takes place, and of how the motorium acts on both the social and 
physical world should improve greatly our understanding of human 
behavior. 

The conceptual representation of “value” especially deserves attention. 
Although the study of values has at last attained a modicum 
of respectability among scientific psychologists, research efforts thus far 
have been confined mainly to problems of measurement. From a field 
theoretical point of view, however, the most urgent task is to determine 
the conceptual properties to be attributed to the construct “value.” Lewin 
suggested that “value” and “goal” are of different conceptual types, that 
“goal” is a central force field while “value” induces force fields. This 
observation led him to conclude that values and individuals are similar 
in that both are capable of inducing forces, a conclusion which is congruent 
with much of psychoanalytic thinking. This insight is quite important 
because it links problems of value to those of interpersonal relations 
and social power, but it provides only a beginning. An urgent task 
of psychology is to obtain answers to questions like these: How do values 
become established in an individual’s life space? How does a value become 
activated? How do values relate to one another? What effects do 
values have? In what ways do values differ from needs or motives? 
Answers to questions such as these will help specify the conceptual properties 
of “value” and put psychology in a position to provide a conceptually 
integrated treatment of the rapidly growing research on such 
topics as self-esteem, attitudes, authoritarianism, social motives (like 
those for achievement and affiliation), and national character. 

The interest of field theorists in showing how cognitive structure 
affects behavior has diverted attention, perhaps too much, away from 
the determinants of cognitive structure itself. It is true that Lewin set 
up the general principle that psychological forces may result either in a 
locomotion of the person or a restructuring of the psychological environ- 
ment, but the conditions under which one or the other will take place 
were never clearly specified nor was it ever entirely clear exactly how 
restructuring takes place. The recent work of Zajonc [141] on “cognitive 
tuning” suggests one promising approach toward a better treatment of 
these problems. 

To the critic of Lewinian theory it is especially interesting that 
Lewin’s absorption in dynamics and in problems of ontogenetic develop- 
ment did not result in richer specification of the laws of development and 
change. Some might be inclined to attribute this fact to his insistence 
upon the principle of contemporaneity; his dynamics always referred to 
a relatively small time differential. I am inclined to believe, however, 
that the reason lies in the intrinsic difficulty in formulating such laws and 
especially in the necessity of conceiving of the entire causal system made 
up of person, psychological environment, sensorium, motorium, and 
foreign hull of the life space. The most concerted attack on this problem 
undertaken by Lewin [95, pp. 170-237] led him into an analysis of 
psychological ecology and into the construction of a circular causal sys- 
tem involving the life space and its foreign hull. As work proceeds on the 
sensorium, motorium, psychological ecology, and on the determinants 
of cognition, motivation, and action, field theory will be better able to 
deal with problems of change and development. 
INTRODUCTION 


I would like to begin by letting off steam. If in what follows I have 
not done a very clear or useful job, I would plead some half-dozen 
reasons. First, I think the days of such grandiose, all-covering systems 
in psychology as mine attempted to be are, at least for the present, 
pretty much passé. I feel, therefore, that it might have been more 
decent and more dignified to let such an instance of the relatively 
immediate dead past bury its dead. Secondly, I don’t enjoy trying to 
use my mind in too analytical a way. Hence, I have found it frustrating 
and difficult to try to subject my system to the required sorts of analyses. 
Thirdly, I suppose I am personally antipathetic to the notion that 
science progresses through intense, self-conscious analysis of where 
one has got and where one is going. Such analyses are obviously a 
proper function for the philosopher of science and they may be valuable 
for many individual scientists. But I myself become frightened and 
restricted when I begin to worry too much as to what particular logical 
and methodological canons I should or should not obey. It seems to 
me that very often major new scientific insights have come when the 
scientist, like the ape, has been shaken out of his up-until-then approved 
scientific rules such as that food can be reached only by the 
hand and discovers “out of the blue,” and perhaps purely by analogy, 
[66] the new rule of using a stick (or a sign-gestalt). Fourthly, I have 
an inveterate tendency to make my ideas too complicated and too 
high-flown so that they become less and less susceptible to empirical 
test. Fifthly, because of increasing laziness, I have not of late kept up, 
as I should have, with the more recent theoretical and empirical 
discussions which bear upon my argument. If I had, the argument 
would have been different and better and also I would have given 
more credit to those to whom credit is due. Finally, to talk about one’s 
own ideas and to resort frequently to the use of the first person singular, 
as one tends to do in such an analysis, brings about a conflict, at least 
in me, between enjoying my exhibitionism and being made to feel 
guilty by my superego. However, I am probably merely giving vent to 
spleen and had better turn to the argument itself. 


BACKGROUND FACTORS AND ORIENTING ATTITUDES 


Rather against my will and because of parental pressure, I 
graduated from the Massachusetts Institute of Technology. I had been 
good in science and mathematics in high school and there seemed no 


with real organisms in terms of their inner psychological dynamics. 
That is, I rejected the extreme peripheralism and muscle-twitch-ism of 
Watson. And I was tremendously influenced by Perry [68], who demon- 


* In his Hitchcock Lectures at the University of California, Berkeley, 1955. 

problem was to get these central processes which I came to call 
“intervening variables” [79] out into objective pointer readings. I 
suppose I also had some yen to deal with the more embracing problem 
of personality structure. But there was not much in the way of 
Freudian or other notions (save intelligence tests) concerning personal- 
ity structure available in those days, at least in America. 

In any event, what happened with me—as happened, I expect, 
with many another beginning psychologist of that era—was that 
having been attracted to psychology in the first place by personal and 
social needs, I proceeded, once I got in, to retreat to such “safe” sub- 
jects as learning and memory with occasional forays into motivation, 
usually at a “safe” biological level. 

When I got to California, I more or less accidentally started a rat 
laboratory. And I became tremendously interested in trial-and-error 
learning as previously investigated by Thorndike. I wanted to defend 
Thorndike on the importance of motivation as against Watson’s 
claims of mere frequency and recency as the determiners of the gradual 
(or relatively sudden) increase in the correct response. However, I did 
not like the Law of Effect. I felt the need for the construct of a driving 
motive, relative to which, and only relative to which, the goal (or the 
punishment) constituted an “effect.” I also felt that a response could 
not be defined as a specific muscle contraction but must in some way be 
defined as a directed, goal-oriented manipulation or “performance.” I 
still feel that “response” is one of the most slippery and unanalyzed 
of our current concepts.* We all gaily use the term to mean anything 
from a secretion of 10 drops of saliva to entering a given alley, to 
running an entire maze, to the slope of a Skinner box curve, to achieving 
a Ph.D., or to a symbolic act of hostility against one’s father by 
attacking some authority figure. Now, I ask you! 

Finally, of course, I was much influenced by gestalt psychology. 
Köhler’s studies of problem solving in apes [38] were terribly illuminating 
to me and reinforced my “centralist” notions. However, I remained 
at heart an associationist and, although I became convinced 
that the whole in some degree governs its parts, these wholes, I felt, 
were acquired by learning and were not autochthonously given. That 
is, I talked about acquired sign-gestalten rather than about innately 
ready, pure perceptual gestalten. As for perception, I have always been 
about as sleazy about it as I have been about responses. The “stimulus” 
as we use it today seems to be just as slippery a term as that of response. 
But I am afraid I haven’t been able to do much about clarifying it. 

I turn now to some of the specific rubrics which have been suggested 
under “orienting attitudes.” 

* For this reason I have been much stimulated by Barker and Wright’s recent 
attempt [3] to specify more significant units of behavior, though I have not yet 
been able to incorporate their suggestions into my own thinking. 

The nature and limits of psychological prediction. I have, on 
the one hand, tended to assume that, if we knew enough, all responses 
(performances), great or small, could be predicted—meaning individ- 
ual performances on individual occasions. And, on the other hand, I 
have also been aware that what I actually do as a rat psychologist is to 
try to predict merely the average responses of a specifically defined 
group of rats under a specific set of conditions. This would mean that I 
could expect to predict the response of the individual rat only with a 
certain degree of probability. To a large extent this circumstance that 
one can predict the individual’s response only as having a certain 
probability is, I think, a part of what Brunswik argued for [10, 89]. 

Level of analysis. Further, I have been interested primarily in 
getting “behavioral” laws at their own level. It seemed that psychol- 
ogy, when I started, was being handicapped by trying to take over the 
inadequate reflex notions of the neurology of the time. Later, of course, 
I also became more aware of the social determiners of behavior. And, 
having a penchant for the vague and the confused, I was once led [81] 
to suggest a position opposite to the standard “reductionism.” The 
reductionist point of view would hold that social behaviors and individual 
psychological behaviors are to be explained by neurophysiological 
processes. I suggested that, in a certain sense, this could be 
turned upside down and that we might assert rather that the laws of 
sociology come first, since they determine the response ranges and the 
nature of the stimulus settings actually available to the individual; and 
that the laws of individual behavior come next, for these determine the 
range of neurophysiological process that can occur. Thus, while not 
denying that in one sense physiology “explains” psychology and 
psychology “explains” sociology, I did then, and I do now, assert that, 
in another equally important and perhaps more pragmatically useful 
sense, sociological phenomena control (explain) psychological phe- 
nomena and psychological phenomena control (explain) neuro- 
physiological ones. But this was all relatively aside from my main 
experimental interests and I have not made use of these ideas in any of 
my theorizing on learning. 

Utility and role of “models.” Although I myself have sometimes 
used the term “model,” it has come to mean so many different things 
from, say, an equation-form, a sociometric diagram, a mechanico- 
electrical rat, a stochastic model of learning, to a chart for the inter- 
relationships between independent, intervening, and dependent 
variables, that the word has lost for me any specific meaning. As I 
see it, the word “model” is merely a sloppy synonym used, when one 
gets tired of some more specific term, to designate the specific type of 
theoretical organization that one is proposing. 

Comprehensiveness of empirical reference. I have always 
wanted my psychology to be as wide as the study of a life career and 
as narrow as the study of a rat’s entrance into a specific blind. But, 
actually, I have for the most part studied only average numbers of 
entrances into specific blinds, or jumps to a particular type of door, 
and have hoped that the principles found there would have something 
to do with a really interesting piece of behavior—say, the choice and 
pursuit of a life career. 

Degree and mode of quantitative and mensurational specificity 
toward which it is desirable and/or feasible to aim. I am very 
relaxed about this. Be as fine in your measurement as you can. But, 
under those overcontrolled artificial conditions in which you can be 
very fine and get great specificity and fineness of measurement, the 
relations or “laws” which you uncover are probably going to have 
very little generality. Psychology today seems to me to be suffering 
from a flight into too high-powered a statistics and into too over- 
mathematized a fitting of curves. The experimental situations for 
which one can achieve such overrefinement probably do not obey the 
canons of “representative design”—see Brunswik [10]. But I am obviously 
again here venting my spleen. I did learn quite a lot of mathematics 
once, but I have forgotten it and have not kept it up; and so I 
tend to minimize the importance of too refined measurements and too 
developed a mathematical treatment until we really know a great deal 
more about the general qualitative lists and patterns of the determiners 
of behavior than we as yet do. 

Type of formal organization. I believe that my various forays into 
theory vary all the way from “informal exposition” to a few minor 
attempts at relatively “explicit hypothetico-deductive axiomization.” 
For the most part my assumed variables and my assumed functions 
connecting them to one another have been in the form of mere loose 
qualitative hunches though sometimes they have been stated in would-be 
more precise quantitative and mathematical terms, e.g., see the 
discussions of VTE [82, 83, 88]. Apparently, I have no scientific 
superego which urges me to be mathematical, deductive, and axiomatic. 
My intervening variables are generally speaking mere temporarily 
believed-in, inductive, more or less qualitative generalizations 
which categorize and sum up for me (act as mnemonic symbols for) 
various empirically found relationships. They do not lead to any then-and-there 
precise further quantitative predictions. They do, however 
(when mixed with a healthy brew of intuition, common sense, and 
phenomenology), lead to suggestions for further types of empirical 
relations to be tested. That is, they are loosely defined “hypothetical 
constructs”—see MacCorquodale and Meehl [48] on hypothetical constructs 
vs. intervening variables—which lead to hypotheses through 
intuition, common sense, phenomenology, and very general notions 
about neurophysiology. Psychology, given all its many parts, is today 
still such a vast continent of unknowns that it has always seemed to 
me rather silly to try to be too precise, too quantitative, too deductive 
and axiomatic, save in very experimentally overcontrolled and over- 
limited areas. 


STRUCTURE OF THE SYSTEM AS THUS FAR DEVELOPED 


1. Exhaustive itemization of systematic, independent, intervening, 
and dependent variables. 

2. Mode of definition of representative variables of each category. 

3. Major interrelations among constructs. 

4. Discussion of order of determining and other characteristics of 
construct linkages.

The above four rubrics are suggested in the outline. But I shall
consider them together by presenting and discussing a series of figures
in which I shall try to indicate the major independent, intervening,
and dependent variables postulated by me.* But I shall not attempt to
be “exhaustive” as suggested in rubric 1. That is, I shall not now
attempt to revive all the flights of fancy that I may have indulged in 
at one time or another in the past. Rather, I shall restrict myself to a 
consideration of what I consider to have been my major theoretical 
constructs, reformulating and elaborating some of them for the purposes
of what I now believe to be a more coherent and more empirically
useful set of statements. I shall also limit my discussion to a
consideration of the variables and interconnecting functions to be
assumed in the explanation of the behavior in a few of the basic experiments
on the instrumental learning and performance of rats. For, as I
indicated above, I believe that, if these can be clearly envisaged, some 
fundamental principles will be found which can eventually be ex- 
panded and built upon so as to become helpful for the explana- 
tion of human behavior. Here and there I shall suggest such further 
human implications. 


However, the “surplus meanings” of my intervening variables which make them 
into hypothetical constructs are not at this stage primarily neurophysiological, as 
dependent variables, and the interconnecting causal connections
(the latter represented by arrows) which I conceive to be involved in 
the learning and performance of (1) a simple approach to and eating 
of food, (2) a case of simple escape from an electric shock, and (3) a 
case of simple avoidance of an electric shock. In subsequent charts, I 
shall consider two more complicated setups, viz., choice-point learning, 
and latent or incidental learning. 

A case of approach. Independent variables. Look at Fig. 1. Look first
at the independent variables. These have been separated into two 


[Fig. 1. Approach. Column headings: independent variables, intervening variables, dependent variable.]


columns labeled “past” and “present.” In the “past” column is a 
complex symbol which stands for a complicated interconnected array 
of independent variables which have occurred in the trials preceding 
the one in question, or in other somewhat similar previous situations. 
In the second, “present,” column are the symbols which stand for the
specific and particular variables which are presented to the animal at
the beginning of the given trial, that is, on the trial the results of which
are to be predicted.

Before, however, examining these symbols for the independent
variables in more detail, let us look briefly at the other parts of the
figure.

Intervening variables and the dependent variable. It will be noted that
the intervening variables also are divided into two columns: means- 
end readinesses in the first column; and perceptions, representations, 
expectancies, and valences in the second column. Finally, the last 
column in the chart contains the dependent variable: namely, in this 
case, the to-be-predicted strength of the tendency on the given trial to 
approach the food. This latter is symbolized as $R_1^*$.4

It should be emphasized that because of common practice, I have 
throughout symbolized the manipulatory organism-environment 
rearrangements (i.e., performances) which I am concerned with, by 
the letters R or r. And, for literary convenience, I have often called 
them “responses” rather than “performances.” It is to be stressed 
again, however, that for me the type of response I am interested in is 
always to be identified as a pattern of organism-environment rearrange- 
ments and not as a detailed set of muscular or glandular activities. 
These latter may vary from trial to trial and yet the total “perform- 
ance” remain the same. Thus, for example, “going towards a light” 
is a performance in my sense of the term and is not properly a response 
(a set of muscular contractions). That is, it does not fit into the original 
Hullian definition of a response, although the Hullians often seem to 
forget this fact and tend to talk of “going toward the light” as if it 
were specified by a specific set of muscular contractions. In so doing 
they seem to me to cheat themselves and to make confusion for the rest 
of us. 

Independent variables (past). Return now and look in more detail
at the complex symbol for the past independent variables, those
involved in previous trials, or in any previous training relatively similar
to the trial in question. This symbol is
$\Sigma\{[D_{h_1}][(S_1R_1 \to S_f)\cdot(S_fR_f \to S_m^+)\cdot(S_1\bar{R}_1 \to S_{\bar{f}})\cdot(S_{\bar{f}} \to S_{\bar{m}}^-)] \to [D_{h_2}]\}$.6
$\Sigma$ is the symbol
for the frequencies, recencies, and distribution of such previous trials or
previous training. $D_{h_1}$ represents the hunger drive-stimulation (the
hungriness and its magnitude) which was present and operative
throughout each of these preceding trials. (I have therefore placed this
$D_{h_1}$ in a bracket coordinate with the bracket containing the parentheses
representing the successive stimulus → response → stimulus
events involved in each trial.) And $D_{h_2}$ represents the hunger drive-stimulation
and its magnitude present at the termination of each such
previous trial. $D_{h_2}$ has usually been assumed by both experimenters

4 The independent and dependent variables have been symbolized by capital
letters. And a single then-and-there immediately present instance of one of these
independent or dependent variables is symbolized by adding an asterisk to the

5 The trial in question may, of course, be the first one in the given setup.

6 The dots between the parentheses symbolize “and.”
and theoreticians to be less in magnitude than $D_{h_1}$. But we need more
evidence than we now have to be certain that this is always the case.
There are certainly some findings, e.g., Bruce [9], which suggest that
the result of the reaching of a small amount of food (or, in the case of
thirst, a small amount of water) following an early trial is actually to
enhance rather than to decrease the drive-stimulation.7

Let us consider further for a moment the nature of the $D_{h_1}$'s and
$D_{h_2}$'s. I have called them “drive-stimulations” rather than simply
“drives” or simply “stimuli” to distinguish them as internal conditions
which have properties both similar to and different from those of 
ordinary stimuli and also different from those of drives as often con- 
ceived. In fact, such stimulations have, I believe, three kinds of property, 
viz., they can be discriminated by the organism (at least to some ex- 
tent) one from another; they have energizing properties; and they 
have value-giving properties. 

First, they can be discriminated. Drives can, at least to some extent,
be discriminated by organisms, even by rats. Thus it appears that
rats can be trained to do one performance for one degree of initial
hunger $D_{h_1}$ and to do another performance for another degree of hunger
$D_{h_1}'$ [7, 36]. And also rats are able to distinguish between two different
drives such as hunger and thirst in the sense that they can make
one performance for thirst $D_{t_1}$ and another for hunger $D_{h_1}$ [2, 8, 29, 43].
And certainly human beings, if not rats, can to some extent actually
rate the degrees of their immediate hungers (their $D_{h_1}$'s) and also
probably their terminal hungers (their $D_{h_2}$'s).

Second, drive-stimulations have energizing properties. In saying
that a drive-stimulation (for example, a $D_{h_1}$) energizes, I refer to
two different types of fact: (1) an initial hunger drive-stimulation tends
to lower thresholds for both foodlike objects and for to-be-explored
objects and for eating responses to the former and for exploring responses
to the latter; (2) the second fact is that these explorations and
eatings will exhibit trial and error until success is reached.8 This matter
of “success” leads us to the third property of drive-stimulations.


7 This seeming fact raises doubt concerning the doctrine of drive-reduction as
the basis of reinforcement. However, the doctrine of drive-reduction is not one
which I shall argue for or against here. I shall come back to it later.

8 Trial-and-error exploratoriness I conceive to be a function of any drive, e.g.,
hunger, thirst, sex, or what not. I also assume, however, that in addition to such
major biological drives there also often appears a pure curiosity drive. The goal
of this latter is the experience of distant stimuli merely per se and it also is accompanied
by its own exploratory trial and error. For exciting experimental evidence
for such a curiosity drive, see Berlyne [5] and the investigations of Montgomery
and Harlow and of their associates [22, 54, 55, 56, 57].

Finally, it may be noted that there is perhaps a curvilinear relationship between 
Third, drive-stimulations have value-giving properties. A drive-stimulation
sets up both positive and negative goals. That is to
say, a drive-stimulation such as hunger lends positive value to one
specific type of terminal stimulus or stimuli and also it may (at
least temporarily) lend negative value to any stimuli which are
not the final goal stimuli.9 This presence of positive and negative
values is symbolized in Fig. 1 by the plus superscript on the positive
terminal stimulus $S_m^+$ and the minus superscript on the stimulus
of $S_{\bar{m}}^-$.10 If it would not have confused the diagram, I would
have drawn arrows directly from $D_{h_1}$ to these positive and negative
goal stimuli ($S_m^+$ and $S_{\bar{m}}^-$) to indicate that the positive and negative
values on them are the direct product of the governing drive-stimulation
$D_{h_1}$.

Finally, before leaving this consideration of the drive-stimulations, 


alternative (SRS) routes the organism will, generally speaking, 
come to select the easiest and/or most probable route. Although most 
psychologists have, it seems to me, either explicitly or implicitly rec- 


past, see especially [88], is to postulate that, along with each major
governing drive-stimulation, there is also involved an additional drive 


mon sense terms, if the major drive-stimulation is too weak, the animal is not 
sufficiently “interested” to examine widely and, if the major drive-stimulation is 
too strong, he tends to be too “tense” to examine widely [15]. 

9 


10 $S_m$ is a symbol for all the “mouth,” stomach, and nutritional stimuli resulting
from eating and $S_{\bar{m}}$ is a symbol for any stimuli which are not such final consummatory
stimuli.

to be called that against effort and uncertainty and/or that for little effort
and for certainty.11

But let us turn now to the meaning of the four parentheses within 
the second bracket, viz., 


$[(S_1R_1 \to S_f)\cdot(S_fR_f \to S_m^+)\cdot(S_1\bar{R}_1 \to S_{\bar{f}})\cdot(S_{\bar{f}} \to S_{\bar{m}}^-)]$


The parenthesis $(S_1R_1 \to S_f)$ symbolizes, as an independent variable,
the fact that the manipulatory response $R_1$ of responding to the
given alley $S_1$ by going down it resulted on each trial in a consequent
reaching of the further stimulus of food $S_f$.12 It is important to
emphasize once more that this performance $R_1$ of going down the
alley is to be identified as a discriminable general organism-environment
rearrangement. It gets the animal to the end of the alley and may
maintain its identity over a range of different sets of muscle contractions
on different occasions.

Furthermore, it is to be emphasized that $S_1$ (the alley as stimulus)
is to be specified, not in terms of a single atomlike stimulus or in terms
of simple relatively “proximal” effects, see Brunswik [10, 89], upon the
sensory surfaces but rather as a complicated “distal” stimulus-object
which also includes a set of associated behavior-support stimuli.13
That is, $S_1$ symbolizes the alley as a pattern of potential-releasing
stimuli plus a pattern of immediate “behavior-support” or feedback
stimuli.

In other words (whatever I may have said earlier) I am now suggesting
that behavior-support properties are also to be conceived as
stimulus patterns, i.e., as some characteristic pattern of feedback stimuli
(from distance receptors and/or tactile and proprioceptive receptors)
which result as the animal makes a given performance and which are
necessary supports if this performance is to be carried out successfully.


11 Another type of argument advanced by Hull [28] for explaining the choosing
of a shorter route was the assumption that the shorter route would as such be more
rapidly learned. My own bias as a result of some of the latent learning experiments,
especially that of Blodgett [6], in which he used a simple two-way maze, is to
believe that the longer route may be equally rapidly learned, but that when a goal
is introduced the longer route by virtue of its length becomes less preferred simply
because of the drive-stimulation against work. (Brunswik, at least in private conversations,
has contended that the preference for a shorter route may perhaps
really be subsumed under the heading of choosing the more certain route.) At any
rate it appears that such a doctrine as I am here proposing involves the assumption
that work or uncertainty comes, as such, to be discriminated by the organism
and to be accompanied by negative values.

12 I am here assuming the case of a single alley with food at the end of it.

13 For an initial discussion of behavior-supports see Tolman [78].
Turn now to the parenthesis $(S_fR_f \to S_m^+)$. This symbolizes the
further fact that $R_f$ (the response of eating the food $S_f$) results in the
reaching of a set of final goal stimuli $S_m^+$. These goal stimuli $S_m^+$ are
to be conceived as stimuli in the mouth, the stomach, and to some extent
in the underlying nutritional system. And the plus superscript
accompanying this $S_m^+$ symbolizes the fact, already discussed above,
that there will be persistence and trial and error until this $S_m^+$ is
reached, that in other words $S_m^+$ is to be conceived as having “positive
value.”

The parenthesis $(S_1\bar{R}_1 \to S_{\bar{f}})$ symbolizes the fact that, when the rat
does not run down the alley ($\bar{R}_1$), he stays in the presence of, or reaches,
nonfood stimuli $S_{\bar{f}}$. And the parenthesis $(S_{\bar{f}} \to S_{\bar{m}}^-)$ symbolizes the
further fact that a stimulus of nonfood $S_{\bar{f}}$ leads to (is accompanied by)
stimuli which are nongustatory stimuli $S_{\bar{m}}^-$. And these nongustatory
stimuli have, as such, a negative value.

Including now also the bracketed $D_{h_1}$ and the bracketed $D_{h_2}$, the
total brace represents the fact that in the preceding trials each of the
following things has happened. There has been a hunger drive-stimulation
$D_{h_1}$. Further, in the presence of this hunger drive-stimulation the
rats have made the response $R_1$ of going down the alley $S_1$ and this
response has got them to the food stimuli $S_f$. And they have made the
further response of eating $R_f$ in the presence of these food stimuli
$S_f$ and this latter response has got them to the internal goal stimuli
$S_m^+$, where these goal stimuli have been positive in the sense that the
rats have shown exploration tendencies and persistence on each trial
until these goal stimuli $S_m^+$ were reached. And in the course of their
explorations they have made other responses than $R_1$ (i.e., they have
made $\bar{R}_1$) and these $\bar{R}_1$ responses have led to nonfood stimuli $S_{\bar{f}}$. And
these nonfood stimuli $S_{\bar{f}}$ have led to, or been accompanied by, nongoal
stimuli $S_{\bar{m}}^-$, where these latter have been negative in the sense that as
long as the $S_{\bar{m}}^-$ stimuli were present the rats have continued to make
exploratory responses. Finally, each trial has ended with the rats


14 See the work of Neal Miller and his associates [4, 39, 53]. These workers
made fistulas in the stomachs of their rats by which actual food or saline solu- 
tions or balloons (which could be blown up) could be introduced and the results 
of these procedures could be compared with those of normal eating through the 
mouth. By various permutations and combinations of these procedures Miller and 
his associates discovered that mouth stimuli, stomach stimuli, and nutritional 
stimuli were all involved as the goal of eating. Sheffield and Robie [74] also found 
that saccharin, a nonnutritive taste, merely as such, constitutes a goal stimulus. 
P. T. Young’s [97, 98] findings also suggest the fact that mere mouth stimuli like- 
wise constitute a significant part of the total goal. For convenience I have used 
the subscript m to cover all these resultant mouth, stomach, and nutritional system 
stimuli. 
reaching $S_m^+$ which has presumably produced some new value of
the hunger drive-stimulation $D_{h_2}$, equal to, greater than, or less
than $D_{h_1}$.

Means-end readinesses. Before considering the other, “present,” independent
variables, let us look now at the first column of intervening
variables, labeled “means-end readinesses.” As the causal arrow indicates,
I have assumed that the frequencies, recencies, and distributions
$\Sigma$ of the previous trials under such and such drive-stimulation conditions
$D_h$'s have produced directly a corresponding set of means-end
readinesses. It will be noted that, whereas the independent variables
were symbolized by capitals, these means-end readinesses have been
symbolized by lower-case letters. Further, these means-end readinesses
are to be thought of as acquired cognitive dispositions resulting
directly from preceding trials or from other, related, previous
training. And, as such acquired dispositions, they are conceived to be
one of the major determiners of the final performance. These acquired
dispositions or readinesses can be thought of as dispositional sets for
certain types of (stimulus → response → stimulus)-sequences or for certain
types of (stimulus → stimulus)-sequence. In other words, when an
instance of an $s_1$ is presented, there tends to be released an expectancy
(see next column in the chart) that an instance of the kind of performance
symbolized by $r_1$ will lead to an instance of $s_2$ (or else simply that
the presentation of an instance of an $s_1$ will by itself lead to, or be
accompanied by, an instance of an $s_2$).16

But let us examine now in more detail the symbolic expression for
the total series of means-end readinesses, viz.,


$[(s_1r_1 \to s_f)\cdot(s_fr_f \to s_m)\cdot(s_1\bar{r}_1 \to s_{\bar{f}})\cdot(s_{\bar{f}} \to s_{\bar{m}})]$


This stands for the fact that the animal has acquired a disposition or
set to the effect that an $s_1$ type of stimulus, when manipulated by an $r_1$
type of response, will lead to a food stimulus of type $s_f$; and that he has
also acquired a set to the effect that food stimuli of the type $s_f$, when
manipulated by the eating type of response $r_f$, will lead to mouth,
stomach, and nutritional goal stimuli of type $s_m$; and that he has
acquired a set to the effect that not performing an $r_1$ (i.e., $\bar{r}_1$) will lead
to, or leave him in the presence of, nonfood stimuli of type $s_{\bar{f}}$; and
that finally he has acquired a set to the effect that nonfood stimuli $s_{\bar{f}}$
will, simply as such, be accompanied by nongustatory stimuli $s_{\bar{m}}$. He
has learned all this and these dispositions or sets remain present in him
(or, if you prefer, in his nervous system) whether or not he is hungry at the


15 I shall also call them “beliefs.”

16 Means-end readinesses or beliefs are symbolized by lower-case letters.

moment17 and whether or not any instances of the type of stimuli and responses
in question are present at the moment. 

Further, it is to be emphasized that separate items in this total 
set of means-end readinesses can be present and be ready to function 
without the others. Thus, the animal can have the readiness $(s_fr_f \to s_m)$
without then having the readiness that $(s_1r_1 \to s_f)$. That is, he can have
learned that eating the given type of food leads to certain types of
gustatory (mouth, etc.) stimuli without having yet learned that going
down the given type of alley leads to this type of food. And he can
have the readiness $(s_{\bar{f}} \to s_{\bar{m}})$, i.e., no food leads to nongustatory stimuli,
without yet having the readiness $(s_1\bar{r}_1 \to s_{\bar{f}})$. Indeed the fact that
animals are usually trained to eat the given food, before the experiment
proper has begun, means that they are extremely likely to have the
readiness $(s_fr_f \to s_m)$ before they have the one $(s_1r_1 \to s_f)$. And,
owing to all their past experiences, they are also likely to have the
readiness $(s_{\bar{f}} \to s_{\bar{m}})$ before they have acquired the one relative to the
specific alley that $(s_1\bar{r}_1 \to s_{\bar{f}})$.

Finally, it is my assumption (see Fig. 1 again) that all these readinesses
expressed in the total bracket


$[(s_1r_1 \to s_f)\cdot(s_fr_f \to s_m)\cdot(s_1\bar{r}_1 \to s_{\bar{f}})\cdot(s_{\bar{f}} \to s_{\bar{m}})]$

can telescope into the two more comprehensive ones:

$[(s_1r_1 \to s_m)\cdot(s_1\bar{r}_1 \to s_{\bar{m}})]$


This latter symbol represents the over-all readinesses to the effect
that “going down” $r_1$ the specific type of alley $s_1$ will lead to a gustatory
goal stimulus $s_m$ and that not going down $\bar{r}_1$ such an alley $s_1$ will
lead to (or be accompanied by) a nongustatory stimulus $s_{\bar{m}}$.
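
The telescoping of readinesses can be pictured as the chaining of stored links, each running from a stimulus (with or without a response) to the further stimulus it is believed to lead to. The following is only a minimal sketch under that reading. The dictionary encoding, the string labels, and the names `READINESSES` and `telescope` are assumptions introduced here for illustration and are not part of the system as stated.

```python
# Hypothetical encoding (an assumption of this sketch): each readiness maps a
# (stimulus, response) pair to the stimulus type it is believed to lead to.
# A response of None stands for a bare (stimulus -> stimulus) readiness.
READINESSES = {
    ("s1", "r1"): "sf",          # going down the alley leads to food
    ("sf", "rf"): "sm",          # eating the food leads to goal stimuli
    ("s1", "not_r1"): "not_sf",  # not going down leads to nonfood
    ("not_sf", None): "not_sm",  # nonfood is accompanied by nongoal stimuli
}


def telescope(readinesses, stimulus, response):
    """Follow stored links from (stimulus, response) to the last stimulus reached.

    Returns None when no readiness exists for the starting pair.
    """
    current = readinesses.get((stimulus, response))
    if current is None:
        return None
    seen = {current}
    while True:
        # Look for any readiness whose starting stimulus is the current one.
        nxt = next(
            (out for (s, _), out in readinesses.items() if s == current),
            None,
        )
        if nxt is None or nxt in seen:
            return current
        seen.add(nxt)
        current = nxt
```

With the four links above, `telescope(READINESSES, "s1", "r1")` returns `"sm"` and `telescope(READINESSES, "s1", "not_r1")` returns `"not_sm"`, which are the two comprehensive readinesses. If only the eating link is stored, the call for the alley returns `None`, matching the point that separate readinesses can be present without the others.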

Further, I would like to suggest that such a telescoping is only a 
simple case of what is probably a general property of all readinesses. 
I would suggest, namely, that readinesses (beliefs) can combine and 
interact not only in this fashion of telescoping but in other ways as well. 
I would argue, in short, that the experiments on insight, reasoning, 
and short-cutting suggest that readinesses can be added and multiplied 
as well as telescoped. And when so added or multiplied, they may lead 
to new responses, never before exhibited. 

17 This is the reason that there are no plus or minus superscripts on the items
in these means-end readinesses. The means-end readinesses are to be conceived as
pure cognitive dispositions. This does not mean, however, that the motivational
and value items in the independent variables (i.e., $D_{h_1}$ and $D_{h_2}$ and the positive
value on $S_m^+$ and the negative value on $S_{\bar{m}}^-$) may not have some influence on the
acquisition of such means-end readinesses. But this question of the effect of “effect”
upon learning will be postponed until we turn in our résumé to the laws of
learning.
of magnitude) and their degree of confidence in this belief as to the outcome
(the first type of magnitude).

Independent variables (present). Let us now turn back to the two independent
variables in the column labeled “present.” These are symbolized
as $S_1^*$ and $D_{h_1}^*$. They are to be conceived as the characters and
magnitudes of the alley stimulus $S_1$ and of the drive-stimulation $D_{h_1}$
specifically present on the given trial. The asterisks are to indicate
that each has a specific, particular, then-and-there value which may
or may not be different from the characters and magnitudes of these
two variables in the preceding trials summarized by the symbol $\Sigma$.

Intervening variables (perceptions, expectancies, representations, and
valences).19 Turn next to the second column under “Intervening variables.”
It will be noted that all the letters in this column are capitalized
and have dots over them. The dots are to indicate that these symbols
refer to intervening variables rather than to independent or dependent
variables. And the symbols are capitalized to indicate that, like the
independent and dependent variables, they are particular events,
which are conceived to be present only on the individual occasion. To
sum up, they are capitalized to distinguish them from the means-end
readinesses (symbolized by lower-case letters); but they have dots over
them to distinguish them from the independent and dependent variables
(i.e., the physically or physiologically defined stimuli, responses,
and drive-stimulations).

Let us examine them one by one. Consider $\dot{S}_1$. Whereas the symbol
$S_1^*$ (in the column for independent variables) stands for the pattern of
stimuli produced by the actual physical presence of the alley, the
symbol $\dot{S}_1$ stands for the resulting discrimination (“perception”)20

18 For the possibility of getting good confidence ratings in human subjects, see
[33].

19 Perceptions, expectancies, and representations are symbolized by capital letters
with dots over them.

20 By “perception” I do not, of course, mean a necessarily conscious event.

of this complex stimulus. (For the method of measuring or estimating
such “perceptions” see below.)

Similarly, whereas the symbol $D_{h_1}^*$ in the column for the independent
variables stood for the specific hunger drive-stimulation on
the particular occasion, as determined by what has actually been done
to the animal (usually in the way of such and such a degree of deprivation),
the symbol $\dot{D}_{h_1}$ stands for the actual discriminated and effective
(or, if you will permit me, “perceived”) drive-stimulation on that
occasion. (This latter may be greater or less than objectively, biochemically,
it should be.) Further, it will be observed that $\dot{S}_1$ is conceived
to be caused by both $S_1^*$ and $\dot{D}_{h_1}$.21

Look next at the symbol $[(\dot{S}_1\dot{R}_1 \to \dot{S}_m)\cdot(\dot{S}_1\dot{\bar{R}}_1 \to \dot{S}_{\bar{m}})]$. This stands
for two concrete expectancies: (1) that $\dot{R}_1$ (a “represented” response of
going down the perceived alley $\dot{S}_1$) will result in a “represented” (and
expected) pattern of gustatory stimuli $\dot{S}_m$, and (2) that a “represented”
response of not-going-down $\dot{\bar{R}}_1$ the perceived alley $\dot{S}_1$ will result in a
represented (and expected) pattern of nongoal stimuli $\dot{S}_{\bar{m}}$. And as
shown by the causal arrows, these two expectancies are conceived to be
caused by the combined action of the means-end readinesses $(s_1r_1 \to s_m)$
and $(s_1\bar{r}_1 \to s_{\bar{m}})$ and the here-and-now aroused perception of the presence
of the alley ($\dot{S}_1$).

Note also that I have repeated $\dot{S}_m$ and $\dot{S}_{\bar{m}}$ in boxes below. This
bringing them down is partly for convenience in diagraming. But it 
is also meant to suggest that as “representations” such “expected” 
entities can be perhaps in some degree separated from, and function 
independently of, the specific expectancies in which they are embodied. 

Finally, note that, when the “represented” $\dot{S}_m$ and $\dot{S}_{\bar{m}}$ are combined
with the action of the “effective” hunger $\dot{D}_{h_1}$, they become endowed
with positive and negative valences.22 The fact that they
acquire such valences is symbolized in the still lower boxes by the addi- 
tion of the plus and minus superscripts. 

It is to be noted that I here speak of “valences” rather than of 
“values.” The term “values” was used for the independent variables, 
symbolized by $S_m^+$ and $S_{\bar{m}}^-$. And such values are specified by the
objective degree to which the animals, having reached an actual
instance of an $S_m$, would tend on the further occasions to persist and to
show trial and error in reaching it again; or, similarly, having got 


21 This allows for the effects, such as they may be, of motives on perception.
The hungry animal may well be more ready to perceive certain types of external
stimulus situations than others. But, as we all know, the facts as to such effects, if
any, are still perhaps dubious and certainly unclear.

22 Valences are symbolized by the symbols for the expected good or bad goal
stimuli with a + or a − superscript added.
away from an actual $S_{\bar{m}}$, to what extent they would tend on subsequent
occasions to show restlessness until they got away from this $S_{\bar{m}}$ again.
“Valences,” on the other hand, are conceived as the intervening
variables, symbolized by $\dot{S}_m^+$ and $\dot{S}_{\bar{m}}^-$, which function as such only on
the individual occasion. They correspond to what the animal then
and there “expects” in the way of “goodness” or “badness” (which
may or may not correspond to the actual values as determined by the
animal’s behavior when he has got to the corresponding actual $S_m$
or $S_{\bar{m}}$). Thus, for example, an experimenter may in fact alter the situation
during or before the given trial so that the expected values,
i.e., the valences built up as a result of the preceding trials, will no
longer correspond to the actual values. The experimenter may, for
example, make the true consummatory value of the food $S_m^+$ either
greater or less than it was on the preceding trials. The consummatory
consequence (the value) of the food, when got to, may thus be greater
or less than expected, i.e., than “its valence ($\dot{S}_m^+$) during the trial.”
And pari passu the internal consequences of nonfood (value) may be
made more or less than its expected value (valence). To use Lewin’s
terminology, the valences are in the animal’s “life space,” whereas
the values are in the effects of the actual environment upon the
organism.

Finally, I assume that it is the magnitudes (in both senses, see
above) of the expectancies $(\dot{S}_1\dot{R}_1 \to \dot{S}_m)$ and $(\dot{S}_1\dot{\bar{R}}_1 \to \dot{S}_{\bar{m}})$ and the
magnitudes of the valences $\dot{S}_m^+$ and $\dot{S}_{\bar{m}}^-$ which act together to determine
the magnitude of the resulting dependent response ($R_1^*$) rather
than its opposite $\bar{R}_1^*$.23
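
The text says only that the magnitudes of the expectancies and of the valences "act together"; it gives no formula. Purely as an illustration, the sketch below assumes one simple rule: the confidence of each expectancy is multiplied by the signed valence of its expected outcome, and the larger product wins. The product rule, the numerical scales, and the names `tendency` and `predicted_response` are all assumptions introduced here, not part of the system as stated.

```python
def tendency(confidence: float, valence: float) -> float:
    """Assumed combination rule: expectancy confidence times signed valence."""
    return confidence * valence


def predicted_response(conf_go: float, val_goal: float,
                       conf_stay: float, val_nongoal: float) -> str:
    """Compare the two tendencies and name the stronger performance.

    conf_go / conf_stay: confidence (0 to 1, an assumed scale) in the two
        expectancies, going down the alley and not going down it.
    val_goal / val_nongoal: signed valences of the expected goal and
        nongoal stimuli (positive = good, negative = bad; assumed scale).
    """
    go = tendency(conf_go, val_goal)
    stay = tendency(conf_stay, val_nongoal)
    return "R1" if go > stay else "not R1"
```

Under these assumptions, a confident expectancy of a positively valenced goal together with a negatively valenced nongoal outcome yields the approach performance, while an animal with no confidence in the alley leading to the goal is not predicted to approach.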

So much for a survey of the symbols in Fig. 1 and of the constructs 
for which they stand. Thus far I have merely stated these constructs 
and indicated the assumed causal interactions between them. That 
is, I have presented these constructs in what the outline for Project A
would call my “system” language. How far can I now also state them
in an “empirical language”? How far, in short, can I suggest empirical
pointer readings for them?

Pointer readings. Let us look first at the independent variables. 

Begin with $\Sigma$. The symbol $\Sigma$ stands for the frequency, recency, and
distribution of previous trials. This construct of frequency, recency,
and distribution presents no great difficulty. The pointer readings for
it are obtained by timepieces and counting. And these timings and
countings give us relatively undisputed measures of what all of us
would mean by frequencies, recencies, and distributions.
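
As a concrete picture of such timings and countings, the sketch below derives three readings from a list of trial times: a count (frequency), the time elapsed since the last trial (recency), and the mean spacing between trials (one possible summary of distribution). The function name `sigma_readings`, the choice of mean spacing as the distribution measure, and the time unit are assumptions made for illustration only.

```python
from statistics import mean


def sigma_readings(trial_times, now):
    """Summarize previous trials by count, time since last, and mean spacing.

    trial_times: times at which previous trials occurred (any one unit,
        e.g. hours since the start of the experiment; an assumed convention).
    now: the time of the trial whose result is to be predicted.
    """
    times = sorted(trial_times)
    if not times:
        # The trial in question is the first one in the given setup.
        return {"frequency": 0, "recency": None, "mean_interval": None}
    intervals = [later - earlier for earlier, later in zip(times, times[1:])]
    return {
        "frequency": len(times),
        "recency": now - times[-1],
        "mean_interval": mean(intervals) if intervals else None,
    }
```

For three previous trials at hours 0, 24, and 48 and a present trial at hour 72, the readings are a frequency of 3, a recency of 24, and a mean interval of 24.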

23 The expected, represented, and valenced $\dot{S}_m$'s and $\dot{S}_{\bar{m}}$'s are ...
analysis by the constructs of $r_g$ and $s_g$. But I find that these hypothetical,
relatively peripheral, constructs do not help me in my own thinking.


110 EDWARD C. TOLMAN 


Look next at $D_{h_1}$ and $D_{h_2}$, the hunger drive-stimulations present
at the beginning and end of each trial. As defined above, a hungriness
(a $D_{h_1}$ or a $D_{h_2}$) is a disposition to approach and to eat. My suggested
pointer reading for either a $D_{h_1}$ or a $D_{h_2}$ will be some measure of a general
tendency in the given state of the animal to approach and/or eat
foods in general. This measure would have to be obtained in some
agreed-upon standard situation (say, a Skinner box) and not in the apparatus
used in the specific experiment under consideration. That is,
before the beginning of each trial (in the case of a $D_{h_1}$) a group of rats,
comparable to those which had had the same number of preceding trials in
the experiment proper, would have to be tested with a variety of foods
in the Skinner box and average rates of pressing and/or eating measured.24
These measures would then be taken as the empirical pointer
readings for the then-and-there $D_{h_1}$. Such a procedure would, of
course, mean that the rats thus tested would then have to be sacrificed
in the sense that they then could not be used further in the experiment
proper. For the experience of pressing the lever and eating in a Skinner
box just before the given trial would presumably to some degree distort
the animals’ $D_{h_1}$ for the given trial. The total procedure would thus be
a tremendous task and would involve the use of a great number of rats.
Something like it, however, seems to me to be called for until some
day we can perhaps get some direct biochemical measure of a $D_h$.

It may be argued, however, that there is another less costly method
for empirically estimating magnitudes of D_Hi. Indeed, it has been
assumed by most experimenters that hunger drive-stimulations (D_Hi's) are
directly related to hours of food-deprivation or to maintained
percentages of decrease in body weight. Thus it is assumed that if at the
beginning of a given trial the deprivation has been for 22 hr—or if
the body weight has been kept at 80 per cent of normal—then the
hunger drive-stimulation, the D_Hi, will have a constant value for this trial.

It also seems to be assumed—at least in the case of hours of
deprivation—that relatively known curves have been established [95]
between different hours of deprivation and different strengths of D_Hi.
That is, it is assumed, for example, that the D_Hi's vary from 6 hr
deprivation to 12 hr deprivation to 24 hr deprivation, or even to
48 hr according to some relatively known curve.

But there are, I believe, certain assumptions underlying all these
thoughts and procedures about D_H's which need to be brought to
light. We assume, I believe, that a drive-stimulation, such as
hungriness, is fundamentally a biochemical-physiological state. But it has
further been assumed that the actual magnitudes of this state are
reflected rather directly by the number of crossings in the Columbia
Obstruction Box, or by rates of pressing in the Skinner box. But these
last assumptions seem shaky. It appears, therefore, that ultimately
the desired measure for initial (or terminal) hungriness, or for any
other basic drive-stimulation, must be some direct physiological or
biochemical one—see Ritchie [72]. However, I shall nevertheless
conclude that today, in default of any such direct measures, the best
that we can do is to use, as has been suggested, some measure of general
readiness to approach and/or consume a standard goal in a standard
situation.

²⁴ It would be necessary that the rats had had plenty of previous training in
the Skinner box beforehand so that there would be no problem of their having to
learn what to do.

Consider further the D_Ht after each trial. For this also we must
obtain an empirical pointer reading. Such a reading would have to be
obtained after each trial. And again this may perhaps be done by some
measure of the then-and-there new (posttrial) readinesses to approach
and/or eat the standard food in a standard setup. These D_Ht measures
would perhaps be even more troublesome to make than those for the
D_Hi. But it seems important to make them to discover whether actually
the hungriness goes up or down after a given trial. As I have suggested
above, my hunch is that it probably goes up after each of the first few
trials, but that it probably goes down after later trials.

Turn next to empirical measures of the qualitative natures of S_m
and S_m̄. The only empirical attacks that we have on these to date are
those of Miller and his coworkers already referred to. But what about
the positive and negative values? Whereas D_Hi and D_Ht were conceived
as the readinesses for food in general, S_m+ is to be conceived rather as
the readiness for the specific internal (mouth, stomach, and
nutritional) stimuli resulting from the eating of the specific food in the specific
situation. And S_m̄- is to be conceived as the unreadiness (the
“discomfort”) in the presence of any stimuli other than S_m+ in this specific
situation. That is, both these “values” are to be measured directly in
the specific situation. S_m+ might be obtained by some measure of
actual rate of eating of the specific food in the goal-end of the specific
approach box at the termination of each trial. And S_m̄- might be
obtained by some measure of something like restlessness in the other
parts of the box on the given trial. These will be difficult measures to
obtain but perhaps they can be arrived at.

Pointer readings for intervening variables. Turn first to the means-end
readinesses. The basic empirical setup, according to me, for obtaining
pointer readings for means-end readinesses would be transfer
experiments. Such experiments would involve either a transfer from
preceding trials to the trial in question in the same apparatus, or they might
involve a transfer from a series of trials in some different situations to
the trial in question in the given situation. It is from such transfer
experiments, with their favorable or negative effects upon the test 
trial, that we would infer the kinds and strengths of the means-end 
readinesses which were established by the training previous to the 
test trial and which are, therefore, to be assumed to explain the positive 
or negative transfer to that test trial. If the success and smoothness 
of the performance in the transferred-to test trial proves enhanced by 
the preceding training (or preceding trials in the same specific setup), 
then we must infer that the means-end readinesses (the beliefs) estab- 
lished by the preceding training were very similar to those which we 
assume as leading to the expectancy producing the given behavior in 
the given trial. Whereas, if the performance in the test trial is less 
correct than it would be without this preceding training, then we must 
infer that the means-end readinesses (beliefs) established by the 
preceding training (or trials) were relatively incompatible with those
required for the correct expectancy and correct performance. And by 
different sorts of such transfer experiments we could become more and 
more precise in our specifications. That is, we could learn more and 
more about the means-end readinesses which the animal brings with 
him as well as of the one operative in the test trial itself.²⁵
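
The inference from a transfer experiment can be put in a minimal sketch. It is an illustration only and rests on assumptions not stated in the text: performance on the test trial is taken to be an error count, a control group without the preceding training is available for comparison, and the function name `transfer_effect` and the sample numbers are hypothetical.

```python
from statistics import mean


def transfer_effect(control_errors, trained_errors):
    """Compare test-trial errors with and without preceding training.

    control_errors: error counts for animals without the preceding training
    trained_errors: error counts for animals given the preceding training
    Returns (difference, label); a positive difference means fewer errors
    after training, i.e. positive transfer.
    """
    difference = mean(control_errors) - mean(trained_errors)
    if difference > 0:
        label = "positive transfer"
    elif difference < 0:
        label = "negative transfer"
    else:
        label = "no transfer"
    return difference, label


# Hypothetical error counts on the test trial.
helped = transfer_effect([6, 8, 7], [3, 4, 2])
hindered = transfer_effect([2, 2], [5, 3])
print(helped, hindered)
```

A positive difference corresponds to the case in which the prior readinesses are inferred to be similar to those required on the test trial, a negative one to the case in which they are inferred to be relatively incompatible.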

To test such a contention will, of course, require tremendous 
labor. And much ingenuity and insight will be required to devise and 
to carry out the needed experiments. Nevertheless, the notion I am 
arguing for does not seem altogether new. Learning psychologists have 
always assumed, I believe, that “habits” established in prior learning 
have an effect upon subsequent learning. And they have also assumed, 
I think, that the effect of such previous “habits” on the present learning 
will help to specify both the nature of those previous habits and the 
nature of the present habit being developed. The only really new 
gimmick I am introducing is the assumption that a habit, as a
generalized disposition, is to be conceived, not as a simple s-r connection, but
rather as an s1r1 → s2 connection or an s1 → s2 connection. And it is
these s1r1 → s2 and s1 → s2 connections which transfer and combine
with one another in complex ways. But the laws of these transfers and
interactions and their generalization are, I must admit, still very
hazy.²⁶

²⁵ At this point it may be well to emphasize again that what I am calling
means-end readinesses or beliefs are really of two types. On the one hand, they have the
general form (s1r1 → s2) and, on the other, they have the general form (s1 → s2).
Perhaps the term “means-end readiness” is really appropriate only to the first
type. Given the first type of belief, the organism is ready to expect that, if an
instance of s1 is presented, a response of the r1 type will lead to an instance of
s2. Given the second type of belief, the organism is ready to expect that, if an
instance of s1 be presented, then

Let me sum up. A means-end readiness, as I conceive it, is a
condition in the organism which is equivalent to what in ordinary
parlance we call a “belief” (a readiness or disposition) to the effect that
an instance of this sort of stimulus situation, if reacted to by an
instance of this sort of response, will lead to an instance of that sort of
further stimulus situation, or else, that an instance of this sort of
stimulus situation will simply by itself be accompanied, or followed, by
an instance of that sort of stimulus situation. Further, I assume that the
different readinesses or beliefs (dispositions) are stored up together
(in the nervous system). When they are concretely activated in the
form of expectancies they tend to interact and/or consolidate with one
another. And I would also assert that “thinking,” as we know it in
human beings, is in essence no more than an activated interplay among
expectancies resulting from such previously acquired readinesses
which result in new expectancies and resultant new means-end
readinesses.²⁷

²⁶ The difficulty of devising any really good or easy empirical pointer readings
for means-end readinesses (beliefs) has, however, made me wonder at times
whether this concept should not be abandoned. And actually in various recent
discussions [87, 88] I have got along without it. The concept did, however, appear at
some length in my original book [78]. If it were to be omitted, one could still
perhaps get by with the concept of more general expectancies—these more general
expectancies being a substitute for the concept of means-end readinesses—and
this is what was actually done some years ago in a joint article with Krechevsky
[90]. There is, however, a second feature in the original concept of the means-end
readiness (belief), as contrasted with the concept of the expectancy, that I would
like to preserve, namely, the notion that the means-end readiness (belief) is an
enduring disposition, whereas the expectancy is a specific then-and-there activation
(usually more specific though not necessarily so) of this disposition.

Furthermore, as a “cryptophenomenologist” it seems to me that a means-end
readiness or a belief is phenomenologically a rather different thing from an
expectancy. A belief is an “if this sort of thing, then that sort of thing.” An expectancy,
on the other hand, is a “now this concrete thing, therefore, that concrete thing.”
And this difference I would like to preserve and I would like to hope that
eventually we can find really good and different sorts of pointer readings to parallel
this difference.

Finally, it will be recalled that it was suggested above that means-end
readinesses interact by telescoping or multiplying with one another, thus producing new
ones which result in new expectancies and hence new performances. And it was
also suggested that such interactions among means-end readinesses explain
short-cuts, Umweg, insight, and reasoning. I would now like to modify these
suggestions merely to the extent of emphasizing that it was also said that such
interactions take place among “activated means-end readinesses” and it has now been
emphasized that activated means-end readinesses are really expectancies.
“Thinking,” in short, is probably to be thought of as an interaction among expectancies
which produce new expectancies. And then these new expectancies leave
corresponding new means-end readinesses. And it is these latter which are
then in the organism’s equipment for future occasions.

Perceptions, expectancies, representations, and valences. Turn first to Ŝ1.
This is the symbol for the “perception” of S1*. What are the empirical
pointer readings for such a perception? Apparently what in practice
we actually do is to substitute, one by one, changes in the given
stimulus situation to discover which of such changes do and which ones
do not disrupt an established response. In this way we discover the
precise range of physical S1*'s which will be equivalent in the sense
that any one within this given range will tend to produce the given
behavior. The Ŝ1 is thus defined by this range of objectively defined
S1's which do not cause disruption. An Ŝ1 is, in short, a reevaluation by
the experimenter of the physical S1's in terms of what variations the
latter can have and still produce the same response. In essence, it
would seem that what we do is to perform the equivalent of a
psychophysical experiment (save that instead of using verbal judgments we
use the going off, or not going off, of some overt response as our
indicator). A couple of concrete examples would be the experiments of
Honzik [25, 26] with the 14-unit elevated T maze and an experiment
of Ritchie’s [71] with the elevated single T maze. Many more
experiments of such types are, however, needed.
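
The disruption procedure can likewise be sketched in a few lines. This is an illustration under assumptions not made in the text: the stimulus is varied along a single numeric dimension, each variation is scored simply as disrupting or not disrupting the established response, and the function name `equivalence_range` and the sample values are hypothetical.

```python
def equivalence_range(trials, trained_value):
    """Find the contiguous range of stimulus values around the trained
    value that leave the established response undisrupted.

    trials: iterable of (stimulus_value, disrupted) pairs, one per variation
    trained_value: the stimulus value used in the original training
    Returns (low, high), or None if the trained value itself was not
    tested or was scored as disrupting.
    """
    outcome = dict(trials)  # stimulus value -> was the response disrupted?
    if outcome.get(trained_value, True):
        return None
    values = sorted(outcome)
    low = high = values.index(trained_value)
    # Extend outward only while neighbouring values leave the response intact.
    while low > 0 and not outcome[values[low - 1]]:
        low -= 1
    while high < len(values) - 1 and not outcome[values[high + 1]]:
        high += 1
    return values[low], values[high]


# Hypothetical variations of one stimulus dimension (arbitrary units).
observations = [(10, True), (20, False), (30, False),
                (40, False), (50, True), (60, False)]
print(equivalence_range(observations, trained_value=30))
```

The returned interval plays the part of the range of physical stimuli that are treated as equivalent; the isolated undisrupted value at 60 is excluded because it is not contiguous with the trained value.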

Further, we shall also need pointer readings for the additional
feedback stimuli (behavior-supports) involved in an Ŝ1. Although I
believe such feedback stimuli to be a most important feature of an Ŝ1,
neither I nor anybody else seems to have done any good experiments
with rats on this problem. They would have to be experiments in
which the maze or other apparatus “looked” as if it would support such
and such behaviors (provide appropriate feedback stimuli for them) but in
fact would not. It is to be noted that the experiments on visual illusions
with human subjects initiated by Ames and expanded by Cantril and
his associates [34] come closest to what I have in mind. What we want
to do is to vary the actual geographical, mechanical, and gravitational
features with their resultant feedback stimuli connected with a given
“appearing” setup and see which variations in these resultant feedback
stimuli would and which ones would not cause disruption of (make
impossible) the initial response produced by S1.

²⁷ In order to draw the teeth of possible critics I am ready to admit that this
matter of pointer readings for intervening variables—the means-end readinesses
(beliefs) and also the expectancies, the representations, the perceptions, and the
valences to be discussed below—is the weakest part of my theory. I would like to
suggest again, however, that the hypothetical little r_g's and s_g's of the Hullians
are almost equally difficult of empirical verification. The experiment of Seward
[73] which showed that a white box at the end of the maze must produce a little
r_g and resultant proprioceptive s_g different from those produced by a black box
seems to indicate that the hypothetical r_g's and resultant s_g's can no longer be

and phenomenology and talk directly about “believed,” “expected,” “represented,”
and “perceived” entities for which, by using many behavioral experiments, we may
hope eventually to get pointer readings. If we can’t ever get adequate behavioral
pointer readings, then, it seems to me, we shall have to give up and wait (and it
may be a good long time) for our biochemical and neurophysiological friends to
catch up and help us out. But I don’t think that in the meantime “fake” peripheral
constructs such as little r_g's and s_g's are going to help very much. (But this, no

Consider next D̂_Hi. This, it will be remembered, is the symbol for
the discriminative perception (not necessarily, or usually, conscious)
of the hunger drive-stimulation. That is, it is the symbol for that range
of drive-stimulations D_Hi's (probably specifiable today only in terms of
degrees of deprivation, see above) over which variations can be made
without causing a disruption of the behavior. That is, there will be a
range of the actual D_Hi's which will all be reacted to as alike (i.e., not
cause disruption) in the character and magnitude of the behavior.
This range of D_Hi's will define the D̂_Hi.

With human beings we can do this perhaps fairly easily. We can
discover those ranges of actual drive-stimulations (measured by hours
of deprivation or what not) which will be reported as alike by the
subject and those which will not. Again this would be like a
psychophysical experiment.

I say here nothing about (and the chart does not include) D̂_Ht's.
For, according to my analysis, the D_Ht's, the terminal
drive-stimulations, are not perceived, represented, or valenced. A better and more
complete analysis might someday have to include them. But I shall
leave that for the future.

Next turn to the expectancies [(Ŝ1R1 → Ŝ2) · (Ŝ2 → Ŝ_m)]. Such
expectancies, it is to be remembered, are not more than activated
readinesses from the storehouse of readinesses plus the perception (Ŝ1)
of the presence of an initial sign stimulus which fits into the category, s1.

The question as to the empirical pointer readings for such
expectancies can be answered, I believe, by varying the actual
(S1R1 → S2)-type sequences in the physical and physiological worlds and
discovering which ranges do and which do not allow the given behavior to go
off undisrupted. Again we need psychophysical sorts of experiments.
But this time ones re external and internal environmental sequences and
not merely re the individual items in these sequences.

Finally, we have to consider the problem of pointer readings for
the represented and valenced Ŝ_m+ and Ŝ_m̄-. The discriminated
(effective) D̂_Hi leads to a positive valence on the “represented” Ŝ_m and a
negative valence on the “represented” Ŝ_m̄. How do we get pointer
readings for these expected stimuli with their “expected” values (i.e.,
their valences)?

Miller and his associates, as we saw above, indicated a way of
specifying both the characters and the positive valences by
determining how much more milk rats of a given deprivation (D_Hi) would
drink when a given S_m has been artificially provided (by blowing up
the stomach, by putting actual milk into the stomach, or by putting it
into the mouth). And in this way it was discovered in a rough way
what internal stimuli served as goals and how strong their respective
positive valences were.

But many more experiments need to be done with different values
of D_Hi. (Miller and associates used only one value of D_Hi—very
carefully controlled in terms of time of test since previous feeding or since
preceding injection and also by careful control of daily caloric intake.)

Finally, the pointer reading for the dependent variable (R1*) is
merely some agreed-upon objective observation of whether the rat 
actually goes down the alley, crosses a certain mark with his tail, or 
what not. 

Summary re pointer readings. Let me briefly sum up what has been 
said concerning empirical definitions (pointer readings). Most of the 
independent and the dependent variables are to be specified by agreed- 
upon statements as to what external or internal environmental events 
(i.e., Σ's, S1's, R1's, S2's, S_m's, etc.) are or are not, have or have not been,
present. In the case of drive states (D_Hi's and D_Ht's) the pointer readings
are less clear. At present we have to satisfy ourselves with such meas- 
ures as hours of deprivation, percentage losses in body weight, or 
approach and/or eating tendencies to food in general in some agreed- 
upon “standard situation” such as the Skinner box. 

Means-end readinesses (beliefs) are to be got at by transfer experi- 
ments. But it must be admitted that precise pointer readings for them 
have not yet been achieved. 

Perceptions, expectancies, and representations are to be got at by
disruption experiments carried out relative to each specific concrete 
setup. In essence these disruption experiments would be psycho- 
physical types of experiments. But again it must be admitted that the 
ground has only been scratched as far as getting precise pointer read- 
ings is concerned. Positive and negative valences are also measured by 
disruption experiments in which goal stimuli with various “values” 
are provided artificially to see to what extent these do or do not modify 
the original behavior. Again only a beginning in the direction of the 
necessary studies has been made. 


In the above discussion of pointer readings I have slid over the
problem of r's and R's. The r's and r̄'s in the means-end readinesses and
the R's and the R̄'s in the expectancies must also be got at by pointer
readings. We must know what r's or r̄'s the organism brings, as
readinesses, to the given situation and what concrete instances of these r's
and r̄'s (i.e., R's and R̄'s) are activated by the concrete presentation
of Ŝ1.

Also it must be noted that the r's and r̄'s and the R's and R̄'s have a
further aspect, in addition to their mere qualitative identifying
characteristics. This further aspect is their work or uncertainty characteristics
(see above). The organism brings not only a belief that a given type of
r or r̄ will get him to a given type of s2 and has not only an expectancy
that a concrete instance (R or R̄) of such an r or r̄ will in the given
situation get him to a concrete Ŝ2, but he also believes and expects that
given specific amounts of work or effort will be involved. For, as has
been indicated, I further assume that accompanying any major
drive-stimulation there is always also a drive-stimulation against work. As a
result of this drive-stimulation against work there will be negative
valences on the R's and R̄'s contained in the expectancies. For the chart
really to have been complete a D_W* and a D̂_W (drive against work)
should have been added and negative valences should also have been
added to these represented R's and R̄'s. I have left all these out because
it would have made the chart too complicated. I have also left them
out, I expect, because I do not know how to get good pointer readings
for these work features and their accompanying valences and have not
really thought the problem through.

So much for the discussion of a case of simple approach to food 
when the animal is hungry. Let us turn now to a case of simple escape. 

A case of escape. Figure 2 represents the case for a simple instance
of escape. The complex symbol

Σ{[D_Fi][(S1 → S_p-) · (S1R1 → S2) · (S2 → S_p̄+)] → [D_Ft]}

stands for the frequencies, recencies, and distribution of previous trials
in the given situation (or in some similar situation) in which, under an
initial fear drive-stimulation, i.e., a degree of “fearfulness” or
“timidity,” D_Fi, a stimulus situation S1 has led to, or been accompanied by,
a negatively valued pain stimulus S_p-; and a manipulatory response
R1 to this S1 has led to a new stimulus situation S2; and this new
stimulus situation S2 has led to (or been accompanied by) a positively
valued nonpain stimulus S_p̄+; and the whole experience has resulted in
some new degree of fearfulness D_Ft, which may have been greater than,
less than, or equal to the original fearfulness D_Fi.

In the usual rat experiment the S1 is, of course, a box with an
electrified grid on the floor. The S_p- is the resulting pain stimulus
from the grid. The R1 is the response (performance) of jumping off,
running off, pressing a treadle, or whatever. And the S2 is another
adjacent box or another part of the same box with no grid in it, or, in
the case of the treadle, it may be the same box and grid but now with
the treadle down.

Look next at the symbols for the means-end readinesses (beliefs) 
which I assume to have got established as a result of such previous 
trials. These means-end readinesses or beliefs are represented by the 


[Fig. 2. Escape. Diagram columns: Independent variables (Past, Present); Intervening variables (Means-end readinesses; Expectations, perceptions, representations, valences); Dependent variable.]


and that an r1 type of response to this s1 will lead to a new external
type of stimulus situation s2; and that this latter will lead to (or be
accompanied by) a nonpain type of stimulus situation s_p̄.

Look back now at the independent variables (in the second
column)—those assumed to be “present” at the beginning of the given
trial. These are represented by the symbols S1* and D_Fi*: S1* stands for
the box with the grid as it is presented to the rat on the given trial.
And D_Fi* stands for the degree of timidity (fearfulness) which the
animal brings to this trial.

Consider next the second column of intervening variables. We see
that the external stimulus of the box and grid S1* is conceived to lead
to a perception of itself Ŝ1, and that the independently measured
fearfulness, or timidity, is conceived to lead to a discriminated
(“perceived”) effective fearfulness D̂_Fi. The diagram also indicates that D̂_Fi
may affect the perception Ŝ1. Again we see that Ŝ1, combined with the
readinesses produced by the preceding trials, is assumed to produce, on
the given trial, a set of concrete expectancies as follows: that the
perceived Ŝ1 will lead to, or be accompanied by, an expected pain stimulus
Ŝ_p; and that the response R1 to the perceived stimulus Ŝ1 is expected to
lead to a different stimulus situation Ŝ2; and that this latter will be
accompanied by a nonpain stimulus situation Ŝ_p̄. Finally, it appears
that the represented Ŝ_p combined with the perceived timidity D̂_Fi on
the given trial will cause a negative valence on Ŝ_p (that is, Ŝ_p-); and
that this D̂_Fi combined with the represented Ŝ_p̄ will also cause a
positive valence on Ŝ_p̄ (that is, Ŝ_p̄+).

Finally, we see that the expectancy complex (with its included
representations and their attached valences) is assumed to determine
the strength of the dependent variable R1*—the actual tendency to
jump off, to press the lever, or whatever it may be.²⁸

Empirical pointer readings. The methods of getting empirical pointer
readings for these variables will be much the same as they were in the
approach case.

Independent and dependent variables. Σ will be measured by timings
and countings. D_Fi and D_Ft will be determined by some measure of
a standard fear response in a standard situation. And the various
S's and R's will be specified by common-sense physics or, in the case of
S_p, by some as yet perhaps very largely hypothetical physiology.

Intervening variables. The readinesses caused by the previous trials
or other previous training (s1 → s_p) and (s1r1 → s2) and (s2 → s_p̄) will
be determined by transfer experiments. And the expectancies,
perceptions, representations, and valences on the given trial will be
determined by disruption experiments in the given apparatus for the
given trial. All this will certainly be a difficult job. But this does not,
I believe, invalidate the conceptual and experimental suggestiveness of
such constructs.

Case of avoidance. Let us turn now to a case of simple avoidance
learning, as diagramed in Fig. 3. In the column for “past” independent
variables we find the complex symbol:

Σ{[D_Fi][(S1R1 → S2) · (S2 → S_p-) + (S1R̄1 → S_p̄+)] → [D_Ft]}

This differs from that for the setup for escape in that here the animal
is not presented with pain S_p in the presence of S1. But, if he makes a
response R1 to S1 this leads him to S2 and then this S2 will bring about,
willy nilly, a negative pain stimulus S_p-. Or, concretely, in the usual
experiment the rat is placed in a box or alley in which he does not get a
shock unless he makes some specified response R1, unless, that is, he
steps into another compartment, or presses a lever or goes in a certain
direction, or the like. Such an R1 leads to S2 (a new stimulus situation).
But getting to this S2 means that a shock is encountered, that is, a pain
stimulus, S_p-. If the rat does not do R1 (i.e., if he does anything which
may be labeled R̄1), then he remains in the presence of nonpain S_p̄.

²⁸ Again a negative work valence on R1 should have been included.


[Fig. 3. Diagram columns: Independent variables (Past, Present); Intervening variables (Means-end readinesses; Expectations, perceptions, representations, valences); Dependent variable.]
The rest of the diagram follows in the same manner as in Figs. 1
and 2 and needs no special comment. The one distinctive feature of this
avoidance case is that the dependent variable measured now is the
degree to which the animal does not do R1—in other words, the
magnitude of R̄1*.²⁹

Empirical pointer readings will present similar and equally diffi- 
cult problems as in the cases of approach and escape. But I will not 
attempt to spell them out. 

²⁹ Here, also, there will be negative valences on the amounts of work involved
in R1 or R̄1.

Choice-point learning. Let us consider now the very common
type of setup, that of a maze or discrimination box, in which the rats
are faced, not with the alternative of merely doing something or not doing it,
but rather with the alternative of doing one or another of two different
things. The single T maze or Y maze or the simple discrimination box
are examples. The analysis for such cases is presented in Fig. 4.

The really distinctive difference between this figure and the three
previous ones is that here, in Fig. 4, the alternative character of the
two responses R1 and R1' is emphasized and also the fact that these
two responses lead to two alternative consequences (S2 and S2'), one
of which then leads to a positive goal stimulus (S_m+) and the other to a
negative nongoal stimulus S_m̄-. This has the further consequence that
in observing the final behavior the experimenter records not only
the proportion of R1 responses but also the gradually decreasing
proportion of R1' responses. And at certain definite stages of learning the
experimenter may also observe and record tentative oscillations
between the two (i.e., vicarious trial and error, VTE).

Fig. 4. Choice-point learning

Of course, it might have been equally appropriate in the three
previous figures to have indicated the alternativeness between R1 and
R̄1. But such experiments are usually set up so that R̄1 is not a clearly
identifiable entity. It may be anything different from R1 such as
running to the other end of the box, cleaning the whiskers, or it may be
merely sitting still. Therefore, the three previous charts seem a more
appropriate depiction of the facts as we now have them. In the future,
however, experiments may perhaps be carried out in which the R̄1's can
be given more precise experimental specification. If this latter proves
the case, figures more similar to Fig. 4 will also become appropriate
for the three previous cases.

Figure 4 as a whole follows the same principles as the previous 
ones and needs no new special comment. And the problems of the 
empirical pointer readings will also be much the same. 
Fig. 5. Early in learning—first period in latent learning

Early in learning and latent and incidental learning. I shall
turn now to an analysis of the situation in which again two alternative

Fig. 5 the situation is one in which both alternative consequences are,
to all intents and purposes, equally good. Either the rats have not yet
learned that there is a difference between the two outcomes or, as in
the first phase of the latent or incidental learning experiment, actually
there is no difference between the two outcomes.

The distinctive new assumption which now has to be added and 
which is represented in Fig. 5 is the assumption that any main drive 
such as hunger will also have associated with it an auxiliary cognitive 
or curiosity drive (see footnote 8), represented by symbolizing the 
initial drive-stimulation as D_{h,c}, where the “c” indicates the curiosity
drive. And also in line with this same assumption I have symbolized
the goal stimuli as S_c^+. I have assumed, in short, that any
drive-stimulation such as hungriness (or fearfulness) has to some degree a
pure cognitive or “curiosity” drive-stimulation associated with it. 
This latter has its own goal—namely, some characteristic pattern of 
external and/or internal stimuli, whatever it may be, which results 
from examining. And as long as the presence of the specific goal for 
the main drive is not present (as in the first phase of the Blodgett type 
of latent learning experiment) or is not “remembered” as in the early 
trials of the usual maze, then the animals get “rewarded” curiosity-wise
by all the alternative paths, and the animals take them approximately
50:50.

In later trials (or after food has been introduced as in the Blodgett-type
experiment) the valences S_m^+ and the S_n^- corresponding to
hunger far outweigh the mere curiosity valences S_c^+. And hence the
response of going in the food direction comes to dominate. That is to
say, Fig. 4 will then again more truly depict the situation.

Summary of the five learning paradigms. This ends my analysis 
of the common types of setup in rat learning experiments. I have pre- 
sented five paradigms for such experiments and I have indicated the 
variables which seem to be involved according to my system in each
paradigm. And I have suggested procedures for obtaining pointer
readings. There are, however, a number of very basic questions raised
by my system which have not yet been answered. The most obvious
question is: What do I conceive to be the quantitative laws of learning;
what are the relative importances of frequency, recency, and
distribution, on the one hand, and of “effect” (reinforcement) on the
other; what, then, do I consider to be the laws of learning?³¹

³¹ This may be as good a place as any to emphasize my very real indebtedness to
MacCorquodale and Meehl [50] for their analysis of my expectancy theory of
learning. I have borrowed a number of their concepts, although they may well not
recognize them as such. For example, I believe that my present introduction of the
(s_1 → s_2) type of means-end readiness and of the (S_1 → S_2) type of expectancy is

Laws of learning. Learning, as has been indicated, is conceived by
me as the acquisition of (s_1 r_1 → s_2) connections or of (s_1 → s_2) connections.
Such connections I have called means-end readinesses or beliefs.
It is the nature of the laws determining the acquisition of such readinesses
or beliefs which we are now to consider.

What are the conditions under which such beliefs are formed and 
strengthened? For the purposes of this discussion I shall consider only 
cases of 100 per cent original reinforcement (see footnote 31). This 
seems to be the type of case which most theorists have had in mind 
when they constructed their theories of learning. And it certainly was 
the one I had in mind when I constructed mine.³²

In general there are two main classes of conditions which have 
been advanced. These may be roughly identified with Thorndike’s 


closely related to their Postulate 4 [50, p. 242]. And previous to their discussion I
had never clearly seen that when, for example, I talked about “equivalence beliefs”
I was really talking about (S_1 → S_2) learning.

It was they also, I believe, who revivified my belief that mine is mostly an
(S_1 R_1 → S_2) type of theory and not, as Spence [75] suggested, primarily an
(S_1 → S_2) type of theory. However, as I have just said, I do now also include
purely (S_1 → S_2) types of learning.

³² It must be noted again in passing that beliefs or readinesses seem to have
two kinds of strengths. Thus, a belief may have strength because it holds that the
s_2 type of stimulus will follow the s_1 type of stimulus in a high percentage of
instances. Or a belief may have strength (even though the expected frequency of
the s_2 consequence is low), in the sense that it is held to very strongly. This latter
is, I suspect, the type of strength of belief we usually think of.

It should be noted further that there may be some form of interaction between


2 has followed in less than 100 per cent of the trials (i.e., partial reinforcement) 


trials to extinguish than will a belief established in a setup in which during learn- 
ing there had been 100 per cent reinforcement. But it may well be that a fairer 


then perhaps a fairer comparison would be between the number of four-trial
blocks necessary for extinction in the partial reinforcement case as against the
number of single-trial blocks necessary for extinction in the 100 per cent
reinforcement case.

Actually, however, I believe our theories and our experimental designs for
testing our theories of learning will need many revisions when we have allowed 
ourselves really to go to town on all the implications of partial reinforcement situa- 
tions or, as Brunswik would have phrased it, when we have really accepted the 
essentially “probabilistic” character [10, 89] of the environmental relations relative 
to which the organism usually learns.

laws of “Exercise” and of “Effect.” Under “Exercise” I should put all
the factors of frequency, recency, and distributions of trials
summarized by the symbol X in the charts. And under “Effect” I should
put such factors as the magnitudes and characters of the final values
(e.g., the S_m^+'s and the S_p^-'s and the S_c^+'s) and/or the terminal
drive-stimulations (the D_2's).

We now all agree in a general way as to how “exercise” (i.e., frequency,
recency, and distribution of trials) affects learning. But there
still seems to be considerable disagreement as to the nature and 
importance of “effects.” My own position in the past was that exercise 
factors determine the results almost exclusively. And it was, of course, 
the latent learning experiments [77] (in those cases in which latent 
learning was demonstrated) which gave me this bias. Recently, how- 
ever, the findings of Postman and Adams [69, 70] on incidental learn- 
ing in human subjects, together with further thinking about those 
negative cases with rats in which latent learning has not been found, 
have led me now to believe that “effect,” in the sense of final positive 
or negative values attached to the terminal stimuli, does also always 
play some part in favoring the acquisition of beliefs. 

It will be recalled that in Fig. 5, and in the discussion following it, I 
had to assume that during the latent (or incidental) learning periods 
there must be invoked a subsidiary curiosity (examining) drive which 
gives positive valences to all parts of the maze and especially to the 
terminal goal boxes and blind ends (even though no biological goal 
objects were then to be found in these latter).³³ Thus, even though in
the latent learning period of a latent learning experiment the rat finds
no positive biological goal in the goal box, he does find positive goals
for curiosity (S_c^+'s) in both goal boxes and blind ends. Thus I would now
tend to accept the Law of Effect in so far as I now believe that these
positive goals for curiosity at the ends of both true path segments and
blind ends do aid the acquisition of the beliefs as to the (s_1 r_1 → s_2) or
(s_1 → s_2) connections leading to such goal boxes and blind ends. Note,
however, that these effects, according to me, favor the learning of what
leads to the blind ends just as truly as they favor the learning of what
leads to the goal box.³⁴

Further, my contention would be that a biologically bad effect
(e.g., a shock or S_p^-) will be just as good and probably in some cases a
more favorable condition for learning an (s_1 r_1 → s_p) or an (s_1 → s_p)
than will be a biologically good effect (e.g., an S_m^+) for learning an
(s_1 r_1 → s_m) or an (s_1 → s_m).

³³ A recent article by Desiderato [14] has demonstrated that maintenance conditions,
which may be thought to favor the strength of a sort of exploratory (curiosity)
drive, do increase latent learning.

³⁴ The latent learning experiment [24] seemed to bear out this notion that in
the latent period the rats also learn which responses lead to the blind ends.

A related question also arises. When there is a strong biological 
goal (positive or negative) at one end of a choice, how much do the rats 
learn, not only where this positive (or negative) goal is, but also what is 
at the end of the other choice? In all the figures, I have allowed for 
both types of beliefs becoming established. But was this right? I know 
of no adequate experiments which answer the question. The Hullians 
seem to hold that rats learn only the one response, that leading to a 
positive goal, because it is the latter which is followed by “need reduc- 
tion.” Thus Mowrer [58] and Miller [52] and now many others seem 
to assume that in escape and avoidance learning the only thing which 
is learned is the response which leads to nonshock (or security) be- 
cause this is followed by need-reduction of the “conditioned anxiety.” 
But in my terms rats may learn not only what to get to but also what to 
get from. My hunch, nevertheless, would be that in most escape or 
avoidance learning the rat may well learn primarily how to get away 
from or avoid an S_p^-, and only in some special setups and under
special conditions will he also be learning how to get a specific 
complementary S_s^+.

Finally, however, in any case I do not believe that the rat (or the 
man) learns because after each trial the need (be it conditioned anxiety, 
a hunger, a thirst, a sex drive, a curiosity drive, or whatever) is reduced. 
An animal is not learning to get to need-reduction. The “effects” 
which favor learning (in so far as they do) are not those of need- 
reduction, because a need may often not be reduced but enhanced. 
These effects are rather the mere positive or negative values of the 
terminal goal stimuli, simply as such.³⁵

We must note now another quite obvious determiner of the acquisition
of beliefs, viz., that of the time intervals involved in (S_1 R_1 → S_2)
or in (S_1 → S_2) setups. These time factors might perhaps have been
included under “Exercise.” In any event, it is now usually accepted
that, generally speaking, the shorter the time intervals involved in
(S_1 R_1 → S_2) or in (S_1 → S_2) setups, the faster the learning.

Still another feature about learning, as I conceive the latter, should 
perhaps again be emphasized. In asserting that learning is primarily 
the acquisition of connections of the type (s_1 r_1 → s_2) and/or (s_1 → s_2) I
have asserted that these s's and r's are generalized entities. An s_1, an s_2,
an s_3, an s_4, an s_n, an s_p, an s_m, or an s_c each has some spread
along its own dimension of generalization. The presentation of a corresponding
S within such a spread will, to varying degrees depending
upon the shape of the corresponding generalization curve, be accepted
by the organism as an instance of the given type of s. Also, when the
organism has acquired a means-end readiness of the type (s_1 r_1 → s_2)
there will also be some spread of specific R's any one of which will, to
some degree, be accepted as an instance of the learned r. These problems
of generalization are, as we all know, among the most difficult and yet
most important of the empirical problems which we still have before
us. And I shall have nothing very final to say about them.

³⁵ Or, as suggested by the recent exciting work of Olds and associates [63, 64,
65] and of Delgado and Miller and associates [12, 13], the final “effects” may be
specific resultant neural excitations in specific parts of the brain.

Such generalizations have been investigated perhaps most often in 
setups for classical conditioning. (Classical conditioning, along with
the learning of nonsense syllable pairs, falls for me, as the reader will,
I hope, have guessed, under the rubric of (s_1 → s_2) learning.)³⁶ But
even here no simple laws have been found. Early in conditioning the 
generalization spread of sı seems to be broad. But later it becomes 
more narrow. And in some stages of learning even a stimulus along a 
totally different physical dimension will sometimes evoke the condi- 
tioned response. Thus, if the animal has been conditioned to salivate 
to a sudden bright light, it may be found that at some stages of the 
learning he will also salivate to a sudden buzzer.
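The narrowing of the generalization spread described above can be pictured with a toy gradient. This is only an illustrative sketch: the text states that no simple laws have been found, so the bell-shaped form, the function name `acceptance`, the `width` parameter, and the stimulus values are all assumptions introduced here, not part of the system.

```python
import math

def acceptance(stimulus, trained, width):
    """Degree (between 0 and 1) to which `stimulus` is accepted as an
    instance of the trained s_1, under an ASSUMED bell-shaped gradient.
    `width` controls how broad the spread is along the dimension."""
    return math.exp(-((stimulus - trained) ** 2) / (2.0 * width ** 2))

# Same test stimulus, ten units away from the trained value.
early = acceptance(stimulus=60.0, trained=50.0, width=20.0)  # broad spread, early in conditioning
late = acceptance(stimulus=60.0, trained=50.0, width=5.0)    # narrow spread, later in conditioning
```

With these assumed values the same off-target stimulus is accepted strongly under the broad gradient and only weakly under the narrow one. The sketch covers a single physical dimension, so it does not represent the spread from a light to a buzzer mentioned in the text.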

Also, in the case of instrumental (s_1 r_1 → s_2) learning, cases of spread
from one physical dimension to a stimulus in quite another physical 
dimension were demonstrated in monkeys by the early and important 
experiments of Klüver [37]. 

Work has also been done on generalization in discrimination experiments
with rats. Rats which have been taught to discriminate
white and black (i.e., have learned to accept white as the s_1 which,
when jumped to r_1, leads to getting to the food platform s_2 and to accept
black as the s_1' which, when jumped to r_1', leads to a locked door
s_2') will sometimes with practically no further training discriminate a
light gray door from a dark gray door (see, for example, [91]). But
here, also, much more work needs to be done to arrive at truly general
principles.³⁷


³⁶ Thus, for example, in simple salivary conditioning, S_1 is the conditioned stimulus
(light, buzzer, or whatever) and S_2 is the unconditioned stimulus, food. Or in
nonsense syllable pair learning an S_1 is the first syllable of any pair and S_2 is the
subsequent syllable.

³⁷ Haire [21] has recently done some human experiments which seem to indicate
that continua along which perceptual discriminations generalize eventually
have to be defined in “phenomenological” terms if any real sense is to be made
out of the results.

Further, it is to be noted that below, under the heading of “Principles of performance,”
what I have really done is to suggest a specific theoretical concept as
to the laws of S_1 generalizations.

Again, almost no work has been done on the generalization of responses
(performances) (i.e., what range of R's will, to a greater or
lesser extent, be accepted as instances of r_1). If the rats have been
taught, for example, the response r_1 of “going towards the light,”
how far will this “going towards the light” spread to going slightly to
one side of the light? Or, if they have learned to make a right-angled
right-hand turn r_1, what range of other right turns (acute or obtuse)
will this spread to? We really don’t know. And, finally, practically
nothing seems to have been done on s_2 generalizations.

To sum up: I believe that in the learning of connections of the
type (s_1 r_1 → s_2) or (s_1 → s_2), exercise is more important than
effect. I do believe, however, that effect, in the sense of final positively
or negatively valued goal stimuli and not in the sense of need-reduction,
always does play some part (even if these positive goal stimuli are
only those that result from cognitive exploration). Finally, however,
the influence of effect is perhaps especially important in simple
(s_1 → s_2) learning, where s_1 is, say, a shock-box and s_2 is the resulting
pain stimulus.³⁸ I also believe that the time relationships are very
important. And finally, I assert that the s's and r's involved represent
generalization gradients and not single values.

There are still a few other points to be considered. In various earlier 
writings [78, 84, 85] I have invoked a phenomenon of “fixation.”³⁹
What I referred to by this term was the fact that under some conditions 
—such as those of extreme overlearning, damaged brain, overintense 
motivation or a conflict situation—the beliefs acquired seem to become 
unduly resistant to extinction. That is, even though the goal, the S_2, was
removed, the animals kept on responding, as if it were present, for 
an unduly long time. In short, what the rats ended up with seemed 
relatively automatic and not much like a “belief.” So I called them 
fixations, rigid “cognitive maps” [85]. At the present time, however, I 
feel that this sharp distinction between a supposed means-end readiness
or belief, on the one hand, and a supposed fixation, on the other,
was probably a mistake. Some means-end readinesses (beliefs) of 
either the type (s_1 r_1 → s_2) or (s_1 → s_2) can be thought of as becoming
unduly resistant to extinction and as taking a lot of negative evidence 
(extinction trials) to break them down. But there seems to be no sharp 
line between such overfixated beliefs and the more easily changeable 


³⁸ This type of (s_1 → s_2) learning I have in another place [84] called the acquisition
of “equivalence beliefs.” Recently Spence [76] has also agreed that “effect” may
perhaps be primarily important in this latter (s_1 → s_2) type of learning.

³⁹ For an early experiment on overlearning as a cause of fixation see Krechevsky
and Honzik [40].

ones. The differences should, therefore, I believe, be thought of as
ones of degree and not ones of kind.

Still another point. It must be noted that I have proposed no 
quantitative laws of learning other than “more” and “less.” I have not
attempted to write equations, draw curves, or establish constants. It 
has seemed to me obvious that the precise equations, the shapes of the 
curves, the constants, would all be influenced by the specific features 
of the given apparatus and the specific training procedures from which 
they were obtained. Hence they will have little generality. I do, of 
course, however, subscribe to the now well-substantiated fact that, in 
general, learning curves are negatively accelerated and that they tend 
to reach asymptotes.
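The one quantitative property endorsed in this paragraph, a negatively accelerated curve that levels off at an asymptote, can be illustrated with a minimal sketch. The exponential-approach form and the `asymptote` and `rate` parameters are assumptions chosen purely for illustration; the text deliberately commits to no particular equation or constants, since these would depend on the apparatus and training procedure.

```python
import math

def learning_curve(trial, asymptote=1.0, rate=0.3):
    """Performance after `trial` trials under an ASSUMED exponential
    approach to `asymptote`; `rate` is an arbitrary illustrative constant."""
    return asymptote * (1.0 - math.exp(-rate * trial))

def increments(n_trials, asymptote=1.0, rate=0.3):
    """Trial-to-trial gains over the first `n_trials` trials. For a
    negatively accelerated curve each gain is smaller than the one before."""
    values = [learning_curve(t, asymptote, rate) for t in range(n_trials + 1)]
    return [later - earlier for earlier, later in zip(values, values[1:])]

gains = increments(10)
```

Each successive gain in `gains` is positive but smaller than its predecessor, and performance never exceeds the assumed asymptote. Any other form with shrinking gains and a ceiling would serve the illustration equally well.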

Let us turn now to the problem of “secondary reinforcement.” My 
feeling is that this at present ad hoc concept is pointing towards some 
set of facts which is very important but about which we have as yet 
very little real empirical understanding. A “secondary reinforcement” 
I would think of as merely a “subgoal” on the way toward some more 
final goal. And this subgoal has acquired, at least temporarily, a positive
goal value. Thus, if a chain such as (S_1 R_1 → S_2) · (S_2 R_2 → S_3) ·
(S_3 R_3 → S_4) · (S_4 R_4 → S_m^+) has often been repeated, then under certain
conditions (which no one has yet clearly specified) one or more of
the subterminal S's seem to take on, at least temporarily, positive value.
For example, S_4 (the food) may acquire such temporary value; or S_3
(the goal box) may acquire it; or S_2 (a poker chip) as in the Cowles
experiments [11] or in the Lambert, Solomon, and Watson experiment
[41] may acquire it. Then the sequence of behavior would read
(S_1 R_1 → S_2^+) · (S_2 R_2 → S_3^+) · (S_3 R_3 → S_4^+) · (S_4 R_4 → S_m^+), with positive
values on S_2, S_3, and S_4 as well as on S_m, such values decreasing
in magnitude from right to left. But, if I am not mistaken, this has been
demonstrated only when the initial hunger stimulation D_h continues
to be present. The rat or chimpanzee (or child) will, provided it is
hungry, continue for some time to go to food S_4, even though prevented
from eating, or to the goal box or vending machine S_3, even though
food no longer be present in it. And the chimpanzee or the child will
continue to pick up poker chip tokens S_2 and do things to get the
latter, provided it is still hungry, even though it can no longer use
them in the vending machine for food or candy. But there is no clear
evidence as yet that the animals will do these things when hunger is no 
longer present. That is, as yet there seems to be no good evidence that 
such subgoals get set up in their own right. Or, in other words, there
is no good evidence that there develop real independent second-order
drives relative to these types of subgoals. We have no certain proof that
rats or chimpanzees or children come permanently to love poker chips
when they are no longer subgoals to food or candy.⁴⁰

Yet, it still seems probable that at least adult human beings have 
developed second-order drives. It does seem as if Allport’s concept of 
“functional autonomy” [1] does hold for them. For most of the good 
present-day work on the psychology of human motivation has had to 
assume and seems more or less successfully to have demonstrated that 
human beings do have second-order drives.⁴¹ So I am strongly in favor
of further experimentation along these lines. But at present I am rather 
troubled at the loose and ad hoc invocation of “secondary reinforcement”
(or, in my own language, of “subgoals”) whenever one finds it
convenient to do so. 

So much for a review of my notions of learning. Let me turn now to 
a brief analysis of the notions as to motivation which are involved in the 
system. 

Motivation. In introducing D_1's into the figures I was influenced
by two considerations. The first was my belief that animals respond
only when energized. Only when hungry or fearful or sexually aroused
or curious (exploratorily inclined) or whatever, do animals and human
beings perceive and behave and hence have a chance to learn. The second
was my belief that only if animals are thus energized (motivated) in
some specific way, do the goal stimuli such as S_m^+, S_n^-, S_p^-, S_s^+, and
S_c^+ have positive or negative values and hence lead to expected valences
and bring about actual performances. Thus, as I pointed out
earlier, the diagrams really ought to have included causal arrows from
the D_1's to the pluses and minuses on the goal stimuli. (I left these
arrows out simply in order to make the drawings less complicated.)

In introducing D_2's I also wanted to allow for the fact of need-reductions
or of possible need-enhancements as a result of reaching the
final positive or negative goal stimuli. However, as has been indicated, 
I hold to the view that, in so far as “effect” does favor learning, this 
“effect” is not the changes in the drive-stimulation per se but rather 
the mere final presences or absences of certain types of terminal 
stimuli. It is the positive or negative values on these stimuli (e.g.,
taste, stomach, and nutritional stimuli in the case of hungriness, 
pain stimuli in the case of fearfulness, and some sort of as yet unspecifiable
stimuli in the case of cognitive exploration or curiosity) which
are the ultimately positive or negative goals or “reinforcers.” And,
motivationally speaking, it is these which the animal wants or does not
want. He runs the maze to get to them and escapes or avoids the electric
grid to get from them.⁴² He does not, I am asserting, run the maze
to lower his hungriness or his curiosity nor jump off the grid to lower
his fearfulness.

⁴⁰ There is, however, a little evidence in one experiment by MacCorquodale and
Meehl [49] that rats tend to go more to a food and water box than to an empty
box even when satiated for both food and water. This could, however, be explained
by me by the greater exploratory (curiosity) opportunities provided by a
baited box.

⁴¹ Consider, for example, McClelland’s need “achievement” [46].

It must be pointed out also, however, that in the cases of positive 
appetites (e.g., hunger, thirst, sex, etc.) it may well be that when these 
become intense there undoubtedly arise simultaneous pains. To get 
away from these pains may then be added to the drive to get to the 
positive goal stimuli. For example, in the case of food hunger the 
animal may be not merely seeking the “pleasant” mouth, stomach,
and nutritional stimuli but also seeking to escape the “pain” of the 
hunger pangs. To discover operationally how much he is seeking the 
positive goal stimuli and how much he is avoiding the negative pain 
stimuli will undoubtedly prove a difficult task. It may be that at the 
behavioral level we can discover no distinguishing criteria, although I 
would not be sure of this. But eventually with increasing neurophysio- 
logical techniques one ought to be able in the end to specify how much 
he is doing the one and how much the other. Furthermore, it must be 
pointed out that, in so far as there be such an initiating pain, the 
symbol for the complex of independent variables in, say, the case of 
hunger would have to be expanded to read as follows: 


2[D,][(S,~) + (HRı > Sy) * (SIRF — Smt) * (HR > SP) + (S7 > Sa) 

(Smt > Sp*)] > [Du,}} 
And similar expansions would also have to be made in the expressions 
for the intervening variables in the means-end readinesses column and 
in the column for expectancies, perceptions, representations, and 
valences.

Finally, it must be pointed out that I have taken no stand on the 
number and types of drive-stimulations. I assume some basic biological 
drives, such as hunger, thirst, sex, and fear, and I have added curiosity 
or exploratoriness. These are the ones I have used in the figures. But 
there may be “higher” drives such as, say, the drive for “self-realiza- 
tion” which Maslow [51] and others have argued for, and there may be 
acquired second-order and third-order drives, such as the drive for 
wealth, the drive for achievement, the drive for prestige, the drive for a
particular bit of property, etc., etc. Assuming that there be such
higher- and second-order (and perhaps even third-order) drives, I 
would have to assert that two features must hold for each of them. 


⁴² Or, as suggested in footnote 35, he runs to get to or to escape from or to
avoid certain correlated and resultant neural excitations in the brain.

First, there must be certain defining (and ultimately characterizable)
patterns of external and/or internal stimuli which constitute the defining
positive and/or negative goals for each of them. And, secondly,
it must also be assumed that for each such drive there is an
ultimately discoverable correlated neurophysiological condition (see
again [72]). Thus the question of the kinds and numbers of such drives
will ultimately require both further behavioral and further neuro- 
physiological investigations. 

Furthermore, it may be pointed out, however, that in the mean- 
time psychologists do seem to have arrived already at a good many 
humanly important facts by assuming such secondary drives, even 
though these latter cannot be rigidly identified. For, given the postula- 
tion of these drives, psychologists have studied with fruitfulness and 
success such processes as: satiation and aspiration-level [44]; the rela- 
tive weights of such secondary drives in different personality structures 
[62, 45]; and the differences between the hidden and manifest strengths 
of these drives in different personality types [16]. 

Let me turn now in conclusion to a last feature of my system which 
has not yet been adequately touched upon and which must be ex- 
plicated if the system is to hold water. It has to do with what I would 
call the “principles of performance.” 

Principles of performance. There are three questions to be asked: 
(1) what are the laws or principles governing the intersubstitutability 
of motor skills; (2) what is the nature of the function by which the 
magnitude of the performance of a single approach, escape, or avoid- 
ance is to be predicted; (3) how do two or more simultaneously 
aroused performances interact to produce some new or compromise 
performance? 

1. The first question, that of motor skills, has already been touched 
upon. And I am capable of no proper answer to it. Concrete instances 
of the question would be as follows: why is it that a rat which has 
learned to run a maze can then, with practically no upset, swim it or 
vice versa [47]? Or why is it that a rat which has learned to jump over a
gap to choose a correct door will then, with very little upset, run across 
this gap correctly (if connecting paths be introduced)? (Actually, I 
believe, this latter experiment has not been done but we can be rela- 
tively certain of its outcome.) Or why is it that a rat in a Skinner box 
can interchangeably press the lever with left paw, right paw, or head?⁴³
Or why is it that those of the Guthrie and Horton [20] cats, which did
show variability in their methods of tipping the “barber pole” to get
out, could interchangeably use such variations as snout, rump, or
paw? We have as yet no clear answers. Hull proposed the concept of
“response generalization” [31]. But this is hardly more than a phrase
to cover our ignorance.

⁴³ Perhaps the first and most striking experiment which emphasized this type of
variation was Muenzinger’s with the guinea pigs [59, 61].

Let me present my own position on this point. It is that in any 
of the learning situations which we usually set up, what the subject es- 
sentially learns is primarily a type of organism-environment rearrange- 
ment. For example, he learns to make “this sort of turn” in the maze— 
either in terms of his own body or in terms of the position of light or 
noise cues in the room; he learns to get himself to “this sort of door” 
rather than to “that sort of door”; he learns to push “this type of 
lever” down; he learns to bump into “this sort of barber pole”; he 
learns to wait (i.e., to hold himself) until the next stimulus (be it the 
unconditioned stimulus in a classical conditioning experiment or the 
next syllable in a series of nonsense syllables) appears.⁴⁴ And he already
has the motor skill equipment to accomplish these organism-environ- 
ment rearrangements whether by walking, swimming, running, using 
this leg or that leg, his snout, or whatever. What he learns is, in short, a 
performance and each such performance can usually be carried out by a 
number of different motor skills.⁴⁵ That is, in most of our animal and
human experiments the skills are already present and the so-called 
“response” we are predicting is a performance which may be carried out 
by any one of a number of skills. But the laws which govern the de- 
grees of interchangeability of, or the hierarchies among, these skills or 
how the skills are originally learned we, or at any rate I, do not as yet 
know. 

Furthermore, whereas many psychologists, especially Guthrie and 
Horton [20] feel that the learning of performances (they call them 
“acts”) will eventually be understood and explained only after we have 
already learned about the acquisition of skills (they call them “movements”),
I would hold that it is more true that an adequate understanding
of skills will come only after we have an adequate understanding
of the acquisition of performances within which these skills
function. 

2. Turn now to the second question. What is the nature of the 
function by which the magnitude of a simple single approach, escape, 
or avoidance can be predicted? 


⁴⁴ I am suggesting that what I have above called (S_1 → S_2) learning may really
be always an (S_1 R_1 → S_2) type of learning in which the R_1 is “waiting.” This
indeed was the position I took in my original book, Purposive behavior in animals
and men [78]. But I no longer wish to insist on this interpretation as against the

er, 

“In this connection we all remember Lashley’s early experiment with the 
monkey [42] in which the animal learned to open the puzzle box by one hand and 
When this hand was paralyzed could open it equally well with the other hand. 




It may be noted that Hull raises practically the same question (in
his terms) when he presents his function for the strength of a reaction
potential (an ${}_SE_R$) [32]. He conceives of four intervening variables:
$D$ (drive strength); $V_1$ (the stimulus intensity dynamism of $S_1$); $K$ (the
incentive motivation); and ${}_SH_R$ (the already acquired habit strength).
And he writes the function:


\[ {}_SE_R = f(D \times V_1 \times K \times {}_SH_R) \]


That is, he assumes that the strength of the drive $D$, the effect of the
dynamic intensity $V_1$ of the $S_1$ stimulus, the goodness of the goal object
$K$, and the strength of the habit bond already established between $S_1$
and $R_1$ (i.e., ${}_SH_R$) simply multiply together to determine the degree
to which the given response will tend to go off, i.e., the strength of ${}_SE_R$
on the given occasion as a result of the presentation of $S_1$.

It is to be pointed out now that I too assume a rather similar 
function [88]. My function for a case of simple approach would read: 


\[ Pv = f_x(D_1, Exp_{S_2}, V_{S_2}) - f_y(D_w, Exp_w, V_w) \]


The $f_x$ part of this function is similar to Hull’s. $Pv$ is the symbol for
the magnitude of the performance tendency,46 i.e., the tendency for the
performance ($S_1R_1 \rightarrow S_2$) to go off. This is analogous to Hull’s ${}_SE_R$. $D_1$
is a symbol for the magnitude of the initiating drive-stimulation (be it a
Du, a Dr, or a D,,). This is analogous to Hull’s $D$. $Exp_{S_2}$ is a symbol for
the expectancy that the $S_2$ will follow $S_1R_1$. This is analogous to Hull’s
${}_SH_R$. And $V_{S_2}$ is a symbol for the strength of the valence (positive or
negative, primary or secondary) which is attached to the representation
of $S_2$. This is analogous to Hull’s $K$.

It will be observed, however, that I do not include any term corresponding
to Hull’s $V_1$ (stimulus intensity dynamism). If there be such
a factor as a stimulus intensity dynamism, it would be taken care of in
my system either by the generalization gradient around $s_1$ which would
determine the degree to which the given $S_1$ would be accepted as an
instance of $s_1$, or it would be taken care of by the degree to which the
given $D_1$ operates to make $S_1$ “attended to” and hence determines the
potency of the resulting $S_1$.

Again, it will be observed that, whereas Hull assumes that $D$, $V_1$,
$K$, and ${}_SH_R$ simply multiply to produce ${}_SE_R$, I shall not now attempt
to make any assumption as to the exact ways in which my variables of


46 “$Pv$” is an abbreviation for “performance vector.” As will be seen in my
answer to question (3) above concerning the interaction between performances, I
consider that a performance has both magnitude and direction; hence the tendency
toward it may be appropriately treated as a vector.


$D_1$, $Exp_{S_2}$, and $V_{S_2}$ interact, other than to assume that, in general,
increases in any one of their respective magnitudes will tend to increase
the magnitude of $Pv$. Further, it is to be noted that, since a valence
$V_{S_2}$ is for me either positive or negative, the resultant $Pv$ may be either
positive or negative. That is, the resultant performance tendency may
be either one of doing $R$ or one of not doing $R$ ($\bar{R}$). The latter occurs in
the case of avoidances.

Turn now to the $f_y$ part of the equation. (This, as such, is lacking
from Hull’s equation.) This $f_y$ part of my equation is to allow for the
assumption that the performance vector will be decreased in magnitude
in so far as the organism has a drive against work $D_w$, expects $R$ or
$\bar{R}$ to involve work, and this expected work has negative valence. The
more the organism has, at the time, a general dislike of work (a drive-stimulation
against work) and the more strongly he expects that the
particular performance will involve work and the more arduous this
work is expected to be, the weaker will be his performance vector $Pv$.47

It may be noted, however, that while Hull does not introduce any
such work factor as such into his equation for ${}_SE_R$ he does talk about
“conditioned inhibition.” And he apparently does subtract ${}_SI_R$ (conditioned
inhibition) from ${}_SE_R$ just as I subtract the $f_y$ function. This is
not to say that Hull’s notion of conditioned inhibition and my notion of
expected and negatively valenced work are synonymous. But they are
undoubtedly somewhat related. So much for my notions as to how the
magnitude of a single performance vector for approach, escape, or
avoidance $R$ or $\bar{R}$ can be written as a function of the causally preceding
variables.
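
The contrast between the two functions can be made concrete with a small numerical sketch. Hull's product is as stated in the text. The form given here to my own function is purely an assumption made for illustration: both parts are taken as plain products, and the work term is clamped so that it can weaken the performance vector but never reverse it. The text itself commits only to the performance vector growing with each of its three variables. All function and parameter names are hypothetical.

```python
import math


def hull_reaction_potential(drive, dynamism, incentive, habit):
    """Hull's stated assumption: D, V1, K and sHr simply multiply."""
    return drive * dynamism * incentive * habit


def performance_vector(drive, expectancy, valence,
                       work_drive, work_expectancy, work_aversion):
    """Signed magnitude of Pv, computed as f_x(...) minus f_y(...).

    ASSUMPTION: f_x and f_y are both plain products, and the work term is
    clamped at zero so it weakens Pv without changing its sign. Neither
    choice comes from the text, which leaves the exact forms open.
    """
    f_x = drive * expectancy * valence                    # valence may be negative
    f_y = work_drive * work_expectancy * work_aversion    # all three taken as >= 0
    magnitude = max(abs(f_x) - f_y, 0.0)
    return math.copysign(magnitude, f_x)


if __name__ == "__main__":
    print(hull_reaction_potential(2.0, 1.0, 0.5, 0.8))
    print(performance_vector(1.0, 0.8, 0.5, 0.5, 0.5, 0.4))    # approach
    print(performance_vector(1.0, 0.8, -0.5, 0.5, 0.5, 0.4))   # avoidance
```

Under Hull's product a zero on any one variable abolishes the reaction potential altogether, whereas in the sketch of my function the work term only shortens a vector whose sign is fixed by the valence.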

3. Turn now to the third question. How do two or more different,
simultaneously aroused performances tend to interact to produce a new
performance different from either one of these separate performances
by itself? In seeking to answer this question let us consider first, more
specifically, the notion of a performance vector, a $Pv$. A performance
vector is to be conceived as a force having both magnitude and direction
which, so to speak, “pushes” the organism either toward or away
from the $S_2$ involved in a given ($S_1R_1 \rightarrow S_2$) expectancy. This means
that when two or more performances (vector forces) are aroused
simultaneously it is the vector addition of these forces (these $Pv$’s)


47 It must also be recalled that above I suggested that along with a drive against
work there must be conceived to be a related or perhaps an independent drive
against uncertainty of outcome, i.e., low probability of the $S_2$ consequence. In other
words, it may be that there should also be a third term in the equation,
$f(D_u, Exp_u, V_u)$, which should also be subtracted. The $u$ subscripts here would
stand for uncertainty.




which should (if this conceptualization is valid) produce the magni- 
tude and the direction of the actual performance which will go off. 
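
What such a vector addition amounts to can be sketched in a few lines. The sketch assumes, only for illustration, that the performance space is a flat plane and that each vector is given as a magnitude and a direction in degrees; the function name is hypothetical and nothing in the theory requires the space to be two-dimensional.

```python
import math


def add_performance_vectors(vectors):
    """Add (magnitude, direction_in_degrees) pairs in an assumed 2-D space.

    Returns the magnitude and direction (degrees) of the resultant.
    """
    x = sum(m * math.cos(math.radians(d)) for m, d in vectors)
    y = sum(m * math.sin(math.radians(d)) for m, d in vectors)
    return math.hypot(x, y), math.degrees(math.atan2(y, x))


if __name__ == "__main__":
    # Two directly opposed tendencies: the stronger wins, but weakened.
    print(add_performance_vectors([(3.0, 0.0), (1.0, 180.0)]))
    # Two tendencies at right angles: the resultant lies between them.
    print(add_performance_vectors([(1.0, 0.0), (1.0, 90.0)]))
```

Opposed vectors subtract from one another, while vectors at an angle yield a resultant in a direction belonging to neither performance taken alone.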

It must be noted further that the introduction of this concept of 
performance vectors and their addition means that in the original 
Figs. 1 to 5 an additional column (under the general heading of “Intervening
variables”) should have been included. This column, if it had
been introduced, would have been labeled “Performance vectors and 
their addition” and it would have been inserted between the “Expec- 
tation-perception-representation-valence” column and that for the 
final “Dependent variable” (i.e., the final dependent performance). 
It was not included, partly because I did not fully appreciate the need 
for it at the time I constructed the figures and partly because it would 
have made them even more complicated. 

What I shall do from here on, after a little further elaboration of 
the nature of performance and of performance vectors, is merely to 
consider a few concrete examples of behavior experiments which ob- 
viously involve the arousal of two or more performance tendencies 
(performance vectors) and I will indicate what in each case I think the 
vectors would be and how they would add to produce the final actual 
performance.48

The admittedly greatest difficulty with this notion of performance
vectors lies in the concept of direction. Lewin49 [44] defined the direction
of a psychological force in terms of the first “step” (in “hodological
space” as he conceived the latter) of the psychological response
brought about by that psychological force. For me, on the other
hand, the direction of a performance vector is defined by the position
of the $S_2$ relative to the position of the $S_1$ in the given perceptual-expectational
field (or space). That is to say, the direction of a performance
vector is to be specified by the direction running from the “position”
of an $S_1$ to the “position” of an $S_2$. And this means, of course, that
the positions of the $S_1$ and $S_2$ are, or become, part of the discriminable
perceptual or representational characteristics of such $S_1$’s and $S_2$’s.


48 And in doing so I shall make use of my own no doubt somewhat over-
simplified and distorted notion of a vector analysis.

49 It will be obvious throughout this discussion of performances and performance
vectors that I have been tremendously influenced by Lewin. It was from him that
I got my original notions. However, as I have worked these notions over, I am
now no longer clear as to how much I have retained of his ideas and how much I
have altered them. And it would take a more analytical and industrious mind
than mine now to make a careful exegesis and to cite specific references. I wish,
nonetheless, to emphasize my overwhelming indebtedness to him.

I would like also to suggest that there may turn out to be a close relationship
between my notion of performance vectors, and hence of a performance space,
and Osgood’s concept of a multidimensional meaning space [67]. Further, I believe
that Osgood’s experimental and statistical methods of identifying the dimensions
of the meaning space, using human subjects, are far superior to any of the
types of analyses I have yet been able to think up for specifying rat-performance
spaces.

In the case of an approach performance I would symbolize the
performance vector as $\rightarrow O - (S_1R_1 \rightarrow S_2^{+})$. In the case of an escape
performance I would symbolize it as $\rightarrow O - (S_1R_1 \rightarrow S_2^{*})$. And in the
case of an avoidance performance I would symbolize it as
$\leftarrow O - (S_1R_1 \rightarrow S_2^{-})$.50 In these symbols the $O$’s stand for the organism. The
vector arrows push the organism toward a given $R_1$ which is expected
to lead to a given $S_2$, or pull it away from this $R_1$ and resultant $S_2$.

Do these three, approach, escape, and avoidance, complete the 
list of possible types of performance as far as their formal characters 
go? I believe they do. I must point out, however, that in a previous 
publication [87] I listed two additional formal types of performances: 
consummatory performances and exploratory scanning performances. 
It now seems to me that these two are but subtypes of approach.

A consummatory performance is simply an approach performance
in which the $S_1$ is a perceived, immediately present goal object which
is then responded to by the given consummatory response, and $S_2$ is
the resultant pattern of external and/or internal stimuli which is approached
and reached by such a consummatory response. Thus, for
example, in the case of consummatory eating, the performance vector
would be represented as > O — ($R; —> Sin‘). 

Turn to exploratory scanning performances. I would now change 
the name to “identification performances.” These identification per- 
formances constitute a very special type of approach. In fact they are, 
as we shall see in a moment, such a unique type that from now on I 
shall find it useful to classify them in a separate class even though they
still are, I believe, types of approach. I shall, that is, group all other 
performances (whether simple approach, consummatory approach, 
escape, or avoidance) under the heading of “pragmatic performances” 
and keep “identification performances” as a separate class. From here 
on, then, I shall be concerned with these two main classes: identification
performances and pragmatic performances. It may be noted that
there will be setups in which we shall have to consider both types of
performance vector as going off simultaneously or successively and thus
affecting the total behavior. 

But, first, we must examine in more detail the exact nature of an
identification performance. An identification is, I shall hold, an approach
from a relatively grossly perceived immediate stimulus $S_1$, to
the same stimulus perceived and discriminated more precisely, $S_1'$. My
symbol for such an identification approach vector would be
$\rightarrow O - (S_1R_1 \rightarrow S_1')$. The $S_1$ would be the stimulus perceived at first as
immediately present and relatively undiscriminated. The $R_1$ would be
an examining and identifying response. And as a result of this examining
and identifying the organism would move into the presence of
$S_1'$, the same stimulus but perceived in some more precise way.51

50 In the previous discussion and in the figure for avoidance I introduced the
symbols $\bar{R}_1$ and $\bar{R}$. I indicate the same meaning here by introducing an arrow
(performance vector) pointing in the opposite direction from the performance of
the given $R_1$.

Concretely what seems to happen is that, on occasion, when presented
with an $S_1$, the organism before making any pragmatic performance
to the immediately resultant $S_1$ stops and identifies it more closely
and in so doing “moves himself,” so to speak, from the initial relatively
undiscriminated perception $S_1$ to a more precise perception $S_1'$. A
pretty example can often be observed in a visual discrimination experi- 
ment with rats. A rat, as all rat runners know, usually starts with a 
spatial position habit (i.e., he discriminates the two $S_1$’s primarily
in terms of their spatial positions). But then he is apt, after a certain 
number of trials, to stop and begin to look back and forth, to make 
“vicarious trials and errors” or VTEs, to use Muenzinger’s [60] term, 
before he jumps. And shortly after such VTEs have begun he begins 
jumping by more than chance to the correct visual stimulus, whichever 
side it is on. That is, he changes from undifferentiated discriminations
of the visual properties of the two cards to relatively precise visual 
differentiations. The initial $S_1$’s have become $S_1'$’s as far as the visual
properties are concerned. And it would appear that these examinings 
and lookings-at, these VTEs, constitute the pointer readings for the 
identification approach of the organism from the poorly discriminated 
visual properties to the more precisely discriminated ones. 

In order, however, to make this clearer, I shall now have to introduce
the concept of “perceptual discrimination spaces.”52 A perceptual
discrimination space is a discriminable perceptual dimension
conceived as radiating in a circular arc in front of the perceiving end 
of the organism. If, for example, we take the dimension of brightnesses, 
then the series from white to black can be considered as constituting 
an arc arrayed around the “perceiving nose” of the organism with the 
lightest perceivable white at one end and the darkest perceivable black 

51 The drive behind these identification performances is of course the curiosity
drive already discussed.

52 This concept has not, I am sure, been completely worked out. In fact, I
myself am still somewhat hazy about it. Yet, intuitively I feel that the concept
points to something important and that eventually something really useful may
come out. I have already presented previous attempted expositions of this concept
in connection with what I called the “schematic sowbug” [82, 83, 88]. See also
footnote 49.
at the other. Any brightness stimulus which, to begin with, is relatively
undiscriminated visually is to be thought of as lying straight ahead, 
i.e., as an equivalent to a medium gray or whatever the adaptation 
level [23] tends at that moment to be. When, as in the example of the 
jumping stand, the rat, however, passes from not perceiving the white- 
ness or the blackness of a card to perceiving it, he moves from pointing 
at a perceived stimulus (an $S_1$) straight ahead to pointing towards and
getting to the corresponding perceived “white” or “black” stimulus
which is angularly displaced from the medium gray. Probably, however,
to make this clear I shall have to present a couple of figures.

In Fig. 6 the organism is represented as presented with an objectively
black stimulus $S_1$. But, since he does not yet discriminate this,
he points toward his adaptation level of medium gray (see nearer
arc). However, since the stimulus is really black and he is potentially
capable of discriminating black there is a potential angular vector
(represented by the dashed arrow) tending to turn him towards the
black end of the arc. And, as shown in Fig. 7, he sooner or later will
tend to turn and to “move” (identificationally speaking) toward this
$S_1'$ of black.

Consider now that he has two cards in front of him, one white and 
one black. When he looks at the black card he will “move” towards the 
$S_1'$ of black and when he looks at the white card he will “move” toward
the $S_1'$ of white. But why will he VTE? To understand this, I
shall have to introduce a further concept, namely, that after he has 
“moved” in the direction of either identification, he becomes tem- 
porarily satiated for it. This concept of temporary perceptual satiation 
I have borrowed in part from Glanzer [18, 19]. After the rat has looked 
at black, the vector turning him and then moving him in the direction 
of black now disappears temporarily (or at least becomes weaker) ow- 
ing to momentary satiation. This leaves the vector to turn and move 
(identificationally speaking) towards white in control. He, therefore, 
now looks at the white and identifies it. But the white identification 
process in its turn satiates. And, in the meantime, the black vector is 
assumed to recover. And so he looks back again at the black. Thus he 
continues to VTE until both identification vectors have become more 
permanently satiated. Finally, when this has happened the pragmatic 
jumping-toward vectors take over. And in so far as he has already 
begun to acquire the appropriate pragmatic means-end readinesses 
and resultant expectations as to which card leads to the food and which 
to the net, he also begins to jump correctly more than by chance. 

One further experimental point. It was found by me [82] that in 
the usual visual discrimination setup in which the trials are continued 
only for, relatively speaking, a short period, the rats VTE more for a 
white vs. black discrimination than for a white vs. medium gray dis- 
crimination, and more for this latter than for a white vs. light gray 
discrimination. This seems contrary to what would be expected. For, if 
the discrimination is more difficult, one would on a common-sense, 
human, basis have expected the VTE-ing to have been greater rather 
than less. Certainly we would expect in, say, a lifted weight experiment 
with human subjects that, if the subjects were allowed (using the con- 
stant method) to lift each weight of a pair as many times as they 
wanted before giving a judgment, they would do more such lifting, 
more VTE-ing, when the two weights are near together in heaviness 
than when they are farther apart.53 Yet the rats did more VTE-ing for
the easy discrimination than for the difficult one. This led me to the 
notion that, in the first part of a visual discrimination experiment, the 
rats are really primarily discovering the stimulus dimension to be paid 
attention to. They are primarily making visual identification responses. 


53 Recent experiments by Mrs. Laura W. Phillips in the Berkeley laboratory
bear this out most beautifully. 
Whereas, if the usual discrimination training experiment were carried
further so that the pragmatic jumping vectors would take over, 
the reverse results would be obtained. This led Minium and myself 
to a follow-up experiment [91]. The same group of rats was first 
trained on black vs. white and continued for a long time. The animals 
were then switched to medium gray vs. white and again continued on 
this for many days. Finally, they were switched to light gray vs. white 
and continued for numerous trials. It turned out that in the first few 
trials with each setup they did do more VTE-ing for black vs. white 
than for medium gray vs. white and more for this than for light gray vs. 
white. But it also appeared that after these initial periods the VTE-ing 
for black vs. white sank to practically nothing, whereas for medium gray vs.
white it stayed at an appreciable level and for light gray vs.
white it stayed at a high level.

To explain these contrasting results as to VTEs between the first 
trials and the later ones, let me return to the concept of the visual 
identification space. My hypothesis is that in the first periods of the 
discrimination period, since the white vs. black ends of the arc are, 
say, 90° apart, the satiation of the identification vector towards one of 
them does not generalize to (has no projection on) the other. Hence, 
when the animal has looked at white or black, he still has a strong 
tendency to look at the other. He will thus VTE for a relatively long 
time. But when the two stimuli are nearer together on the discrimina- 
tion arc the satiation of the vector for identifying one of them will 
generalize in some degree to the other. That is, in this latter case, 
since the arrows are less than 90° apart, the shortening stemming from 
satiation of one of these vectors will project on the other vector. This 
will result in less VTE. And the nearer together angularly the two 
vectors are the more this will be so, and hence the less the VTE. 
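
The hypothesis about early trials can be illustrated numerically. The sketch below reads “projection” as the ordinary cosine projection of one vector on another, which is an assumption on my part, and the angular separations fed to it are hypothetical placements of the grays on the arc, not measured values.

```python
import math


def generalized_satiation(satiation, separation_degrees):
    """Satiation carried over from one identification vector to another.

    ASSUMPTION: the carry-over is the cosine projection of the satiated
    vector on the other one, with no carry-over at 90 degrees or beyond.
    """
    projection = math.cos(math.radians(separation_degrees))
    return satiation * max(projection, 0.0)


if __name__ == "__main__":
    # Hypothetical separations on the brightness arc, in degrees.
    pairs = {
        "white vs. black": 90.0,
        "white vs. medium gray": 45.0,
        "white vs. light gray": 20.0,
    }
    for name, angle in pairs.items():
        carried = generalized_satiation(1.0, angle)
        print(f"{name}: carried-over satiation {carried:.2f}")
```

The larger the carried-over satiation, the weaker the remaining tendency to look at the second card, and hence the less VTE is to be expected in the early trials.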

Later, however, as appeared in the Tolman and Minium experi- 
ment [91], when the identifications have been established, the prag- 
matic jumping vectors take over. The VTE-ing now becomes an 
expression of the degree to which the vector to jump to one of the two 
stimuli tends to generalize to a jumping towards the other. In this 
later stage, the nearer the two perceived stimuli are together on the
discrimination arc the more a vector to jump to one will have a positive
projection upon the other. Here, in other words, VTE-ing becomes
then a case of “partial errors.” And there will be more partial errors 
the more difficult the discrimination. 

Finally, however much the reader may object to the details of 
the above interpretation, I hope he will at least have been convinced 
that some such vector interpretation, even though it may have to be 
considerably modified, is intriguing and has empirical suggestiveness.
It must be admitted, however, that one feature of this concept of
identification spaces and identification performance vectors which has 
not been touched upon is that of distance. What are the distances 
which the organism has to overcome in passing from the immediately 
presented and relatively undiscriminated $S_1$’s to the finally more discriminated
$S_1'$’s? Are these distances (as I have depicted them) the
same for all discriminations on a given dimension? And do such dis- 
tances, even if the same for one dimension, differ as between dimen- 
sions? And, corresponding to such differences, if there be such, be- 
tween distances does this mean related differences of expected work 
which will act as vectors in the opposite direction which will thus tend 
to weaken the identification process? This whole question of distances 
is a feature of the theory, as it is at present, which suggests, at least to 
my mind, certain further questions which might perhaps be empiri- 
cally answered. 

Another series of investigations is suggested by the problem of 
what conditions, other than actual ease of discrimination, tend to 
determine the amounts of VTE (VTE being for the present our only 
good pointer reading for the fact that identification performances are 
taking place). We do already have some answers. Thus, a wide gap to 
be jumped [27] or an electric grid to be passed over in front of the 
choice doors [60] both tend to increase VTEs. And conversely we also 
have some incidental but unpublished evidence from the Berkeley 
laboratory that, if the animals are too highly motivated for the goal, 
VTE-ing will be decreased. These facts require further experiments as 
to the interrelation between identification vectors and pragmatic 
vectors.54

Let us turn now to the question of pragmatic performance vectors. 
Let us consider a few concrete experiments where the perceptual- 
representational directions are apparently closely parallel to those of 
geographical space. 

First consider performances on a single T maze. On such a maze 
there would appear to be two pragmatic performance vectors which 
get established at the choice point. One would correspond to an 
expected valence to be reached by going down the one arm; and the 
other would correspond to an expected valence to be reached by going 


54 Finally, any complete account would also raise the question of identification
vectors relative to the $S_2$’s and $R_1$’s as well as relative to the $S_1$’s. Just how to get
at these others, how to get pointer readings for them, I am not at all clear. Finally,
it is to be stressed, whether or not the above interpretations be correct, there was
pretty clear evidence obtained by Geier, Levin, and me [17] that early VTE in a
discrimination setup is correlated with the individual’s cognitive speed in acquiring
the given type of discrimination.
down the other arm. And it would appear that these two vector:
probably to be conceived as 180° apart, thus —0—. Or, in other w 
the two vectors subtract from one another so that in so far as or 
any given moment is stronger, the resultant in the other directi 
weaker. And this would seem to be true even though rats differ 
demonstrated by Tolman, Ritchie, and Kalish [93]; see also, Wi 
et al. [94]—in that the “space” on a simple T maze for some © 
animals seems to be based upon extramaze cues, such as orieı 
lights or sounds, and the “space” for other animals is based r: 
upon some sort of internal bodily right-hand vs. left-hand cues. 

Further, however, to be quite sure to what extent the two pel 
tual-expectational directions on a T maze are really 180° a 
further experiments must be done in which Y mazes of different a 
lar dispersions are also tried. It might well turn out that all Y n 
with an angle greater than 90° give results as to speed of learning 
as to VTEs which are indistinguishable from those of T maz 
should expect, however, that, as the angle between the Y arm 
came less than 90°, there would, with increasing narrowness O 
angle, be more VTEs and slower learning. In other words, I v 
assume that in these primarily spatial maze problems, VTE-ing i1 
is due not to alternation between identification vectors but rath 
alternation between pragmatic approach vectors. Thus it has 
demonstrated that in a maze [35] VTEs correlate positively with « 
rather than negatively and that there is no correlation of VTESs : 
by individual rats on a maze with the VTEs made in a discrimin 
setup [92]. In a word, the data so far suggest that, in actual 
graphically spatial setups, rats come already equipped with s 
identifications and their VTE-ing is an expression of the cc 
between their pragmatic vectors rather than between their iden 
tion vectors. 

Consider now another experiment in which geographically s 
relations were involved. This is an unpublished experiment by Ci 
field done in the Berkeley laboratory, briefly described in [80] 
group of rats was trained on a Y maze with food at both ends an 
forced by doors, manipulated by the experimenter, to go an 
number of times to each side. A second group was similarly trair 
a T maze, while a third group was on a T-shaped maze. 

After equal amounts of such preliminary training with all 
groups, the experimenter then opened up for each group a thire 
running straight ahead. Crutchfield found that the straight- 
path was now entered most frequently in the case of the Y, le 
quently in the case of the T, and still less frequently in the case 
+. I interpret these results to mean that the “directions” of tl 
pragmatic performance vectors acquired during the original training
and corresponding to the entering of each of the two arms cor- 
responded more or less exactly to the actual geographical directions. 
Thus the vectors pushing the rats down the two arms of the Y each 
had a resultant in the direction of the straight-ahead path. The vectors 
corresponding to the two arms of the T had no such resultant. And, 
finally, the vectors corresponding to the two arms of the T had result- 
ants which were opposite in direction to that of the straight ahead. 
The fact that even in this last case a small proportion of the rats 
entered the straight ahead I would attribute to an additional propen- 
sity to explore any newly opened path. This increased by a constant 
amount the tendency to go down the straight ahead for all three 
groups. 
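
The interpretation just given can be checked with a short computation. The arm angles below are hypothetical: the Y arms are taken at 45° to either side of straight ahead, the T arms at 90°, and the arms of the third maze at 135°, i.e., slanting backward. The constant added for exploration is likewise an assumed value.

```python
import math


def straight_ahead_tendency(arm_angle_degrees, arm_strength=1.0, exploration=0.2):
    """Component along the new straight-ahead path of two equal arm vectors.

    The two arm vectors lie symmetrically at +/- arm_angle_degrees from
    straight ahead, so their sideways components cancel. `exploration` is
    the assumed constant propensity to enter any newly opened path.
    """
    resultant = 2.0 * arm_strength * math.cos(math.radians(arm_angle_degrees))
    return resultant + exploration


if __name__ == "__main__":
    # Hypothetical arm angles for the three mazes.
    for name, angle in [("Y", 45.0), ("T", 90.0), ("backward-slanting", 135.0)]:
        print(f"{name}: {straight_ahead_tendency(angle):+.2f}")
```

On these assumptions the tendency to enter the straight-ahead path is greatest for the Y, equal to the bare exploratory constant for the T, and least for the maze whose arms slant backward, which is the order Crutchfield observed.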

This experiment of Crutchfield’s should undoubtedly be repeated. 
We do not know what spatial axes (those provided by the rats’ own 
bodies or those provided by room cues) the animals were using.55
There could be a variety of experimental variations to uncover what 
this space was, in terms of which the directions of the pragmatic 
performance vectors were specified. But, in any event, it seems clear 
that there was a “performance space” and that this space was in con- 
siderable correspondence with the actual geographical space. 

Finally, there is one further type of experiment which I would like 
to propose—an experiment in which again the concept of pragmatic 
performance vectors would be predictive.56 It would be a discrimination
experiment in which the subjects already knew what they should
pay attention to. And the difference between the cues for right and for 
wrong would be well above threshold. The experiment might be done 
with either human subjects or rat subjects. In describing it, however, 
let us assume that it is done with rats. The apparatus would be the 
usual jumping stand save that there would be landing platforms in 
front of each card with grids on them which could be electrified.57 

The essence of the experimental design would consist in a com- 
parison of immediate and delayed recall for right vs. wrong responses 
learned under two conditions. In the one condition, the approach re- 
sponses would not involve any obstacles (other than the gaps). The 
grids on the landing platforms would never be electrified. In the other 
condition (in the original learning), the grid on the landing platform 

55 Crutchfield used an alley maze, not an elevated one.

56 A preliminary unpublished and somewhat inconclusive pilot study of the sort
of experiment I have in mind has already been performed. It was done by Mr.
Robert Schweers in the Berkeley laboratory. A brief mention of it has already
appeared in the literature [87].

57 Discrimination stands with landing platforms have been frequently used in
the Berkeley laboratory [82] and have proved more than satisfactory.
in front of the correct card would in each trial be electrified. Thus, in
this latter condition there would be built up, according to my hypoth- 
esis, not only a forward-pointing vector toward the correct card and a 
backward-pointing one away from the incorrect card, but also an 
additional backward-pointing vector away from the correct card 
(because of the shock). 

What now are the to-be-expected results for immediate recall 
(i.e., immediate relearning) vs. delayed recall (i.e., delayed relearn- 
ing) as a result of these two conditions of original learning? My predic- 
tion would be that the rats which had learned under the punished (or 
shock) condition would do approximately the same as those which had 
learned under the conventional condition in the immediate recall test; but
in the delayed recall test they would do appreciably more poorly.58

Why do I expect such results? The answer lies in my assumptions 
as to what sort of pragmatic performance vectors would get set up in 
the original learnings and in my further assumptions as to what would 
happen to these vectors with the passage of time. In the conventional, 
nonshock, condition a forward-pointing approach vector toward the 
correct card would be built up and a backward-pointing vector away 
from the incorrect card would also be built up. In the shock condition 
(in which in the original learning there was always a shock just in 
front of the correct card) there would be built up in addition a back- 
ward-pointing vector away from the correct card (because of the 
shock). Furthermore, in order to reach the same criterion of correct 
performance in the original learning, the forward-pointing vector 
toward the correct card would have to be built up to a somewhat 
greater length (to compensate for the backward-pointing vector due 
to the shock). In a test for immediate recall we may assume that all 
these vectors would retain approximately their original magnitudes 
and hence again there would be approximately no difference in 
this immediate recall test between the two groups. When, however, it 
came to the tests for delayed recall (i.e., after a considerable passage 
of time), I would assume (since there is evidence that the conditioning 
of a negative response persists longer in time than the conditioning of 
a positive response) that the pointing-away vector from the correct 
card would still retain considerable magnitude. This would mean that 
in this delayed recall test, the group which originally had had shocks 
in front of the correct card would do more poorly. That is, its choices of 
the correct card would, because of the persistence of this negative 
vector, be, relatively speaking, somewhat reduced in number. This 
group should, in short, make more errors in the delayed recall test. 

58 And I think Mr. Schweers’s results presented some evidence in this direction. 

And now I will indulge in a bit of phantasy. I shall suggest that, 
if the above assumptions be correct, they provide an explanation of 
something very like what the psychoanalysts call “repression.” For the 
shocked group the correct card became in the original learning both 
positive and negative. That is, it came to evoke a conflict between an 
approach-toward vector and a going-away-from (avoidance) vector. 
And, if it be further assumed that a going-away-from vector retains its 
strength longer over time than does an approach-toward vector, then 
at a later date there will be more going away from this item that was 
thus conflictful, than there will be a going toward it. It and what it led 
to will, in short, tend to be “forgotten,” as far as performance is con- 
cerned, or in other words, “repressed.” But finally, whether my guesses 
as to the experimental results are correct, and whether, if obtained, 
my further assumptions suggest a valid interpretation of repression, it 
does seem to me that experiments of this sort should be carried out. 
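
The prediction argued in the last few paragraphs can be put as a rough numerical sketch. Everything quantitative here is an assumption added for illustration and is not part of the system as stated: the exponential form of the decay, the half-lives, the starting magnitudes, and the names `decayed` and `net_toward_correct` are all hypothetical. The only features taken from the text are that both groups reach the same criterion in original learning, that the shocked group carries an extra vector away from the correct card, and that an avoidance vector is assumed to outlast an approach vector. The vector away from the incorrect card is left out because it is the same in both groups.

```python
def decayed(initial, half_life, days):
    """Magnitude left after `days`, assuming exponential decay (illustrative only)."""
    return initial * 0.5 ** (days / half_life)


def net_toward_correct(shocked, days, criterion=1.0, avoid_initial=0.5,
                       approach_half_life=10.0, avoid_half_life=40.0):
    """Net vector toward the correct card; all numbers are hypothetical.

    Both groups are trained until the net vector equals `criterion`, so the
    shocked group's approach vector must start longer by `avoid_initial`.
    """
    avoid0 = avoid_initial if shocked else 0.0
    approach0 = criterion + avoid0
    approach = decayed(approach0, approach_half_life, days)
    avoid = decayed(avoid0, avoid_half_life, days)
    return approach - avoid


# Compare the two groups at immediate recall (day 0) and at a delayed test.
for days in (0, 30):
    print(days, net_toward_correct(False, days), net_toward_correct(True, days))
```

Under these assumptions the two groups are equal at day 0 and the shocked group falls below the conventional group at any later test, which is the direction of the predicted result. Any decay rule in which the avoidance vector fades more slowly than the approach vector gives the same ordering; the particular curve is not meant as a claim about the form of the function.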

Before concluding this section, a general question concerning this 
whole concept of identification and pragmatic performance vectors is 
in order. For the critic may well ask whether such performance vectors 
can have any real applicability or usefulness for problem situations 
which involve other than truly spatial (i.e., truly geographical) situations. 
By way of answer I can as of now merely assert my faith. I 
believe and insist that, since all instrumental performances can be 
conceived as forms of approach, escape, or avoidance (three essentially 
spatial concepts), all such performances must be 
analyzable by vectorial concepts. Mechanical and even ideational 
problems [67] are “gettings-to” or “gettings-from” by more “direct” 
or by more “indirect” routes. Furthermore, our language is full of 
spatial terms: e.g., “this when approached is seen to be that,” “this is 
the opposite of that,” “this is a more direct solution than that,” “this is a 
more clumsy, more roundabout, longer way to do it than that,” “this leads 
to that.” The spatial concepts of distance and direction seem to me of 
the very warp and woof of all our thinking about performances 
whether these performances involve actual space or mere mechanics or 
mere logic. However, I must admit that the reason I feel this so 
strongly is perhaps because I personally like geometry better than 
algebra and I have better spatial (visual-kinesthetic) imagery than I 
have verbal imagery. Hence, it may be merely that I personally need 
to translate all performances and problem solving into spatial terms. 


METHODOLOGICAL CONSIDERATIONS 


Before concluding, let me turn again briefly to some omitted, or 
partially omitted, logical and methodological questions posed by the 
outline. 

Systematic and empirical definitions of independent, intervening, 
and dependent variables. I have already indicated fairly clearly, 
I hope, how I would conceive systematically and identify empirically 
the independent and dependent variables. I pointed out some of the 
unclearnesses which we all suffer from in our systematic definitions of 
stimuli and of responses. But I could not clear up the matter to any 
great extent. I did, however, emphasize that responses are for me to 
be conceived as “performances” rather than as specific muscle responses 
or gland secretions. A performance, as seen by me, is a specific 
type of achievement manipulation of an environmental object or 
situation. It is an entering of an alley, a going to the right, a going 
toward a light, going away from a light, or to or from a specific place, 
etc. Empirical pointer readings for such performances are set up rela- 
tively easily, probably without much error, by arbitrary stipulations 
such as that such and such part of a rat shall or shall not have crossed 
a particular line. To sum up, the systematic and empirical specifica- 
tions of independent variables and of the dependent variable, though 
still open to many questions, do not seem to lead to much confusion or 
lack of communication between investigators (even those from differ- 
ent schools). Common-sense notions and a little macroscopic physics 
seem, for the most part, to suffice and to allow for adequate objectivity 
and communication. 

When it comes to the question of intervening variables, on the 
other hand, this is where the schools differ. My own particular brand 
of intervening variables was admitted to come primarily from my own 
phenomenology. Thus Köhler’s designation of me as a cryptophenom- 
enologist was probably correct. I do, however, attempt to objectify my 
intervening variables and to suggest standard defining experiments for 
getting empirical pointer readings for them. 

Actually, however, it should be admitted that I really have con- 
siderable doubts not only about the practical feasibility of such experi- 
ments (since they would involve a tremendous amount of time and 
labor) but also about the validity of the results which would be 
obtained. 

My proposal was that one should set up standard defining experi- 
ments in each of which the obtained response or responses (i.e., per- 
formances) could be conceived as depending primarily upon, as being 
a direct pointer reading for, the variations of one particular intervening 
variable as this latter is dependent upon the controlled and prescribed 
manipulations of one or two independent variables. It was assumed 
that one could thus acquire a sort of table showing just what the values 
of each intervening variable would be as the result of such and such 
values of the correlated and controlling independent variable or 
variables. And it was assumed, further, that these relations of the 
values of each intervening variable to one or two independent varia- 
bles would hold in new, nonstandard, nondefining situations as well— 
so that the values of the intervening variables could be predicted from 
the values of the independent variables in the new situation. But I 
wish now to emphasize that this last assumption might well prove 
invalid. For there may be all sorts of interactions between the varia- 
bles (independent as well as intervening), in the new nonstandard 
situations, interactions which could not have been predicted from the 
results obtained in the standard defining situations by themselves. For, 
in these latter, rigid controls of all but one or one small set of inde- 
pendent variables would have been imposed. Hence, I have consider- 
able doubt concerning not only the practical feasibility but also the 
validity of the proposal. 

It might be said, however, that there is another possible way of 
conceiving my intervening variables. This would be to admit that they 
are merely an aid to thinking (“my thinking,” if you will). All anyone 
really sees are the empirically stipulated independent and dependent 
variables. In developing notions of what happens in between—such as 
beliefs, expectancies, representations, and valences and finally what 
I call performance vectors and their interactions—all I really am doing 
is setting up a tentative logic (or psychologic) of my own, for predicting 
what the dependent behavior should be and how it should be affected 
by variations in such and such sets of independent variables. 

Actually, it seems to me that this is no worse (begging the pardon 
of the Hullians), and perhaps no better, than the assumption of little 
r’s and little s’s where these latter intervening variables (though sounding 
more physiological and concrete) are at the present stage of our 
physiological knowledge just as untestable as my beliefs, perceptions, 
expectancies, valences, and vectors. There is, however, I would admit, 
probably one greater defect in my cryptophenomenological position. 
My intervening variables seem to be less communicable to others and 
are probably colored by special peculiarities of my own phenom- 
enology. To which, I answer that I like my own phenomenology and I 
believe it to be as good as and probably more fruitful than that of those 
who think in terms of little atomic r’s and little atomic s’s. 

Initial evidential grounds for assumptions of system. Obviously, 
the chief evidential grounds for my system are the experiments on the 
instrumental learning of rats and particularly experiments in mazes. 
And I suppose that for the fundamental distinction between “learn- 
ing” and “performance” which appears in the system as the difference 
between means-end readiness and expectancies, on the one hand (as 
purely cognitive functions), and performance vectors, as the result of 
cognitions plus motivations, on the other, I am indebted primarily to 
Blodgett. This distinction was forced upon me by his original “latent 
learning experiments.” I owe Blodgett a great debt. Of course, actually 
Lashley was the first one to make the distinction between learning and 
performance. But we did not realize this until later. 

The notion that performance readinesses or dispositions may be 
conceived as vectors grew out of my many associations with Lewin and 
my great indebtedness to him. However, my notion concerning the 
“performance or vector field” differs in certain ways from Lewin’s and 
it is still in a somewhat inchoate state. My ideas concerning vectors 
were also much affected by my experiments on discrimination learning 
and my development of the somewhat bizarre concept of “the sche- 
matic sowbug” and by my observations of the phenomenon of VTE. 
Here in connection with VTE I borrowed much from Muenzinger 
and his students (especially Evelyn Gentry). And I also was much 
affected in my notion of a “performance” as contrasted with a mere 
response by Donald Macfarlane’s early experiment on swimming vs. 
running a maze (done in our laboratory) and also by Muenzinger’s 
early experiment on the way in which guinea pigs may alternate their 
specific skills from trial to trial in getting out of a puzzle box. 

As I have already stressed, my original impetus to the study of 
the learning of rats came from Watson, Thorndike, and Perry. And it 
was from the last that the notion that cognition and motivation are 
perfectly objective components of behavior was borrowed. 

But any attempt to list sources soon becomes invidious because 
one is bound to omit important influences. So it had better be left for 
an historian to reconstruct, assuming that any historian would be 
interested (which seems doubtful). 

Generally speaking, it may be said that it is the ideas of my own 
students which have influenced me most. (I wish I could list them all.) 
And that the S-R schools of Pavlov, Guthrie, Hull, Spence, Mowrer, 
and Skinner have served rather as challenges than as sources of initial 
ideas. However, as was seen above, the beautiful work of Neal Miller 
and his cohorts I have taken to myself with a bang as a way of speci- 
fying the ultimate goal stimuli. 

Under the suggested subheadings which contrast immediate data 
language with construct language, I have nothing to say other than that 
they may perhaps be inferred by somebody else from the body of this 
paper. But I myself can neither get very interested in nor completely 
understand such more refined logical distinctions. 

Construction of function forms. My “function forms” are, I suspect, 
merely “general adumbrations” of what I assume to be the functional 
relationships between the independent, intervening, and dependent 
variables. They are merely of the sort: “the more of this, the more 
of that”; “the more of this, the less of that”; and the like. I would not 
attempt to fit curves or to specify the values of parameters. As I have 
already indicated above, such attempted precision seems to me for the 
most part premature. Wherever one can do it, the experimental 
conditions are so overcontrolled, restricted, and specific that any valid 
generalizations from such attempts seem to me impossible. 

Mensurational and quantificational procedures. Obviously, my 
mensurational procedures reduce pretty much to mere statements of 
“more” or “less.” I am not greatly concerned about equality of units, 
absolute zeros, or the like at this stage of the game. 

Psychology today seems to me to be carried away (because, per- 
haps, of feelings of “insecurity”) into a flight into too much statistics 
and too great a mathematization. This may well be my personal bias 
and stem from the fact that I have not kept up in my mathematics. 
But to me, the journals seem to be full of oversophisticated mathe- 
matical treatments of data which are in themselves of little intrinsic 
interest and of silly little findings which, by a high-powered statistics, 
can be proved to contradict the null hypothesis. 

A great lack in my system is, of course, that it contains no real 
consideration of individual differences in spite of the fact that I owe 
to Tryon an original interest in this direction. I suppose I feel that, 
ultimately, individual differences will be found to be most usefully 
treated as variations along the dimensions specified by the intervening 
variables of the system rather than by dimensions blindly got at by 
factor analyses. 

Formal organization of the system. All I can say here is that my 
system is based on hunches and on common-sense knowledge. It is cer- 
tainly not “hypothetico-deductive.” I have not the type of mind that 
can remember which were my axioms and which were my deductions. 
In any event, if a system were a SYSTEM, which I do not believe 
psychology to be, it would be largely arbitrary which one took as 
axioms and which one took as derivations. To attempt to build 
psychology on the analogy of a closed mathematical or logical system 
seems to me a “bad error.” 

I shall not say anything about “implicit” definition, “explicit” 
definition, “empirical or operational” definition, nor “coordinating” 
definition since my knowledge of these logical distinctions is too slight. 

Scope or range of application of system. As indicated in the body 
of this essay, the system as actually developed was based primarily on 
experiments dealing with the instrumental learning of rats. The basic 
problems of learning and motivation seemed, however, also to be 
involved in this more restricted area, so that it has always been my 
hope that the relations found in these simple problems with rats could 
be extrapolated to the behavior of the higher animals (including man) 
in more complicated situations. And on occasion I have loosely made 
such applications [84]. I have, however, no programmatic plans for 
extensions of the system to such wider areas. 

It is interesting to note that in many ways my system proves not 
too different from Hull’s save in two assumptions: (1) that it is not 
S-R habits but (sr → s) means-end readinesses which are stored up as a 
result of practice, and (2) that the final propensity to behave resulting 
from both learning and motivation is best conceived as a performance 
vector and not as a reaction potential [87]. 

History of system to date in mediating research. Just which particular 
researches done in the Berkeley laboratory or elsewhere have 
been “mediated” (or, perhaps better put, “influenced”) by my system, 
I have no way of estimating. In general, what I hope I have done is to 
develop a general attitude and point of view, and that from this point 
of view stimulation for further research, either within or without the 
specificities of my system, has resulted. 

I certainly have not “derived” specific deductions to be tested. 
Rather I have worked by hunches and, I hope, have thereby stimu- 
lated their own hunches in other persons. 

Evidence for the system. I am incapable of any good answer. The 
evidence, such as it is, I find throughout all the experiments on instrumental 
learning, wherever done. At least I am almost always capable 
of translating, or if you will, distorting, this evidence so as to use it as 
support for my system (my way of looking at things). One body of 
data which, however, I cannot do much with is that on the acquisition 
of sensorimotor skills in adults or with the way in which performance 
vectors govern and select from among such skills. Neither have I 
actually been able to do much with the higher processes of thinking, 
verbal communication, and the like in human beings, though I do feel 
that these latter processes will some day lend themselves more readily 
to a formulation of the (SR → S) type than to a simple S-R formulation. 

Specific methods, concepts, or principles of the system believed 
valuable outside the context of the system. I have nothing to say 
here. 

Degree of programmaticity. Again I have nothing to say except 
to report that I am apt to be accused of being too programmatic and of 
not getting down to actual work. 

Intermediate and long-range strategy for the development of 
the system. At my time of life, I have no real strategy for the future. I 
do, however, hope to carry out some experiments soon on the memory 
for decisions or choices in human subjects (based theoretically on the 
sort of performance vector analysis which I adumbrated above) and in 
these experiments the variables of difficulty, length of delay, and the 
presence or absence of special rewards and punishments upon recall 
will be tested. Such experiments may even prove to have some bearing 
upon the psychoanalytical concept of “repression.” 


CONCLUSION 


I started out, as I indicated in the introduction, with considerable 
uneasiness. I felt that my so-called system was outdated and that 
it was a waste of time to try to rehash it and that it would be preten- 
tious now to seek to make it fit any accepted set of prescriptions laid 
down by the philosophy of science. I have to confess, however, that as 
I have gone along I have become again more and more involved in it, 
though I still realize its many weak points. The system may well not 
stand up to any final canons of scientific procedure. But I do not much 
care. I have liked to think about psychology in ways that have proved 
congenial to me. Since all the sciences, and especially psychology, are 
still immersed in such tremendous realms of the uncertain and the 
unknown, the best that any individual scientist, especially any 
psychologist, can do seems to be to follow his own gleam and his own 
bent, however inadequate they may be. In fact, I suppose that actually 
this is what we all do. In the end, the only sure criterion is to have fun. 
And I have had fun. 


INTRODUCTION 


The volume, Modern Learning Theory [1], published in 1954, is a 
remarkable enterprise. In it the authors have achieved a penetrating 
and objective analysis of the distinguishable lines of attack on the prob- 
lem of human and animal learning, and the effect of the volume on 
current psychological theory cannot fail to be far reaching. The field is 
for the first time in print shown up in an aerial perspective that displays 
the wandering trails on which we have been traveling, some of them 
possibly leading nowhere, and the whole system failing to reach much 
agreement even on direction. The clear picture of the flounderings of 
present-day theory will surely reduce confusion and lead to new formulations. 

The present chapter is part of what might turn out to be a com- 
panion venture, and is a statement by men who for the purpose of the 
task are writing as systematists rather than as critics. The writer of the 
present chapter undertakes this venture with a certain humility and 
definite misgivings, being well aware, when the invitation to participate 
mentioned that a systematic formulation might vary “from one or a few 
orienting ideas . . . toward the organization of extant knowledge within 
a given empirical domain . . . to an explicit, elegant, and quantified 
systematization,” that what is to be outlined in this chapter is definitely 
to be placed at the first extreme of the distribution. The editor has sug- 
gested an introduction covering background factors and orienting atti- 
tudes, the structure of the system in so far as one has been developed, and 
the evidential grounds for the adoption of the system. The rubrics that 
will follow these are only a crude approximation of the headings sug- 
gested by the editor, which were designed for the exposition of a devel- 
oped system. The present account is better described as an orientation 
than as a system. 

The background factors are to include a short account of the writer’s 
education and some brief mention of the men and books that established 
his perspective. 


BACKGROUND FACTORS 


At the turn of the century, although the University of Nebraska had 
one of the first psychological laboratories in the United States, psy- 
chology was still, as in nearly all American universities, in a department 
of philosophy, though the staffs were distinct. At the time the writer 
entered in 1903, the university had none of the present apparatus of 
required courses and set curricula. The only requirement for the 
bachelor’s degree was credit in a number of courses extending normally 
over a period of four years, to include one 2-hr course in English com- 
position, taught largely by the lecture method. This freedom made pos- 
sible the inclusion of courses in both Latin and Greek which had been 
begun in high school; mathematics through calculus and including, as 
the nearest approach to modern statistics, a course on the theory of errors 
using a text by a professor of gunnery at West Point; several courses in 
philosophy; but in psychology only the general course before the 
bachelor’s degree. There followed three years of teaching mathematics 
in a local high school and a master’s degree in philosophy that included 
graduate courses in mathematics and in psychology under Thaddeus 
Bolton and H. K. Wolfe. H. B. Alexander, whose courses were listed 
as philosophy, took much interest in issues that would now be recognized 
as psychological. H. K. Wolfe had taken his Ph.D. degree under Wundt 
and had returned to the university after a brief absence because there had 
been debate over his orthodoxy. The interlude was, fortunately for me, 
timed so that he was my high-school principal at the time of graduation 
and was teaching psychology at the university (in the Department of 
Education) by the time I began graduate work. In one graduate course 
it was my good fortune to be his only student, and although the course 
and our conversations dealt very little with the body of facts that a 
modern graduate student in psychology must be prepared to face in his 
examinations, his views on the philosophy of science were of great 
interest. A research course under Bolton devoted a winter to observations 
with an aesthesiometer on the limen of twoness, and served to quench 
interest in psychophysics, which was the chief preoccupation of psy- 
chological laboratories in those years. 

The books still remembered from my Nebraska studies are all 
philosophical: Busse on the mind-body problem, read in German in a 
seminar of Hinman’s; Bosanquet’s Logic, with its conclusion that the 
ground of every judgment is the whole body of knowledge; Bradley's 
Appearance and Reality, which developed the paradoxes involved in all 
the traditional metaphysical concepts. Ernst Mach and Avenarius were 
later read with that same conviction that had accompanied reading 
Darwin’s Origin of Species and his Expression of the Emotions with 
a companion while in the eighth grade. The later appeal of Dewey’s hu- 
manism and his views on the nature of truth might be interpreted as in- 
fluences, but that interpretation is put in some question by the fact that a 
high-school graduation thesis written for Wolfe had taken the position 
that both science and religion, being dependent on words, and words 
being symbols dependent for their meanings on the experience of their 
users and auditors, would have no chance at expressing Absolute Truth. 
It is possible that a resigned acceptance of this conclusion, which Wolfe 
called curious, was the reason for the enthusiasm over Bradley, and, later 
on, Dewey. Acquaintance with Freud began in 1913 when as a fellow in 
the philosophy department at the University of Pennsylvania and as 
factotum of the department seminar I invited Alexander Goldenweiser 
down from Columbia to address us. 

At Pennsylvania in 1910, the first year was spent chiefly in ancient 
philosophy. Newbold’s seminars in Aristotle and Plato used Greek texts. 
During that year my enthusiasm turned to E. A. Singer who, like many 
philosophers of his day, extended his interest into psychology, though 
the only psychological paper I remember of his was on Fechner’s law. It 
was his paper on Mind as an Observable Object, delivered at a meeting 
of the American Philosophical Association in 1910, that began to turn 
attention again to psychology, but this interest was confined to journal 
reading and arguments with several graduate students majoring in 
psychology. My thesis was in the area of symbolic logic and dealt with 
Russell’s paradoxes, those propositions whose truth appears to imply 
their falsity, and whose falsity to imply their truth. “This proposition 
is false,” is an illustration. The solution suggested was that when such 
terms as “this” were defined, the paradox disappeared. 

This excursion into logic and Singer’s exposition of the dependence 
of axioms on definitions had a lasting effect on my attitude toward 
logic. My second published paper [5] after the thesis was an argument 
that the laws of logic are conventions and not laws of thought. The 
fact that it had taken Russell and Whitehead some 400 pages to establish 
the conclusion that one plus one equals two, and that every intervening 
step could be challenged and would require more proof, and that the 
steps of these added proofs would require still more, has made me im- 
patient with the notion that there can be any completely rigorous 
deduction, or ultimate validity in an argument. This scepticism colors my 
notions of the nature of scientific facts and scientific theory. 

This account of personal background is already too long, but may 
bear the addition of several possible determiners. My first university 
teaching was at the University of Washington in 1914, a position held 
until retirement in 1956. A too brief acquaintance with the economist 
Carleton Parker who, as dean of a just started College of Business Ad- 
ministration, persuaded me to undertake in 1917 a mission to inter- 
view loggers in the state’s camps for the Secretary of War, and a very 
brief career in both an infantry and an artillery OTS in 1918 served to 
turn attention from books to men. A year’s collaboration with the late 
Stevenson Smith on an elementary text entitled General Psychology in 
Terms of Behavior [14] proved to be invaluable training because it 
represented from four to five hours of argument and writing usually 
seven evenings a week over a period of a year. Smith had had far more 
training in psychology and the experience of clinical work with children 
for six years. Our diverse backgrounds, one with a degree in philosophy 
and mathematics, and the other with a degree in psychology and 
neurology, made almost every paragraph of the book being written the 
occasion for argument. Smith’s clinical interest in the problems of child 
psychology tended to keep our attention on the practical applications of 
theory, whereas my chief interest lay in undergraduate teaching, a fact 
that probably accounts for a strong bent toward simplification which, 
with some justification, has been described as oversimplification. 


ORIENTING ATTITUDES 


Certain strongly held beliefs concerning the nature of human com- 
munication and language, the nature of facts and explanations, and the 
nature of scientific truth have undoubtedly colored my excursions into 
learning theory. When we say “Let’s get down to the facts,” we do not 
mean that we shall merely look or taste or feel, but that we shall seek in 
a situation or event what description in words we can agree on and 
expect others to agree on. Men, therefore, help to make facts, and human 
purposes and motives enter into their selection and formulation. Before 
there were men and before language existed, the world got on well 
enough without being described. There were no facts in such a world. 
Facts involve men and discourse and common understanding and shared 
purposes. There is no such thing as purely inductive reasoning. No 
amount of repetition of events will produce a generalization. The mere 
repetition of events will not even result in their being numbered. Number- 
ing is a device developed by men. Nature does no counting. 

One critical test of fact and of the truth of propositions is in terms 
of their acceptability among men. A lone man in a world could do as 
well as a cat without speech and, therefore, without statements or 
propositions, and there would be available no test of the truth or falsity 
of such noises as he might make. These might frighten birds, but they 
would not be true or false. 

Scientific explanations are made in public. They are human acts 
and have their own motives and their effects on men. Scientific truths 
must meet these conditions of general understanding, though in their 
case the public is a special one. It is made up of men with a common 
interest in a special field and with experience in that field. There are 
special conventions to be met, and an explanation that is unintelligible 
to scientists working in the field is no explanation in a scientific sense. 
The concepts used must have been established in common usage. 

The explanation of an event or class of events must take the form 
of the statement of a rule of which the event or class is an instance. The 
pursuit of science is a search for a body of rules which will serve a 
number of purposes. One of these purposes is to enable men to share 
knowledge and thus to share the prediction and control of events. Another 
purpose of great importance is so to systematize and codify the rules that 
they can be taught to an oncoming generation. 

It is the requirement that rules be communicable and teachable that 
makes simplicity desirable. During the eighteenth century, the impressive 
beginnings of the reduction of physical science to rules had led to a 
notion that nature was set in a pattern of simple laws, the “Laws of 
Nature.” We have only slowly discovered that the requirement of 
simplicity is a human requirement. It is men that are simple, not nature. 
The simplification of nature that is achieved by scientific laws is achieved 
at the expense of prediction—by limiting prediction to classes of events. 

Scientific prediction is always in terms of a rule—if an event of class 
A, then an event of class B. A and B are both abstract classes, not 
concrete events, and the prediction is contingent upon a number of as- 
sumptions. One of these is that the actual instance is an instance of A, 
which is always a matter of judgment. Another assumption is that the 
general situation does not include “interference” from circumstances 
that have effects inconsistent with B. When a scientific law is used for 
the prediction of a specific event, these disturbing contingencies are con- 
trolled if possible; if not possible, they are allowed for by granting a 
certain leeway of error. When physics undertakes to interfere with the 
actual world we call it engineering, and the engineer uses safety factors 
of up to 1,000 per cent in order to be reasonably sure of the outcome. A 
physical law like Boyle’s law is stated without this allowance for error; but 
the error is always there and the points are never exactly on the curve. 

There is no question that human behavior is highly predictable. Our 
lives would be intolerable if we could not depend on this predictability. 
Clinical practice is built upon the extreme predictability of annoying or 
embarrassing behavior. Clinicians are asked to suggest measures that will 
be followed by changed behavior. Their first responsibility is to the pa- 
tient and they must take advantage of predictability on any available 
ground. 

Most of the predictability of human behavior is in an area which 
is not covered by science. It derives from acquaintance with the indi- 
vidual, the familiarity that enables wives to anticipate the behavior of 
husbands or mothers to know what to expect of children, children to 
take advantage of the habits of parents, the police to recognize the work 
of known offenders, politicians to be wise in the ways of voters, credit 
bureaus to collect profitable estimates of the debt-paying habits of mer- 
chants. Such information is highly specific and depends on acquaintance 
with the individuals concerned. It is not science because it is not re- 
duced to general rules; it can be learned, but it cannot be taught as a 
formal body of rules. It will not be found in textbooks because textbooks 
cannot be burdened with individual biographies. 

Considering the fact that the lives of all of us are guided by such 
information, it is rather remarkable how unsuccessful we have been in 
formulating general rules of behavior, in describing the general circum- 
stances under which behavior is changed by experience—the laws of 
learning. Learning is illustrated in every movement we make and we are 
continuously engaged in altering the behavior of our fellow men, but 
we have not been able to agree on any set of rules that such changes 
follow. 

The knowledge by acquaintance that enables us to anticipate the 
ways of friends and family, the anthropological collections of descriptions 
of the specific ways of groups, the skills that enable a paleontologist to 
recognize the place of a skull in evolution, or an ethnologist to recognize 
the folkways of a group, or a physician to name and describe the prob- 
able course of a disease are not science. Knowledge becomes science only 
when it can be stated in general terms and in rules that are independent 
of time and place. Scientific knowledge can be put into print and read 
and understood by strangers at a distance. 

What, then, will be the terms in which general rules of behavior are 
stated? What will constitute the antecedents and what the consequents? 
What can we look for as stimuli and what as responses? These words 
mean much the same as antecedent and consequent, except for the 
slightly misleading implication of causality which does not enter into 
scientific rules. Causality is a practical aspect of specific events and takes 
all contingencies into account. Scientific rules do not cause events; events 
are caused by the actual state of the world. 

In the physical sciences, this problem has been solved by minimizing 
the role of the observer and by the use of conventional instruments of 
measurement with the attainment of a high degree of objectivity, or 
as the authors of Modern Learning Theory have expressed it, a high 
degree of intersubjective agreement. The part of a human observer is 
eventually reduced, by using accepted calibrated devices, to such ele- 
mentary features as the noting of position on a scale which often can be 
recorded and made open to reading at any time, or duplicated and so 
made public. Many subatomic phenomena are recorded by lines on a 
photographic plate which can be submitted to the perception of many 
observers. It is worth notice that the role of the human observer is 
minimized, not abolished. 

In psychological observation it would, of course, be a great advan- 
tage to reduce both stimuli and responses to this point of high agree- 
ment. But there is reason to believe that this reduction cannot ordinarily 
be obtained in psychology. The phenomena in which the psychologist is 
interested are not specified in terms of mass, length, and time. They in- 
volve categories not reducible to position on a scale. In fact, they nor- 
mally involve patterns of situation and movement that require recogni- 
tion by a human observer, and this recognition is of an order indefinitely 
more complex than the recognition of relative position involved in com- 
paring a length with a scale. 

However, the fact remains that such recognition can satisfy the re- 
quirements of science. Take, for instance, a case in which the categories 
are very far removed indeed from scale measurements. The writer re- 
cently heard Dr. Edgar A. Doll describe a measure of social competence 
which he and his associates had developed primarily for conveying infor- 
mation about the degree of impairment of social competence in cerebral 
palsy patients. The test included a series of items adjusted to ages from 
one to twenty-five years. The items included such behavior categories as 
able to clothe himself and able to make simple wants known through 
speech. Observers could reach a very high degree of agreement on 
whether or not these response categories were illustrated in the behavior 
of individuals observed. In fact, Doll reported, this agreement could 
reach across cultural boundaries and apply to children of different 
cultures. I have no doubt that the items could be translated into other 
languages and agreement reached between observers who could not 
understand each other’s speech. Skills named in these items are a far cry 
from the scale positions to which physics reduces its observations, but 
the fact that we must use human observers as instruments does not alter 
the fact that the agreement they reach makes the data profitable material 
for behavioral science. Data obtained in this manner can enter into 
general rules and serve either as the data from which predictions are to 
be made or as the data which we wish to predict. 
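
The degree of agreement between observers described above can be given a number. The following is a minimal sketch, not Doll's procedure: it computes simple percentage agreement and a chance-corrected index (Cohen's kappa, a later statistic brought in here purely for illustration) for two observers judging whether a behavior category is present. The function names and the judgments are hypothetical.

```python
from collections import Counter


def percent_agreement(a, b):
    """Share of cases on which two observers gave the same judgment."""
    if len(a) != len(b) or not a:
        raise ValueError("need two non-empty rating lists of equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def cohen_kappa(a, b):
    """Agreement corrected for the agreement expected by chance alone."""
    n = len(a)
    observed = percent_agreement(a, b)
    count_a, count_b = Counter(a), Counter(b)
    # Chance agreement: product of the two observers' marginal proportions,
    # summed over every category either observer used.
    expected = sum(count_a[c] * count_b[c] for c in set(a) | set(b)) / (n * n)
    if expected == 1.0:
        return 1.0  # both observers used one and the same category throughout
    return (observed - expected) / (1 - expected)


# Invented judgments on "able to clothe himself" for eight children
# (True = behavior present). These are illustrative values, not data.
observer_1 = [True, True, False, True, False, False, True, True]
observer_2 = [True, True, False, True, False, True, True, True]

print(percent_agreement(observer_1, observer_2))  # 0.875
print(round(cohen_kappa(observer_1, observer_2), 3))  # 0.714
```

The point of the correction is that two observers who both say "present" most of the time will agree often by chance; an index of the observer as an instrument should discount that.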

During the first century of the development of psychology, we have 
made great efforts to be objective. We hoped to achieve this by limiting 
ourselves to the categories of physics and using as the weather signs of 
behavior only the physical or chemical events normally activating sense 
organs. The determinations of absolute thresholds in the various senses, 
the hope that response could be treated just as movement in space which 
was the crude interpretation of behaviorism, failed to carry us very far 
toward the understanding of behavior. The reason for this is that we 
cannot reduce the classes of psychological facts which make up the data 
we must deal with to component movements in space. Patterns of stimuli 
and patterns of response have their psychological significance and useful- 
ness tied to their patterning—pattern as pattern must be recognized and 
dealt with. Machines can be devised to respond to pattern, but the 
human observer remains the only practical tool we have for the recogni- 
tion of patterns in their variety and multiplicity. The more simple data 
are not to be despised. Like the Berger rhythms there is always a chance 
of relating them to important psychological events or to the underlying 
mechanisms of behavior. Fechner’s law, for instance, has some note- 
worthy applications to psychological behavior. 

The history of our effort to use as the weather signs of behavior sim- 
ple physical or chemical changes involving sense organs is an interesting 
one. One difficulty it encountered was that these stimuli did not always 
stimulate. Anesthesias or the more complex phenomena of attention 
could be invoked to explain failure. But the real failure goes deeper. The 
patterns of physical change that occasion response, we find ourselves in- 
evitably describing in perceptual terms. It is not enough that they be 
available in the physical situation nor is it enough that the organism’s 
attention orient sense organs to receive them; it is further necessary that 
they have meaning for the responding organism. In describing the be- 
havior of animals we speak of such stimuli as “the sight of the box,” “the 
sound of the food pellet,” “confinement in the goal box.” “Confinement” 
is a complex relation that is not just a matter of physical barriers. With 
human beings, such stimuli as “hearing a word,” “viewing a printed 
syllable” are necessarily used as stimuli. We do not specify the type style 
in which the syllable is printed except when we are overscrupulous, be- 
cause it is the syllable as legible rather than as a physical pattern that is 
significant. In other words, the observing of stimuli must include noting 
that they stimulate. That is ordinarily open to observation. We judge it 
in terms of the (usually) immediate alteration of response. Having ob- 
served that a given pattern does this, we assume that it will be effective 
on certain other occasions. Only physical patterns that are stimuli in 
this sense can serve as associative signals. Psychologists, without having 
ever recognized this explicitly, always choose their signals in the light of 
their experience that they do get responded to by the organism. 

It is worth noting that a strong reinforcement of the tendency to 
reduce both stimuli and responses to elementary physical changes at 
sense organs or to muscular contraction and secretion followed the work 
of Pavlov, who was a physiologist, and not a psychologist. Pavlov [12] 
was not consistent in this, and “presentation of food” was assumed to 
be a simple physical stimulus although it was essential that the dog per- 
ceive it as food. Similarly the registration of drops of saliva emphasized 
one small feature of a complicated response. 

To object to treating a simple physical change as a stimulus or a 
muscular contraction as a response is not to deny that all stimuli are 
analyzable into such physical changes or to deny that any specific re- 
sponse is analyzable into muscular contractions and glandular secretions. 
That should be assumed. It is, however, a denial that the psychological 
description of behavior can be made in physical terms. It requires psy- 
chological terms which will name recurring patterns of physical change 
usually requiring identification by an observer which will include recog- 
nition of their stimulus value usually judged by time relation to the 
response. A CS is not a stimulus for a CR until the association has taken 
place. It is an accident of our manner of speaking that we refer to the 
sound of a buzzer as the CS whether it is a stimulus or not. We should 
have a symbol to indicate a physical pattern we expect to act as a CS. 

What has been said about the factual data in terms of which we de- 
scribe behavior changes implies that it is primarily the presence or ab- 
sence of specific stimuli and specific responses that is observed, not their 
intensity or degree. Simple physical stimuli like noise, pressure, light can 
be noted in terms of degree. A few responses, like the dog’s reaction to 
Pavlov’s “presentation of food,” can be measured in some detail by 
counting number of drops of saliva secreted or the range of a leg retrac- 
tion. But most responses, as observed, occur or do not occur. We note 
their presence or absence. The efforts to abstract measurable attributes of 
response like latency, vigor, time required to extinguish fail to agree with 
each other and so cannot be regarded as measures of response strength. 
Only one measure shows promise. William Estes [2] has suggested a 
statistical description of response strength in terms of response certainty. 
This is, of course, dependent on the observation of presence or absence 
and does not imply degrees in the response but a calculus of the prob- 
abilities of the occurrence of the physical events that may serve as stimuli. 
He makes, however, two assumptions that may violate the nature of the 
actual process. One is that the elements that make up a situation have 
independent probabilities. The world does not present itself to men in 
that fashion. Physical stimuli (if we may by that term refer to the phys- 
ical events that can serve as stimuli) come organized in objects and re- 
curring sequences. We encounter the same object again and again. Our 
classification of objects into kinds and into genera and species, our names 
for recurring events, are not just our own responses to the world but a 
recognition of the way the world is. The physical structure of man is an 
adaptation to this repetitiousness in nature. Our brain and nervous sys- 
tem is an elaborate machine for recognizing objects and situations and 
for the evocation of appropriate response. The development of visual 
perception would seem to lend force to Estes’ assumption, it is true. We 
may start out completely naive and it is only experience that gives rise 
to the phenomena of visual perception. But some data from instinctive 
responses to vision, like the behavior of Thorndike’s chicks which jumped 
or did not jump from a box according to its height, and some of the 
gestalt observations on general features of perception at least raise a 
question. The second assumption is more doubtful than the first, namely, 
the assumption that the effect of elements is additive. Patterning, rather 
than summation, appears to be the effective determiner of response. We 
may eventually conclude that patterning itself may be treated in Estes’ 
terms and that the presence of a certain proportion of the elements of a 
pattern determines the response. George Boguslavsky, in an unpublished 
doctoral thesis, is attempting to deal with the phenomenon of increasing 
certainty of response with additional pairings of signal and original stim- 
ulus by assuming a limited group of stimulus patterns associated with the 
general situation in which the learning occurs and also assuming that the 
possible patterns (such as the possible postures or activity open to 
Pavlov’s dog in the experimental harness) being equally probable are 
all gradually “conditioned” to the response, so that the response to the 
signal is then highly probable. 
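
The two assumptions questioned above, independent elements and additive effects, can be made concrete in a small calculation. This is a simplified sketch of a statistical account of this kind, not a reproduction of Estes' own equations: it assumes that on every pairing each stimulus element is sampled with the same independent probability `theta`, that a sampled element becomes attached to the response, and that the certainty of response equals the proportion of attached elements. The names `p0`, `theta`, and the function itself are hypothetical.

```python
def expected_response_probability(p0, theta, trials):
    """Expected proportion of attached elements after a number of pairings.

    p0     -- proportion of elements attached to the response before training
    theta  -- probability that any one element is sampled on a pairing
    trials -- number of pairings of signal and original stimulus
    """
    if not (0.0 <= p0 <= 1.0 and 0.0 <= theta <= 1.0) or trials < 0:
        raise ValueError("p0 and theta must lie in [0, 1]; trials must be >= 0")
    p = p0
    for _ in range(trials):
        # Independence assumption: every still-unattached element has the
        # same chance theta of being sampled, and so attached, on this trial.
        p = p + theta * (1.0 - p)
    return p


# Additivity assumption: response certainty is just the attached proportion.
curve = [expected_response_probability(0.0, 0.25, n) for n in range(5)]
print(curve)  # [0.0, 0.25, 0.4375, 0.578125, 0.68359375]
```

Under these assumptions the certainty of response rises by a fixed fraction of the remaining distance on each pairing. If stimuli come organized in objects and recurring patterns, as argued above, neither the common `theta` nor the simple proportion would be expected to hold.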

Where the description of learning shall start is a moot point. Tolman’s 
suggestion [16] that the behavior of a rat at a choice point is a 
key event is well taken. Watching Dr. Paul E. Fields’ motion pictures of 
what Tolman has called “vicarious trial and error” one is struck with 
the obvious and active scanning of the row of potential cues, one of 
which indicates the openable door, and the others punishment. The 
learning has occurred on previous occasions, because now the VTE is 
so often followed by the “right” choice. The scanning is obviously for 
familiar features, and it is tempting to say that the recognition of some 
of the patterns provokes cautionary behavior that is essentially the be- 
havior of avoidance called out by punishment, but now associated with 
the pattern. The recognition during the scanning of a pattern to which 
the last response was jumping is followed by a jump. 

But the last response to one of the “wrong” figures may also have 
been followed by a jump. The jump was followed by impact on the door 
and falling into a net. I see no way to avoid the description of this in 
Tolman’s terms as vicarious trial and error in the sense that the sight of 
the wrong picture evokes, though not the prompt execution, at least some 
“running through” of the previous catastrophe with results that inhibit 
jumping. Tolman calls this “expectancy” and the term is acceptable, 
though I would think of expectancy in terms of the actual evocation of 
avoidance movements. These movements are not ordinarily overt and 
complete. That would require an actual jump and impact; but they are 
sufficient to interfere with jumping. 

The law of association by contiguity is perhaps a feeble description 
of this, and it would have to be supplemented by further description of 
the nature of minimal rehearsals of previous behavior and the circum- 
stances under which they occur. Thurstone [15] identified the capacity 
for such minimal rehearsal with intelligence. The law of association by 
contiguity is an attempt to describe the essential event. The situation is 
scanned for familiar features. When a familiar pattern is discovered, the 
response last following that pattern will be evoked. 
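
The rule just stated can be written out as a toy procedure. This is only a sketch under strong assumptions: patterns are taken to be already recognized and named, one response is recorded per pattern, and interference from other stimuli and the distinction between minimal and overt response are both ignored. The class and its method names are hypothetical.

```python
class ContiguityLearner:
    """Toy model: a recognized pattern evokes the response that last followed it."""

    def __init__(self):
        # pattern -> the response last made in the presence of that pattern
        self.last_response = {}

    def observe(self, pattern, response):
        """Record that `response` was the last act made while `pattern` was present."""
        self.last_response[pattern] = response

    def respond(self, pattern):
        """Return the response evoked by `pattern`, or None if it is unfamiliar."""
        return self.last_response.get(pattern)

    def scan(self, patterns):
        """Scan a situation in order; return the first familiar pattern and its response."""
        for pattern in patterns:
            if pattern in self.last_response:
                return pattern, self.last_response[pattern]
        return None, None


learner = ContiguityLearner()
learner.observe("circle", "jump")
learner.observe("circle", "hold back")  # the later act replaces the earlier one
print(learner.respond("circle"))  # hold back
print(learner.scan(["square", "circle"]))  # ('circle', 'hold back')
```

Nothing in the sketch accumulates strength with repetition; the association is simply overwritten by whatever was done last, which is the whole content of the rule as stated.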

This, of course, has pathetically little resemblance to such a law as 
the law of gravitation. It contains the ambiguity over minimal and overt 
response which must be faced with some such device as just described. 
The fact that the last response associated with the stimulus may not 
occur because other stimuli are also present and may interfere does not 
differentiate it from gravitation, however. All physical objects are sup- 
posed to be following the law of gravitation; but this gives little informa- 
tion about psychological behavior or the difference between Republicans 
and Democrats, though no one would deny they are at all times gravi- 
tating objects. No one has ever found any object to do more than ap- 
proximate the course that the law of gravitation would predict, and we 
do not even know what the gravitational course of any object would be 
in a universe that contained more than two objects. The law yields only 
approximations of the courses of masses in which other forces are at a 
minimum. The disadvantage of the law of association is that its terms are 
not quantified, and in the prediction of the presence or absence of a 
response from the presence or absence of a stimulus (assuming no 
interference from the general situation—no conflicting responses) our 
stimulus datum cannot be easily isolated, and it is harder than physical 
movement to specify, to name, to observe. 

We may well ask how much advance learning theory in terms of asso- 
ciation by contiguity has made since Aristotle. I am not convinced that 
the advance has been substantial and I do not believe that it has ad- 
vanced through the multitude of laboratory studies of the last genera- 
tion. The direction of real advance may be in the direction that Estes is 
exploring—in the development of statistical methods of assigning prob- 
abilities to events in the light of experience. What Estes is proposing will, 
if it can meet the problem of natural patterns, give us quantified predic- 
tions that can be compared with actualities and enable us to add en- 
lightening specifications to the description of stimuli that will put us in a 
better position for prediction. Statistical methods are also giving us a 
sound basis for measuring the objectivity of our human observer as a 
scientific instrument. 

Stimuli are, of course, not the only data needed for prediction. Past 
associations, which Hull embodies in the concept of habit strength, must 
be supplemented by information of the state of the organism, which Hull 
covers with his concept of drive. I have tended to think of drive as cov- 
ered by the stimulus situation, but the fact that in a few drives the inter- 
val of deprivation can be used as a rough indication of effect may justify 
the concept. In any case, the notion that we have in drive a variable 
measured in units and entering into the prediction of response as a multi- 
plier of habit strength is probably an illusion. 

There was another concept that Janet proposed. It has failed to 
interest psychologists. Janet, as a result of a lifetime dealing with mental 
patients, used the notion of force mentale [9] as a predictor of behavior. 
He believed that the capacity for facing decisions and for meeting new 
situations could be related to such a dimension. With fatigue, for 
instance, decisions become difficult and imposed choices lead to functional 
disorder. Janet proposed no explanation of this force mentale in 
physiological terms, but his use of it would suggest that it could be taken 
to include any form of impairment of the physiological mechanisms 
reinforcing action, breathing rate, pulse strength, glandular irregularities, 
defects in metabolism, which would interfere with the action of muscles 
by depriving them of oxygen, blood sugar, or the removal of fatigue 
products. Such defects modify profoundly the nature of psychological 
adjustments. Extreme fatigue is often responsible for aphasia and a sharp 
reduction in the scanning processes of the normal organism which are 
behind the solutions of difficult situations. Current psychological theory 
takes almost no account of man as an energy system or of the alterations 
in behavior that interested Janet. 

Man is an energy system and it may be assumed that, lacking the 
atomic events that involve the conversion of mass into energy, he illus- 
trates the principle of the conservation of energy. But he is not a closed 
system, and the psychological significance of behavior is not in the end 
describable in terms of energy. Flight and attack, approach and avoid- 
ance, correct and incorrect response may all involve the same energy 
output. It is only when, through malnutrition or excessive energy output, 
physical distress intervenes to act as a drive or as a stimulus component 
to modify response, that psychological categories are involved. Janet uses 
to good effect the description of situations which arouse the phenomena 
of emotional reinforcement of action, increased breathing, pulse, blood 
sugar supply, lactic acid elimination, and other factors affecting muscle 
contraction, though he does not catalogue these physical effects. H. M. 
Johnson once called attention to the behavior symptoms common to the 
effects of alcohol and the effects of fatigue. Fatigue has been notably 
difficult to measure in action though the changes it involves in the 
chemistry of muscle are definite. The psychological use of fatigue or of 
physiological impairment of the action system will remain open only to 
treatment involving general correlations between symptoms or categories 
of physical condition and classes of behavior effects. It will not approach 
the prediction of specific acts except in connection with other informa- 
tion. Its predictions will resemble the recently popular fad for linking 
frustration and aggression. These are both “fuzzy” categories, hard to 
define and not productive of close agreement in cases. And their use by 
psychologists has been rather more as ethical justification than objective 
description. But they are not without meaning or use. An attempt to 
use the psychological observers systematically would probably show that 
frustration has often other consequences and aggression other causes. 

Janet’s use of force mentale illustrates a field of behavior prediction 
in which the predicting categories are basically physiological, whereas 
the predicted behavior patterns are psychological. He explains, for in- 
stance, cases in which neurotic symptoms like indecision, extreme fatigue 
not the effect of ordinary exertion, phobias, and many other symptoms 
disappear during a period in which the individual discovers new sources 
of self-stimulation, like quarreling, stealing, dangerous sport, arson, as 
based on the evocation of normal emotional reinforcement which re- 
stores the normal process of adjustment. 


The General Nature of Theory 


A scientific explanation of a class of events states the rule of the 
event. Theory goes further than that. Theory arranges the rules in a 
hierarchy of generalizations. Specific laws are in their turn explained 
(or reduced) by showing that they are instances of more general rules. 
The explained rules are not discarded, because they are more readily 
applied in their specific forms. 

An illustration of reduction is the attempt to reduce the law of ex- 
tinction of conditioned response following repeated elicitation to a more 
general statement of association by contiguity, or in other words, to show 
that the phenomenon of extinction and also the failures to obtain ex- 
tinction are both explainable by the law of association. Another example 
would be the attempts to explain associative inhibition as always a case 
of the association of an inhibiting response rather than as an independent 
rule. 

The reasons for such attempted reductions are that they condense 
and simplify the rules that must be learned and at the same time cover 
the phenomena more adequately. The writer has made efforts to show 
that the phenomena of conditioned inhibition, extinction, reinforcement 
by effect, improvement by practice, the increase of certainty of a CR to 
a given CS with repetition are more adequately dealt with by the general 
principle. As stated, they all have exceptions and their failures to predict 
can be understood as cases of simple association. 

Hull adopted and made his primary law a modification of Thorn- 
dike’s Law of Effect which had been developed by C. Lloyd Morgan 
[10] and by Hobhouse [8]. A signal originally not eliciting a particular 
response becomes a signal for that response if the association is 
“confirmed” or “rewarded” or “reinforced” by the stimuli that it brings about 
or by the reduction of drive that it causes. 

This can be illustrated over and over. But there are many ambiguities 
about what constitutes reward or reinforcement. As Skinner uses the term 
“reinforcement,” any consequent that increases the certainty of the 
response is a reinforcement. 

The sequence of CS, CR, and reinforcement can be viewed from an 
entirely different point of view. When a signal has been followed by a 
response that prevents the signal from recurring, there is no opportunity, 
on any theory of learning, for the attachment of that signal to other 
responses. If an association is established, it is secure from alienation 
until the signal is again present. This alienation will not occur, on any 
theory, until the CS is again present and a different R is introduced. The 
association is protected from unlearning, whereas if the signal remains 
present there is opportunity for new behavior to replace the response to 
the cue and the action may be unlearned. This makes specific the nature 
of a reward or reinforcement. They are defined in terms of straight 
association by contiguity. They protect learning that has occurred by making 
unlearning impossible. The only theory that would allow the association 
to be destroyed in the absence of the signal is the rather discredited 
theory that forgetting is a function of pure time, and that would operate 
against a rewarded association as well as an unrewarded one. 
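
The argument that reward protects an association by removing the signal can be traced through a short example. This is a sketch only, assuming that each act made while the signal is present replaces the previous association and that acts made after the signal has gone cannot be attached to it. The function name and the acts listed are hypothetical.

```python
def final_association(responses, removes_signal):
    """Return the response left attached to a signal after one episode.

    responses      -- acts performed in order, starting with the signal present
    removes_signal -- set of acts that end the signal (the "rewarded" case)
    """
    attached = None
    for response in responses:
        # The last act made in the presence of the signal becomes its response.
        attached = response
        if response in removes_signal:
            # The signal is gone, so no later act can replace the association.
            break
    return attached


acts = ["paw at bars", "press latch", "sit", "wash"]

# Pressing the latch ends the confinement: the association is protected.
print(final_association(acts, {"press latch"}))  # press latch

# No act removes the signal: each later act replaces the one before it.
print(final_association(acts, set()))  # wash
```

On this reading no separate strengthening by effect is needed: the rewarded act survives only because it was the last one made while the signal was still present.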

The writer has no adequate models to suggest. We can always ask 
whether a machine could be constructed to do what the human organism 
does and such a question has great value. Most of us accept the notion 
that animals and man are both physical structures and that they operate 
in some sense as machines. Motion pictures of the journey of phagocytes 
through the interstices of a blood vessel wall to the area of a bruise and 
their subsequent return, engorged with the debris of the escaped blood, 
through the walls of the blood vessel to the blood stream give a new im- 
pression of the marvelous economy of the physical body. Hints from the 
records of the electrical activity of the brain and new developments in 
electronic physics have thoroughly done away with the conception of the 
brain as like a telephone exchange. Even the modern telephone exchange 
no longer resembles much the earlier form that was familiar in the text- 
books as a model of brain action. 

Models serve to help us conceive the relations between stimuli and 
responses, but they follow theory and do not lead. Not many of us under- 
stand the intricacies of a modern analogue computer, but there are strik- 
ing analogies between human thinking and the new machines which will 
probably be exploited in the explanation of behavior. 


Scope of Theory 


A great deal of psychological information will continue to have little 
relation to theory, as is the case in medicine. Specific devices and meas- 
ures that have good effects in practice are scientific only in the sense that 
the methods and the effects are described in objective terms and their 
relation verified by the use of accepted statistical analysis. It is in the 
great advances in this field that psychologists have made their chief 
contributions during the last generation. Methods begun by workers in 
psychophysics have contributed a large share to this development, in 
spite of the fact that the interest of psychologists is no longer devoted 
to the analysis of sensation. Although psychologists have contributed to 
their development, statistical methods are not limited to psychology in 
their application and in that sense are not psychological. Some of them 
were developed in agricultural research, some in medicine, and some in 
sociology. 

What is properly called psychological theory is made up not of mod- 
els, and not of the isolated practical generalizations which guide practice, 
but of a body of more general statements which make up a hierarchy of 
rules in which more general rules form the explanatory basis for the more 
specific and serve as the conceptual structure which is a mnemonic device 
for teaching a science and as a guide for research. It was Hull’s ambition 
to construct such a system. 

Since axioms are really definitions and the rules of deductive reason- 
ing are the folkways of human discourse and not absolute or cosmic, a 
system is not ultimately right or wrong but useful or not useful in the 
public in which it is used. One very great danger is inherent in the 
adoption of any system. Once it is agreed upon, the future development 
of a science is given direction by it. The writer’s chief objection to any 
hypothetico-deductive system is not so much to the use of systems as to 
the danger of a premature restriction on the search for facts. Postulating 
a Thorndikean Law of Effect discourages research into the nature of the 
effect and into the problem of how specific effects influence the develop- 
ment of learning. A system may be productive of research, but research 
has no value in itself. It is knowledge that we are after rather than re- 
search and the test of a system is the light it throws on an area, and in 
psychology, not just the amount of prediction it makes possible, but the 
ability to predict what is worth prediction. Practically all research results 
in prediction, but if it is merely the prediction of how rats will behave 
under certain complicated conditions found only in a number of psycho- 
logical laboratories, we have not furthered knowledge or science. 

On the other hand, without a point of view, without some system 
there is no science. It was the restriction of physics to description in terms 
of mass, length, and time that made physics a science. In the writer's 
opinion, the restriction of psychology to objectively describable behavior 
is necessary for the establishment of a science of psychology. The psy- 
choanalytical account of man will not be scientific until that restriction 
is observed. Janet remarked that the concept of the unconscious could 
be used scientifically only if it were used to name certain types of observ- 
able behavior. Explanations of animal behavior in such terms as “The 
animal sees that . . . ” or “The cat thinks that...” or “The rat 
expects that . . . ” are allowable only when specific, recognizable ac- 
tions are indicated. As stated, they use language categories. What follows 
the word “that” is a statement. Animals do not have language and the 
description of their actions implying language is metaphorical and not 
scientific. We can observe action which is legitimately described as “The 
cat sees the mouse,” or “The cat hesitates and then... ” or “The 
cat responds to the sight of the release mechanism with posture and 
movements appropriate to escape.” If we use the word “expects” to 
indicate a type of observable action we are on safe ground. The dog 
that cringes at the sight of an upraised hand may be described as 
expecting a blow. 

If our interest is in system we may disregard the long-distance 
predictions of behavior that can be made from a knowledge of the species, 
from a knowledge of the culture, from a knowledge of the past of the 
individual, and give our attention to the moving event, the immediate 
weather signs of the next action. The data here include certain states of 
the organism: fatigue, excitement, depression, physiological drives in 
hunger, thirst, sex, oxygen—carbon dioxide imbalance. 

We have, in addition to these, the present stimuli to which the animal 
is responding. Such a prediction basis would make a physicist despair. 
Despair, however, is not in order because there is open to the psychologist 
the same chance to minimize the complexity of his generalizations by 
keeping some of the data relatively constant or partialling out their 
effects by statistical devices or by resigning himself to high error variances. 

As psychologists, our chief concern is with the phenomenon of 
learning. This process is what we usually mean by the word “mind.” The 
possession of a mind means the ability to learn. The word “learning” 
has a variety of meanings. The Russians use the same word for “learn” 
and for “teach,” as does an occasional American. A great source of con- 
fusion has been the tendency to use the word with a value sense as mean- 
ing “improvement.” It is probably this use which accounts for the strong 
appeal of reward theory and for the misuse of Pavlov’s term “reinforce- 
ment” for “reward.” 

It has been mentioned that the advance of physics dated from its 
convention to neglect most aspects of the world and to abstract from the 
world’s complexity, for purposes of description, certain of its features 
most easily agreed upon. Man is a complex physical system, but physics 
can describe him only in terms of mass, length, and time. 

A like abstraction must precede the development of a science of mind. 
The first abstraction, now a matter of general agreement, is to limit the 
account to behavior and neglect the phenomena of inner experience ex- 
cept as they can be translated into action. 

Within the limits set by a restriction to behavior, there is a deep- 
seated difficulty that psychologists still confront. This difficulty, in the 
opinion of the writer, will always prevent the development in psychology 
of a systematic basis resembling that of physics. It means that the life 
sciences cannot be reduced to the same predictive basis as the physical 
sciences. The difference is a matter of degree, not of kind. It centers 
about the description of behavior in terms of purpose. The quintessence 
of living behavior, and particularly that area of living behavior we call 
mind, lies in its purposiveness. 

The growth of the language tools we use to describe the world is by 
extension of symbolic meanings from the recurring experiences which 
give rise to nouns and verbs based on those shared experiences. The no- 
tion of the atom—the uncuttable, the irreducible bit out of which matter 
is made—had its start in noting that changes in the shape of objects 
could be rearrangements of particles which themselves did not change 
shape. The Greek atomists were close to this primitive meaning. But even 
modern science, which has found it no longer necessary to think of the 
atom as “uncuttable,” tends, at least for the sake of communication, to 
speak of “particles” which make up the atom, though sophisticated 
physicists, who confront the conversion of matter into energy, are well 
aware that this word is inadequate. 

In like fashion, the notion of purpose, which had much the same 
linguistic history as “propose” in that both were made of words meaning 
“set forth,” was descriptive of a behavior category of great importance 
in predicting behavior. What men propose can be used profitably in 
predicting what they will do. Biological science has been forced to use 
the word “purpose” to describe events and structures which are no 
longer assumed to have a “proposer.” The concept is extended to cover 
those events in which the outcome can be predicted although the means 
by which the outcome is achieved exhibit variety and cannot be pre- 
dicted. A seed of corn will germinate in a wide variety of circumstances. 
It is not the seed that proposed this, but the observer who now predicts 
the outcome even when he has no notion of the physical process by which 
it will be achieved. We can predict certain acts of men without asking 
what they purpose or propose. In the case of our own acts we are seldom 
aware of the movements by which they will be carried out. We may not 
even be aware that we possess a repertoire of movements leading to the 
end achievement. But every attainment of the result is carried out by a 
specific set of movements, depending on the situation and on the past 
learning of the individual. 

Purposive behavior is defined in terms of its end results, not in terms 
of process or in terms of its antecedents. Purposive descriptions specify 
the outcome of action but do not specify its course. We do not need pur- 
posive terms for describing invariable sequences. In fact, invariable se- 
quences are what constitute mechanism. A mechanism like an automatic 
pilot may achieve a definite end result through an indefinitely large 
number of ways, but each of the sequences by which the result is 
achieved we assume to be definitely predictable once the initial situation 
is given. The practically infinite number of different sequences by which 
the automatic pilot brings the ship back on its course from any 
deviation form a class of sequences whose general law is known. A body in 
stable equilibrium is a similar case. Any of an infinite number of dis- 
turbances of that equilibrium will end in a restoration of the original 
position. 
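
The contrast between a predictable end result and an indefinitely large number of sequences leading to it can be illustrated with a minimal sketch. It assumes, purely for illustration, a one-dimensional heading error and a simple proportional correction. The function name `steer` and the parameters `gain` and `tolerance` are hypothetical and form no part of the original account.

```python
def steer(heading, target=0.0, gain=0.5, tolerance=0.01, max_steps=1000):
    """Return the sequence of headings produced while correcting toward target.

    Assumption: each step removes a fixed fraction (gain) of the remaining
    deviation. Real automatic pilots are more elaborate; this only shows that
    each sequence is fixed once the initial deviation is given.
    """
    path = [heading]
    while abs(heading - target) > tolerance and len(path) <= max_steps:
        heading += gain * (target - heading)  # correction proportional to the error
        path.append(heading)
    return path


# Different initial deviations give different sequences of headings,
# yet every sequence ends within the tolerance of the set course.
for start in (-40.0, 3.0, 25.0):
    path = steer(start)
    print(start, len(path), abs(path[-1]) <= 0.01)
```

Each sequence here is fully determined by its starting deviation, which is the sense in which the whole class of sequences follows one known general law.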

Human behavior has a strong resemblance to the body in stable 
equilibrium or to the automatic pilot. The physiologist’s conception of 
homeostasis describes certain end states, like body temperature or the 
concentration of mineral salts in the blood stream, which can be pre- 
dicted while the body is in working order. They can be predicted with- 
out knowing the specific mechanisms involved—predicted merely on the 
basis of observation. But the physiologist assumes that there are the 
necessary mechanisms which form the basic explanation for the main- 
tenance of the constant states. 

When we reach to grasp an object, the course of the hand is subject 
to correction through vision much as the automatic pilot corrects the 
course of the vessel when the ship veers right or left. Purposive behavior 
that is predictable in terms of end result in a variety of initial circum- 
stances and a consequent variety of sequences does resemble the opera- 
tion of an automatic pilot. A human pilot is dependent on the move- 
ments of his compass. Both pilots respond to compass changes by ap- 
propriate movements which turn the wheel. In one case, the course is 
set by adjusting the position of an indicator on a scale of degrees. In the 
other case, the course is set by a phrase spoken by the mate. We assume 
that the movements of both and their effects are describable in terms of 
physics. The energy of one is furnished by the ship's cook and of the 
other by fuel, through engine and battery and motor. 

But the analogy cannot be pushed too far. We assume that the 
steersman is a machine but not a machine whose behavior is describable 
in mechanistic terms. It requires a psychological description. One reason 
for this is that the responses of the automatic pilot form a simple class 
of mechanistic sequences whose law is known, but the steersman is sub- 
ject to determiners of many sorts. His attention wanders. The attention 
of the automatic pilot is superhumanly concentrated on the compass. 
The steersman is influenced by his public relations, his employer rela- 
tions, his personal relations, his state of health and digestion, his number 
of hours without sleep, his training, his religion. The automatic device 
has some faint analogies to these determiners, such as corrosion and lubri- 
cation. But these are more controllable and less complex. The outcome is, 
therefore, more predictable. 

The predictions of science have a peculiar characteristic. They are 
never made in absolute terms, but only as contingent predictions. This is 
illustrated if one tries to press a meteorologist to make an absolute and 
unqualified prediction of the weather, which, like human behavior, has 
many determiners, hard to combine. All the meteorologist’s rules are 
contingent. If A, then B. But if A’, then B’. If A’ and A are both present 
and B and B’ differ and are incompatible, the event is not covered by 
our knowledge and we require another generalization in which AA’ is 
the antecedent. Combining all possible contingencies into an absolute 
prediction is not meteorology. Absolute prediction can be done only on 
an ad hoc basis by searching for a state of the world as like the present 
combination of circumstances which confronts us, and predicting what 
followed on this previous occasion. That is weather prediction, but it is 
not a science of weather. 
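
The structure of contingent prediction described above can be sketched in a few lines. This is only an illustration under stated assumptions: rules are represented as a mapping from a set of antecedent conditions to an outcome, and the names `predict`, `rules`, and the outcome label "B''" are hypothetical. When A and A' are both present and their outcomes differ, the sketch returns no prediction unless a further generalization with AA' as antecedent has been supplied.

```python
from typing import Dict, FrozenSet, Iterable, Optional

Rules = Dict[FrozenSet[str], str]


def predict(present: Iterable[str], rules: Rules) -> Optional[str]:
    """Return the outcome of the most specific applicable rule, or None.

    None means the event is not covered by present knowledge: either no
    rule applies, or the applicable rules name differing outcomes.
    """
    present = frozenset(present)
    applicable = [a for a in rules if a <= present]
    # Keep only antecedents not contained in a more specific applicable one.
    maximal = [a for a in applicable if not any(a < b for b in applicable)]
    outcomes = {rules[a] for a in maximal}
    return outcomes.pop() if len(outcomes) == 1 else None


rules: Rules = {frozenset({"A"}): "B", frozenset({"A'"}): "B'"}
print(predict({"A"}, rules))        # B
print(predict({"A", "A'"}, rules))  # None: B and B' differ

# A further generalization with AA' as antecedent covers the combined case.
rules[frozenset({"A", "A'"})] = "B''"
print(predict({"A", "A'"}, rules))  # B''
```

The sketch makes no attempt at an absolute prediction: every answer remains conditional on which antecedents are observed to be present.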

Several recent efforts to introduce absolute prediction into psychology 
have been made. Hull’s effective reaction potential attempted to combine 
all known determiners into an unqualified prediction. That requires not 
only a set of generalizations about particular determiners on a contingent 
basis, but a rule for combining them. All that could be suggested for their 
combination was logical multiplication, and the effort was probably very 
premature. 

The more or less successful efforts of psychologists to make predic- 
tions without qualification and so meet the problem of multiple deter- 
miners are in the fields of testing, which require no psychological theory 
and are attacked by the methods of statistics. How the results come about 
is a matter of indifference. 

A complete reduction of behavior to statements of how effects are 
achieved would carry the analysis to physiological mechanisms. There 
are certain difficulties in the way of this reduction. The mechanisms we 
discover in the human body include the response of sense organs to stim- 
ulation, the initiation and conduction of nervous impulses, the initiation 
of muscular contraction. There are hints at other mechanisms, which we 
do not understand, in the brain waves. “Understanding” them would 
involve discovering the rule of their operation. We assume these physio- 
logical events are all mechanisms in the sense that they are describable in 
terms of fixed sequences of physical and chemical change. 

That assumption will not work for the behavior of the steersman on 
a psychological level. The reason it will not work is that all our linguistic 
apparatus for describing behavior is developed in purposive and not in 
mechanistic terms. It is the end results of action that are of importance 
to us and for which we have developed words. The specific movements 
by which these end results are accomplished vary and have no names. 
We do note occasionally that an individual exhibits his own characteristic 
style of execution, but names for individual styles are not available. 
Human acts and animal acts are all named in terms of their end results. 
Signing a paper, uttering a word, turning on a light, pressing a lever, 
going to bed, adding two numbers, all name end results which are open 
to observation, can be objectively recognized. But they are names for 
classes of movement—responses infinite in variety. Our movements de- 
termine our signature and its genuineness can be established in court by 
agreement of expert witnesses. Movements peculiar to the individual 
result in the accent he gives a word. There are a few class names for dia- 
lects but the features that enable us to recognize a friend’s voice will 
never have names. Nouns and verbs apply only to classes of events or 
objects. Proper names are not material for science except a possible science 
interested in their general features. 

The attempt of psychologists and physiologists to isolate the basic 
mechanisms which explain learning is a laudable one. We must assume 
these explanations exist. We shall probably continue to search for the 
mechanistic sequences that change the effect of physical stimuli and so 
form the basis of learning. Pavlov’s work represents such an attempt. 
Pavlov tried to use simple physical change at sense organs as stimuli and 
to record salivary flow as response. The handicap we face in such efforts 
lies in the fact that in an intact animal the functioning of physical 
changes as stimuli depends on the previous organization of these into 
discriminated patterns, and on the fact that responses likewise are so or- 
ganized. Pavlov believed he was avoiding this difficulty by dealing with 
a reflex, but it must be granted that he did not succeed in getting down 
to a reflex level. His talk of such things as a “defense reflex” betrays his 
dependence on description in terms of end result, and his stimuli involve 
complicated organization into perceived patterns. My colleague Roger 
Loucks has attacked the problem of the basic mechanism by placing 
minute coils within the central nervous system and studying the opera- 
tion of induced electrical shock as eliciting stimulus and as the signal 
in associative learning. 


SUGGESTIONS TOWARD A SYSTEMATIC THEORY 


This section is being written in response to a request that its writer 
attempt to take the role of a systematist and state in orderly form the 
outlines of a systematic account of learning. What has preceded is in re- 
sponse to a suggestion that the system be introduced by an account of the 
orienting attitudes that have “determined the aims, inductive basis, con- 
ceptual content, or formal organization of the system.” At this point a 
confession is in order. The system will not go beyond orienting attitudes. 
Perhaps the rudiments of a system will appear as an attempt is made to 
follow the rubrics of the suggested outline. 

The independent variables. From what has been already said it is 
evident that the prediction of immediate behavior (passing over such 
long-term predictions as can be made from species, culture, life history, 
and like data) is made from observed stimuli. The ambiguities current 
in the use of this word are serious and they have been fully realized by 
many psychologists. For the purposes of this chapter we may make use of 
a convenient Aristotelian device and define stimuli as the physical 
changes which are potential occasions for the initiation of sense organ 
activity and consequent afferent activity leading to response. In the 
absence of a number of conditions, the physical stimuli S, though present, 
may not be effective. A failure of attention, for instance, may prevent 
response. This prevention is not merely a question of the control of the 
actual impact of the change on sense organ, but involves adjustments 
within the sense organ (like the adjustment of the lens in the eye, or the 
tympanum of the ear) and also involves central coordination which pre- 
vents response through inhibition. We need not only a word for the po- 
tential physical stimuli S but for the potential physical stimuli that stim- 
ulate. We might call these functional stimuli. The most common use of 
the word “stimulus” by psychologists assumes far more than the presence 
of physical stimuli. It assumes that, by innate structure or experience of 
the individual, certain patternings of physical stimuli will be effective 
as patterned. Only observation can determine this effectiveness. In ob- 
serving the behavior of an animal or of a person, what we note is not 
the mere availability of the physical complex but its effectiveness as an 
occasion for response. The term “functional stimulus” might be used to 
indicate that our datum is more than the physical stimulus, but the use 
of the same word would lead to confusion. It is here proposed that a 
word already in common use might be adopted and the functional stimu- 
lus be referred to as a cue or as a signal. A cue C might be defined as a 
physical stimulus which is (or has been in a similar situation) observed 
to alter response. We speak of animals as responding “to the sight of the 
lever,” “to the sound of the food pellet,” or of a person as responding “to 
the hearing of his name,” or “to the sight of a nonsense syllable.” We are 
assuming in both animals and man a previous experience that has made 
a class of physical stimulus patterns, often with great diversity in its 
members, serve as the effective occasion for response. We assume that 
the pattern of physical stimuli is discriminated, normally as the result of 
previous experience. In human learning, and there is reason to believe 
that the same applies to animal learning, this means that the pattern 
be perceived in its own right. The gestalt psychologists emphasized per- 
ceptual response, but they avoided any speculation as to how this came 
about. We must agree that any account of going behavior must recog- 
nize this state of affairs. 

The limitation of functional stimuli C to those patterns that are 
observed to be effective in initiating response does not mean that this 
effectiveness must be verified on each occasion. It is possible to establish 
agreement on general effectiveness and to assume that a given physical 
stimulus will be a functional stimulus. In Pavlov’s experiments he used 
bell, buzzer, touch, presentation of food, all of which could be thus 
assumed to be cues or signals. The observer's role in establishing a 
stimulus S as functional does not need confirmation on each occasion. After 
this definition, it is perhaps needless to point out that it is presence or 
absence that is the item of observation, and not intensity. 
In my Psychology of Learning it was suggested that we must recog- 
nize the role of pattern as such in stimuli and also that patterns must be 
responded to in their own right to act as associative cues [6]. They must 
not only be discriminable, but discriminated. Nonreaders, for instance, 
are generally found not to see the letters or words that they are pre- 
sumably looking at. Fernald had remarkable success in bringing about 
the necessary discrimination by compelling the child to trace with a 
finger the pattern of a word written in large characters. The association 
was then established, usually without requiring a repetition. Tracing 
compelled notice. Notice made possible the association with the utterance 
of the word. 

Intervening variables. By this time it has become clear that the term 
“variable” is, except in a very limited sense, foreign to the line of thought 
here expressed. The essential feature of both independent and dependent 
“variables” is their presence or absence. The same is true of such inter- 
vening variables as may by others be demonstrated to be involved in this 
exposition. As variables they are capable of assuming only the values 
one or zero. In the application of the present “system” to complex prob- 
lems of the acquisition of skills, the variable N, representing a count of 
repetitions of an event or members of a class of events, is clearly involved. 
The other variables are not established by counting but by recognition 
of presence or absence. 

Others have maintained that the writer’s use of movement-produced 
stimuli is as an intervening variable, though there is some tendency to 
classify this as a hypothetical construct. Another hypothetical construct 
could be added, the serial response. Both of these are often used for the 
explanation of such phenomena as generalization, remote association, the 
elaboration of the notion of “successful” movement, and in other ways. 

Movement-produced stimuli are not regarded as differing in any 
essential from other stimuli either in their sensory modes or their role 
in constituting cues or signals. They may be the objects of attention and 
therefore dominate action. They enter into a feature of behavior to 
which modern psychologists devote no research, although it must eventu- 
ally be studied and explained. This is the familiar phenomenon of re- 
sponses (both specific movement- and act-responses) which appear to be 
maintained without attention—which become self-sustaining. These are 
involved in all skills and in all habits. We are forced to recognize a cer- 
tain character of independence in them which throws some doubt on the 
adequacy of the language which makes integral response the final basis 
of an analysis. Henry Rutgers Marshall in his book Consciousness [11] 
observed that a mark of consciousness was the close relation between 
what we were conscious of and what will here be called integral response. 
We judge a cat to be conscious of an object when its integral behavior 
has reference to that object and is a response to it. This is a close 
approximation of the point of view here being stated in terms of attention 
and total response. But in view of the relative independence of certain 
local systems of serial response, such as the repetitive movements all of 
us make without noting them or remembering them, it is tempting to 
add a category of detached movement responses which are evident 
throughout a varied succession of actions. These movement details be- 
come conspicuous when they bring disaster, when we do not “look 
where we are going.” It is possible that the linguistic solution is to rec- 
ognize that responses of attention form a special category of responses. 
These are subject to a system of change (continuous ranging or scan- 
ning) that is relatively independent of movement routines of other sys- 
tems. They are also subject to elicitation by intense stimulation of any 
sort and to the simultaneous presence of cues which are signals for in- 
compatible integral responses. Reaching for one’s keys is a serial response 
which can be executed without attention and not later remembered (no 
new associations formed), but if the keys are not in the usual pocket, 
the interruption of the automatic movement elicits attention. 

Serial responses have been used in explaining rewarded behavior. If 
we assume that a series of movements once executed establishes each 
movement phase (and its results in exteroceptive and interoceptive 
stimulation as well as proprioceptive) as the cue for the succeeding 
movement, the series may be treated as one response. Skinner suggests 
that such series are formed in respondent conditioning, and only the 
first movement remains to be explained in terms of operant conditioning. 
My own suggestion is that the first movement has its signal if we look 
for it and the assumption of this will lead to its discovery. 

The strong interest in automatic recording instead of observation 
has led psychologists of the present generation to overlook the complica- 
tions involved in learning. I believe that advances in understanding what 
takes place in learning can be made only by patient observation of the 
details of the behavior involved. Movement-produced stimuli and serial 
movement responses are open to observation as are the encountered 
stimuli which originate in the environmental situation, but their obser- 
vation requires more than recording the instant of a bell or a pin prick, 
or a shock, or a light flash. They require looking at the animal. They 
are not, however, as open to the experimenter’s control as are the 
sounds and lights so much used as signals in current research. 

In the present use of movement-produced stimuli and serial response, 
these concepts are perhaps more to be described as hypothetical 
constructs than as intervening variables as these terms are used by the 
authors of Modern Learning Theory. In the writer's view they are neither 
“intervening,” in the Tolmanian sense of potential reduction to 
independent variables, nor are they hypothetical. Both are open to observa- 
tion if observation is directed at them, but both, in most experiments on 
association, would be assumed rather than used as data for practical 
reasons. 

The use made of attention assigns that to the same equivocal posi- 
tion as intervening variable or hypothetical construct. Like movement- 
produced stimuli and serial response, attention intervenes in the causal 
chain between stimulus and response, but is not reducible to the con- 
tiguity formula but ancillary to it. 

It would be very gratifying, from a systematist’s point of view, if 
attention would behave as a simple entity, reducible in some manner to 
a number and so capable of entering into a formula. We make an as- 
sumption that is very dangerous when we go so far as to speak of what 
attention can do. In the strictest sense, “attention” names a variegated 
class of responses which have in common only the fact that they involve 
movements of orientation, changes of posture, the inhibition of move- 
ments that interfere with orientation of sense organs, and minimize dis- 
traction. All that we can safely assert is that there are such responses and 
that they represent a high degree of integration or coordination. Some 
of them are native responses to certain classes of physical stimuli and 
can break in upon going behavior. The tendency of an infant’s eyes to 
follow a moving light, or the general behavior of listening when a sound 
“stands out” from the general background of noise, is an illustration. 

Attention-value can surely be acquired through learning, and this 
does not need demonstration. Many of the movements of attention are 
comparatively independent of going behavior. The chief characteristic 
of the general class of responses we call attention is a tendency to be 
centered upon some restricted area, though some division of attention 
is possible. We may be listening for an absent noise or watching for an 
object or movement not yet visible. Another characteristic is the be- 
havior of scanning, sharply reduced and centered when some learned or 
unlearned cue results in fixation. Arne Towe, in an unpublished doctoral 
thesis, has some evidence that on successive exposures to a visual pattern 
which has been made a signal for pecking in pigeons, a bird varies the 
“aspects” of the visual pattern to which it is responding. 

The importance of attention in determining response by the control 
of the signal has been well stated by Broadbent [1]. The importance of 
attention for learning theory lies in the fact that all theories which use 
stimuli as the predictive antecedents of response represent the response 
as occasioned by the signal, whether the connection is established by as- 
sociative contiguity or through reinforcement. Attention, involving the 
nonreception of a signal, can, for S-R theories, prevent response. Strong 
connection, if we believe we have a measure of associative strength other 
than probability of occurrence, would not produce any more response 
than weak connections. 

Responses to signals which take the actor’s attention away from the 
signal prevent, for the time being at least, the attachment of new re- 
sponses to the signal. It is possible that this describes the essential nature 
of reward and reinforcement. They shut off the signal so that its last 
association is the response which produced the diversion. A response 
which removed a drive, if we speak in terms of drive, would operate to 
the same effect, by rendering the actor inattentive to the classes of stim- 
uli to which the actor’s condition contributed attention-value. Broadbent 
suggests that the role of association in learning is in a sense subordinate 
to the role of stimulus selection. Both are involved in any acquisition of 
skill or learning of an act. It is possible that when an animal’s response 
to a continuing stimulus pattern is interrupted by an attention-getting 
stimulus outside the one being responded to, this might leave the re- 
sponse to that stimulus unaffected; whereas the interruption of response 
by such a cause as fatigue while still attending to the pattern would make 
the pattern a signal for avoidance. 

The distraction or interruption of attention is, of course, not the only 
way in which a signal may be removed. Action may directly remove the 
signal, or remove the actor from the signal. 

Dependent variables. What we attempt to predict in going behavior 
is action—response. All response is made up of movement brought about 
by effectors or of glandular secretion. The muscles and glands are the 
organs of response. Every instance of response is analyzable in terms of 
movement; but the classes of response on a psychological level, as dis- 
tinct from the reflex movements which are in the domain of physiology, 
are defined and most easily observed in terms of their end results. We 
recognize the writing, but if there had been no ink in the pen we could 
not without extraordinary measures specify the response or recognize the 
movements of hand and arm that produced it. A description of the action 
of the individual muscles concerned would be hopeless confusion. 

There is here the ground of a separation between psychology and 
the natural sciences. The chains of cause and effect, the chains of con- 
nection between stimuli and responses, have an area not open to obser- 
vation by known methods. End results are attainable by a variety of 
movement sequences, a variety that does not appear in the definition 
of the act, which is made in terms of outcome. Our description is on a 
level different from movement. What we need to know is how to predict 
acts. Man, viewed as a machine, learns movements, not acts. The occa- 
sion for muscular contraction is the arrival of nerve impulses at the mus- 
cle end plates of motor nerves. These impulses are transmitted from the 
central nervous system: their ultimate occasion is the stimulation of 
sense organs, though the brain and central system have complex and unexplored 
functions in the coordination of neural activity and may actually 
serve to originate, in some sense, the outgoing impulses. The chain 
of cause and effect therefore ends in muscular contraction. The per- 
formance of an act on any one occasion represents one chain of events 
in sequence. But successive performances of an act may involve sequences 
very different in nature. 

Whatever the mechanism of learning, it must establish a causal chain 
of connections between stimuli and movement patterns. But psychologi- 
cal observation does not apply to movements as a rule. Horton and I, in 
an observation of cats escaping from a puzzle box, concluded that each 
cat accomplished the act of escape by a limited repertoire of movement 
series, of alternative series in which the specific series used could be 
clearly related, in a few instances, to differences in the stimulus situa- 
tion [7]. We were tempted to generalize this observation and conclude 
that the movement pattern of escape was initiated by the specific pattern 
of stimuli encountered. No two cats used any very similar movement 
series, but any one cat used at least one series over and over. There were 
many repeated movement series that did not lead to escape. 

Psychologists, in general, have not been interested in movement in 
observing learning. They record the end effects—escape, arrival at a 
goal box, number (but not nature) of “errors.” They may go as far as 
to put an animal into an opaque box which hides the movement series 
from the observer as effectively as would a blindfold. Our conclusion 
was that the causal chain leads from physical stimulus to Rmt, meaning 
by Rmt the integral movement response of the animal. 

Just as with practice new partial stimulus patterns S come to be dis- 
criminated and to attain status as cues C, a similar process is observable 
in response. I have in the past suggested that partial or local features of 
response may become independent of the total pattern. I would now state 
the matter somewhat differently. 

It is these local features that we observe and record. Recurring fea- 
tures of response, for which we have names, are usually all that is 
noticed. For the total complex we have no names. Muscle groups like 
those involved in speech or in manipulation can develop patterns that 
are consistent with a variety of total behavior. We can learn to talk 
while driving, though the notices often placed above the driver's seat 
in the bus cautioning the public not to engage the driver in conversation 
point to limitations in this local independence. Speaking one series of 
numbers and writing another at the same time is impossible without in- 
terference. Most animal studies in learning note one feature only, al- 
though it is only one small aspect of a response of the whole organism. 
Men can learn to add numbers sitting or standing. Few men could extract 
the square root of 2 while dodging traffic at an intersection. As a 
rule our absorbed interest in noting one detail of the situation keeps us 
from noting others. 

The partial or subresponse should probably be acknowledged as a 
part of a total response, and such a performance as uttering the same 
word sitting and standing acknowledged as two distinct Rmts or integral 
movement responses. We might distinguish between the Rmt and the 
detail of such a response in which we happen to be interested, the Rmd. 
The detail may be the same or equivalent in two different integral re- 
sponses, Rmd and R’md, as in pronouncing the same word in two differ- 
ent postures. It is probable that learning this independence of local re- 
sponse involves learning a class of integral responses which have the de- 
tail or subresponse in common. Learning an act Ra is a very different 
matter and involves establishing a class of movements Rmt, R’mt, 
R”mt . . . which all achieve the same end. An act is a class of movements 
having as their defining feature the attainment of the same end 
result. The learning of an act involves the presumptive association of a 
class of cues, C, C’, C” . . . with the members of the series, Rmt, R’mt, 
R”mt . . . . It is the essential feature of acts that they are adapted to 
the variations in situation. 

Association by contiguity. The writer is well aware of his failure 
to lay down specifications for the data of learning which are sufficiently 
precise to meet the requirements of laboratory experiment. The present 
section does not offer a solution. The point of view taken is nevertheless 
felt to be in some sense compelling. 

The sense of the word “cue” here described represents almost uni- 
versal practice among psychologists. An apparent exception is Skinner's 
limitation of the meaning of “stimulus” to measures taken by the ex- 
perimenter. Since Skinner obviously chooses his measures for their tried 
effects on the reading of a mechanical counter or steps in a kymograph 
record, and since he assumes, as we would, that the animal made move- 
ments leading to these effects, his use of the word “stimulus” proves only 
an apparent exception. Another exception is the current practice of as- 
suming that the behavior to be conditioned in instrumental conditioning 
or operant conditioning has no cues because none are in the record, and 
to infer from this that instrumental conditioning involves a separate kind 
of learning. 

As defined, Ra (act) and Rm (movement) represent uses of the 
word “response” in current ambiguous use. Skinner notes some detail of 
the end result, Rad, whereas Pavlov noted a detail of the response, Rmd. 

The principle of association by contiguity does not offer as great a 
chance for agreement as could be reached in specifying the nature of 
stimulus or cue and response. The principle has taken many forms after 
the adoption of the restriction to behavior made the older statement in 
terms of the association of ideas unsatisfactory. In the tradition of C. 
Lloyd Morgan and Hobhouse and Thorndike it became a principle of 
association of cue and action resulting in the combination of contiguity 
and reinforcement. Hull wrote in that tradition. With Pavlov, the prin- 
ciple became a statement of association by contiguity only. Pavlov be- 
lieved that the repetition of CS and US together strengthened the associ- 
ation, and he had no reference to a need for confirmation through ef- 
fect. Razran is dealing with this confusion in the use of the term “rein- 
forcement” in a manuscript in preparation. 

There has also been disagreement on what it is that is associated. In 
C. Lloyd Morgan and Hobhouse it was a stimulus, but whether S or C 
was not made clear, and a response, which sometimes would be characterized 
as a perception, sometimes as a movement, and sometimes as an act. 
Skinner is the first behaviorist to face the issue of the nature of the stim- 
ulus squarely and decide in favor of S in the form of an experimenter’s 
manipulation of the environment; but having faced the issue he might be 
said to have walked away without solving it from the point of view of 
theory. The gestalt psychologists recognized association only in the dis- 
guised form of a tendency of insights once formed to be repeatable. It 
is here suggested that a possible statement for use would be: A cue C 
previously followed by an integral movement response Rmt which does 
not include the detail d in which we are interested, will, after it has 
occurred immediately before an integral movement response R'mt which 
does involve d, be on the next occasion followed by R'mt and not by 
Rmt. 
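
The statement above can be read as a replacement rule: the integral response most recently made in the presence of a cue displaces the one previously attached to it. The following is a minimal sketch in Python, assuming (as the text does not) that cues and responses can be represented as discrete labels. The class name ContiguityLearner and its methods are hypothetical and serve only as illustration.

```python
class ContiguityLearner:
    """Toy model: each cue signals the last response made in its presence."""

    def __init__(self):
        # cue label -> label of the integral movement response last paired with it
        self.last_response = {}

    def pair(self, cue, response):
        """Record that `response` occurred immediately after `cue` was noticed."""
        self.last_response[cue] = response  # replaces any earlier association

    def predict(self, cue):
        """Return the response expected on the next occurrence of `cue`, or None."""
        return self.last_response.get(cue)


learner = ContiguityLearner()
learner.pair("C", "Rmt")     # C was previously followed by Rmt (lacks detail d)
learner.pair("C", "R'mt")    # C then immediately precedes R'mt (includes detail d)
prediction = learner.predict("C")   # the later pairing, R'mt, is what is predicted
```

The sketch leaves out everything the rule presupposes about attention and going behavior; it shows only the all-or-none replacement of one association by the next.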

An alternative formulation has, in my opinion, a number of advan- 
tages in intelligibility and still embodies the essential features. This al- 
ternative is: what is being noticed becomes a signal for what is being 
done. 

The first phrase, “what is being noticed,” conveys an adequate rec- 
ognition of the active part that the organism’s own activity plays in the 
selection of physical stimuli through attention. The word “signal” is used 
as a synonym for “cue” and probably reflects the increasing use of “sig- 
nal” in communication theory. “What is being done” is to be taken in 
the primary sense of integral response. The practical use of the rule can 
be extended to cover acts (with a great loss of predictive specificity) 
when the act response is understood to include movement responses 
which serve to restrict attention to a limited field (and thus to restrict 
response) until some end movement is accomplished, and when previous 
learning has established a class of integral movement responses all lead- 
ing to the removal of the maintaining cues. 

Hull has packed as much as this into his S (stimulus) when he defines 
it as the stimulus energy which evokes a response on the basis of 
a previously formed habit, but this emphasizes the energy of the physical 
stimulus, whereas I believe the energy to be of minor significance as 
compared with the role of pattern. If the pattern operates at all or is 
above the threshold, energy can vary extensively without altering the 
essential effect. We respond to the sight of a friend in a wide range of 
illumination, for instance. 1 

Attention becomes, in the present account, the point at which learn- 
ing occurs. Through attention-getting stimuli, the routinized movement 
established in past learning can be broken into and new series established. 
Attention becomes routinized along with the integral response series. We 
think of the behavior as mechanized or routinized. But attention is open 
to distraction from intense stimuli or stimuli not part of the learned 
series. The omission of stimuli that are part of the series may serve to 
interrupt. An anesthetized finger would in all probability seriously inter- 
fere with playing a well-memorized piece on the piano, as would also 
a burned finger tip. In writing by hand, if attention is diverted from the 
script to the moving fingers, writing becomes difficult or impossible. At- 
tending to the script is a part of the routine. We tend to think, when 
we use the word “attention,” not of its role in behavior already learned, 
but of its conspicuous role when routine is interrupted. 

The statement of the rule of association given above involves some 
formidable assumptions, some concerning the nature of cues, some con- 
cerning the nature of responses, and some concerning the role of at- 
tention in behavior. It will be evident that the writer is engaged in 
clearing up his own difficulties rather than in expounding a theory. The 
rule that what is being noticed becomes a signal for what is being done 
is even more of an oversimplification than the rules which physics 
offers as applying to its system of ideal solids, liquids, and gases, which 
are only models for use in understanding the complexity of physical 
events. 

The following is an attempted listing of the more conspicuous of these 
assumptions. They are not formal postulates. 

1. Patterning of physical stimuli is effective as such, as distinct from 
the effects of degrees of intensity or the summation of the effects of 
stimulus elements. This is not to be confused with the gestalt notion that 
patterns have effects independent of the specific stimuli involved. 

2. The acceptance by an observer of a given pattern of physical 
stimuli as a cue for the observed organism will depend on data from the 
past history of the organism or on present observation of the perceptual 
response to the cue. 

3. In the higher animals, the effectiveness of physical stimuli is 
governed by a class of responses called attention, which may restrict 
effect by avoiding the orientation and sense organ adjustment required 
for effectiveness, or by involving postures allowing or excluding particular 
responses to the stimuli. Attention has certain natural occasions including 
intense stimuli, sudden change in situation, movement, and others. 

Attention, being part of the integral response, is conditionable as are 
other detailed features of response. The physical stimuli present in at- 
tention become cues for the action elicited. 

With intense stimuli or abrupt changes in stimulation, attention may 
take the form of fixation, and behavior be limited to response to a very 
restricted area of the physical situation. 

4. When two cues that have been associated with integral responses 
which have incompatible movement patterns are both present, action is 
withheld and the movements involved in attention become pronounced. 
Attention includes behavior which may be called scanning and takes the 
form of rapid changes in sense organ orientation. 

5. At any moment the class of integral movement responses possible 
is limited by the going action. Not only the act of running but the 
patterns of muscular contraction involved in running are made impossible 
by sitting in a chair. Running movements are dependent on “feedback” 
from their own results. 

Breaking in upon present action requires normally a diversion of 
attention by natural or learned cues, and the assumption of a state of 
readiness or transition. Such a response as stopping running, which is 
a complicated achievement, cannot be elicited unless an animal is 
running. Sheffield [13] found that the animal's going behavior at the 
time a physical stimulus was introduced, when added to the record, made 
possible a greatly improved prediction. No cue for stopping can be made 
effective when an animal is standing or crouching. This holds as well for 
reinforcement theory as for contiguity theory. There is no response 
present to reinforce. 

The class of movement responses included in the general term “at- 
tention” involves changes of stance and of readiness. Practical training 
recognizes this. The military use of commands to get attention, pre- 
paratory commands, and commands of execution is based on the fact 
that going behavior imposes strict limitations on the movement responses 
elicitable at any one moment. 

6. It is, therefore, assumed that rules which do not take into account 
what the animal is doing when stimulated will not be descriptive of the 
phenomenon of association. Present behavior is assumed to be determined 
in part by movement-produced stimuli which include exteroceptive, 
interoceptive, and proprioceptive stimuli. These the observer infers from 
the seen behavior. Even the judgment of an observer that certain visual 
patterns are acting is an interpretation of the observed orientation of the 
animal. It cannot be inferred from the availability of the patterns in the 
environment. 

7. The complexity of the nexus of determiners of action requires 
that prediction allow for high degrees of error. Most instances of as- 
sociative learning are cited after the fact, and do not constitute pre- 
diction. I believe that it was Wickens who remarked many years ago 
that the presence of an associative cue appeared to be a necessary but not a 
sufficient condition of the appearance of the associated response. If we 
are considering absolute and not contingent prediction this is of course 
true. But even in this sense the rule of association would have a high 
practical value because it would lead to the discovery of effective cues by 
limiting our area of search. 

8. An assumption that involves stimuli, responses, and the nature 
of attention may be added. It may be phrased in common terms: what 
is being noticed, as a response is elicited, becomes a potential cue for 
that response. This involves assuming that the central nervous system 
in some fashion achieves the integration of a specific pattern of physical 
stimuli as a cue and that this cue selects the response associated with it 
from then on until the cue has been associated with another response. 


INITIAL EVIDENTIAL GROUNDS FOR THESE SUGGESTIONS 


It has by this time been made evident that the general position taken 
here has grown out of common knowledge and a restricted class of ex- 
periments in learning. Most of the experimentation on learning has, par- 
ticularly since Pavlov, attempted to relate physical stimuli directly to acts. 
These are ideal data for the laboratory in the sense that they minimize 
the role of an observer’s judgment and can be made to record themselves. 
They deal with the rules of immediate sequence in behavior and, 
therefore, promise to furnish generalizations that are in some sense basic 
for the understanding of more complicated behavior. My first reduction 
in enthusiasm over this approach came in failures to condition certain 
elementary response details like the knee jerk, and in the mixed results 
that Yacorzynski and I found in the conditioning of an arm flexion 
evoked by shock. A winter’s research attempted to discover whether visual 
patterns could be associated with nonsense syllables, when exposed in the 
periphery of one quadrant of the field of vision, and the response be 
evoked by the figures in another quadrant. I found that an exposure too 
brief to allow fixation to travel from the fixation point to the figure did 
not establish the association in the original quadrant and that the figure 
could not be named unless visually explored. Recognition when explora- 
tion was possible was contingent on noticing “features” such as a sharp 
angle “like a nose” and features usually describable in words. 
Experiments in such areas as maze learning or problem solving had 
followed the lead of Morgan, Hobhouse, and Thorndike and had been 
built around the general conception of learning as improvement. The 
attainment of a goal by means unspecified and unrecorded appeared to 
involve much more than the strengthening of a response, and did not 
answer the question of how the specific movements are achieved. On re- 
reading Thorndike, it occurred to me that every goal achievement is 
accomplished by specific means and that the real problem is how the 
organism learns, not general goal achievement, but its own goal achievement. 
In our puzzle-box watching, Horton and I found that cats in 
general learn to escape and that curves of learning, both for all the 
cats and for individual cats, could be plotted from the data by over- 
looking what individual cats did and paying attention only to such data 
as elapsed times. By our observation sessions and by an examination of 
our photographs, both still and motion pictures, our opinion was 
strengthened that what each cat learned was specific series of movements 
which are altered by many repetitions, very often radically altered, and 
by new movements which might or might not reduce the time for escape. 

Another source of subjective satisfaction with the concepts of as- 
sociative learning by contiguity was their apparent adequacy for 
describing the successful cases of interference with children’s habits in 
some hundreds of cases in Smith’s clinic. The profitable rule for inter- 
ference appeared to be to attempt to name and specify the cues re- 
sponsible for the embarrassing behavior and by any available device seek 
to produce other and unobjectionable behavior in the presence of the 
cues. This was an astonishingly useful rule in the hands of a skilled 
clinician wise in the ways of children. The principal topic in the depart- 
ment for many years was the analysis of habit changes in these terms, 
of course, usually after the fact. 

An early concern with learning curves (Ebbinghaus) which were so 
popular in the 1920s had resulted in the conviction that they were 
functions of the material to be learned rather than the descriptions of 
any basic phenomenon. The realization that even nonsense syllables involve 
previously acquired perception, and that associations are enormously 
facilitated by perception, led to thinking in terms of cues rather 
than stimuli. A strong behavioristic bias, dating from Singer’s paper, 
guided thinking toward the explanation of movement as the direction to 
pursue. The question can always be asked: how did this movement 
pattern reach the muscles? Busse’s Geist und Körper compared the 
popular notions of interaction of mind and body with a train in which 
front and rear are joined not by a coupling but by feelings of amity be- 
tween the conductor in the rear and the engine driver in the front. The 
search for the coupling has not yet been entirely successful, but there is a 
strong bias toward a physical chain of connection that is not satisfied with 
“reward” explanations. 

As data, cue and response are not comparable to the instrumented 
data of physics. They are subject to more errors of observation and their 
connection is subject to interference, over which control is limited. Every 
good lecturer, however, can establish on one occasion firm associations 
in students between the lecturer’s own spoken words (physical stimuli 
which experience shows will operate as cues) and implicit verbal re- 
sponses in his audience which will be testable in a later examination, 
and with enough certainty to offer a strong presumption that the student 
who does not have the answer was absent from the lecture. Paul Fields’s 
rats [4] acquire series of correct choices among successive rows of pictures 
that might tax the lecturer. The experiment in which a pairing of a 
stimulus (not verified as a cue) and a response detail is repeated 150 
times does not appeal to me as an observation of any basic type of 
learning. 


FORMULATION OF METHOD 


The suggested rule of learning, that what is being noticed becomes a 
signal for whatever is being done, has little resemblance to the predictive 
equations of physics. In actual use, or in any attempt to verify it or to 
assess its predictive error in specific classes of instances, it involves data 
from the past history of the actor as well as data open to present observa- 
tion. “What is being noticed” cannot be represented by a number and is 
the object of a consensus of observers. Conventional learning experiments 
avoid this issue by using stimuli that experience has shown to be uni- 
formly effective in procuring the actor’s attention: signals such as shock, 
sudden noise, tapping a rat’s tail, a sudden bright light. These are 
“built-in” eliciters of attention, and their use enables the experimenter 
to forget that, in the course of ordinary learning, attention depends on 
past experience with the patterns of stimuli and on the perceptual reaction 
that has resulted. The experimenter is also enabled to forget that 
even these extraordinary stimuli are probably reacted to as perceptual 
cues and that this reaction may mediate the final response. In generalization, 
for instance, the fact that a touch at a spot on the left side may 
evoke a conditioned response, although never previously associated with 
feeding, may depend on the association of touches in that area with the 
perception of “an object on the left” and with learned movements of 
attention or defense that are practically identical for nearby stimuli. The 
confused and irregular results of experiments on generalization may be 
explainable in terms of past association and not in terms of a uniform 
function for which a general equation could be written. The confirmation 
of this point of view would demand far more elaborate observation than 
current research provides. 

The tenses of the verbs “is being noticed” and “is being done” in- 
volve a certain vagueness about the exact instant of the word “becomes.” 
Behavior does not consist of a series of disparate moments; it is a flow of 
change. Both stimuli and movement responses may be extended over 
periods of time and the moments we choose for observation, the cross 
sections of the time series of events, lose much of the significance of what 
is going on. All responses are responsible for series of stimuli. We pick 
out certain distinguishable features for observation and record. The ob- 
server’s judgment and practical convenience set the nature of the data. 
The exact time relations are thus hard to establish. The experiments in 
remote association, which in some areas establish an optimum effective 
period of the order of a half-second between cue and response, prob- 
ably represent not cause and effect in the sense of immediate succession, 
but an interval that allows time for attention, perceptual response to the 
cue (appropriate movement or minimal adjustment), and the cessation 
of interfering activities. Longer periods allow distraction and the intro- 
duction of interfering behavior. They may not at all weaken the “con- 
nection” between cue and response but render less probable the 
occurrence of the cue. Perceptual cues take time to “mature,” as is indicated 
by certain lines of experimentation on reaction times. Such hypothetical 
links in the chain of causation are open to experiment, but will 
require arduous and detailed observation. 


QUANTITATIVE PROCEDURES 


The consistency of observer judgments is measurable in terms of 
statistical methods. On this basis we can assume that many types of 
physical stimuli can be used as data because experience has demonstrated 
that all observers will admit that they are noticed. The intense stimuli in 
current use fall into this class. Certain other classes of physical stimuli 
which are potential cues assume in different subjects a common ex- 
perience (e.g., printed nonsense syllables) or assume that an individual 
will, on successive occasions, react to the same aspect of the physical 
pattern. For any pattern or class of patterns this also is open to measure 
in terms of past observations. 
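
One way such consistency might be measured is sketched below in Python. The choice of statistic is an assumption: the text names no particular method, and Cohen's kappa is used here only as a familiar chance-corrected index of agreement between two observers. The function name cohen_kappa and the trial records are hypothetical.

```python
def cohen_kappa(judgments_a, judgments_b):
    """Chance-corrected agreement between two observers' yes/no judgments."""
    if len(judgments_a) != len(judgments_b) or not judgments_a:
        raise ValueError("need two equal-length, non-empty judgment lists")
    n = len(judgments_a)
    # Proportion of trials on which the two observers gave the same judgment
    observed = sum(a == b for a, b in zip(judgments_a, judgments_b)) / n
    # Agreement expected by chance from each observer's own rate of "noticed"
    p_a = sum(judgments_a) / n
    p_b = sum(judgments_b) / n
    expected = p_a * p_b + (1 - p_a) * (1 - p_b)
    if expected == 1.0:
        return 1.0  # both observers gave the same constant judgment
    return (observed - expected) / (1 - expected)


# Hypothetical records: True = "the animal noticed the pattern" on that trial
observer_1 = [True, True, False, True, False, True, True, False]
observer_2 = [True, True, False, False, False, True, True, True]
kappa = cohen_kappa(observer_1, observer_2)
```

A value near 1 would support treating the stimulus as "noticed" by common consent; a value near 0 would indicate that the judgment is no better than chance agreement.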

Patterning itself can be measured, in certain classes of patterns, in 
terms of number of identical elements necessary for equivalence as cues, 
or in terms of the effective proportion of elements (e.g., the analysis of 
closure), but in most experiments, if the physical stimuli are above 
the threshold as a pattern, the significance of the physical pattern for 
the experimenter interested in learning is exhausted in noting the 
presence or absence of the general pattern. There is, of course, an upper 
limit as well as a lower. Both require observer judgment, and psychological 
significance is restricted to the range between threshold intensity 
and the point where other factors change the cue value. Between 
these points it is a matter of presence or absence. 


FORMAL ORGANIZATION 


The writer many years ago published an article entitled Formal Logic 
and Logical Form [5] in which the rules of logic and deductive reasoning 
were held to be practical conventions. This view would contraindicate an 
interest in formal organization of a system except in those fields in which 
long experience has gradually established wide areas of general agree- 
ment on method and fact, as was true of geometry in Euclid’s time and, 
centuries later, in the fields of astronomy and physics. Psychology has 
large undigested aggregates like psychoanalysis and the data on public 
opinion, in which basic agreements have not been worked out. Present 
day clinical psychology has a strong admixture of ethics and folkways. 
Even the writers on learning theory are still using definitions of learning 
which are in different universes of discourse and interest. The use of the 
laboratory rat almost exclusively as a subject escapes the complications 
of human learning, not because rat learning is essentially simpler, but 
because we are protected from many aspects of it since the subjects can- 
not contribute their own suggestions. It will be a very long time before 
we are prepared to formalize our account. The problem of reinforce- 
ment vs. contiguity should be settled before embarking on a system. 
When the choice is made too early, and the fundamental definitions and 
categories become official and items that all graduate students must 
master for the purpose of nationwide examinations, we may find our- 
selves committed to unproductive efforts. The scholastics systematized a 
world of unripe knowledge and may thereby have protected it from its 
enemies, but at the same time they denied it the chance for progress. 


AFTERTHOUGHTS ON THE PREDICTABILITY OF BEHAVIOR 


Dr. W. F. Thompson of the Fisheries Research Institute of the Uni- 
versity of Washington has pointed out that the runs of salmon in the 
Columbia River extend over a period of about seven months; in the 
Fraser River, three hundred miles north, the season is approximately five 
months, and in the northern salmon streams in Alaska the season is 
limited to four or five weeks. The runs reflect the climatic conditions 
under which the eggs can hatch and the young fry survive their first year 
in fresh water. The severe limits of survival in northern waters are responded 
to by the development of inherited mechanisms which bring the 
great majority of the run to the mouth of the home stream during the 
weeks when propagation and survival are possible, usually after three 
years in the sea. The frequency curve of return resembles a normal 
probability curve with the mean a characteristic of the “race” adapted 
to the stream. The bulk of the run returns not only to the main stream 
left years before but to the same tributary, even if that is an artificial 
pond. 

If either the date of return or the point of return were narrowly fixed 
by inheritance, in the rare case of a very early or very late summer the 
whole race would perish. In such seasons, only the deviants would have 
offspring. It is Thompson’s opinion that the viability of a race depends 
not only on the development of an inherited return mechanism but on 
provision for inaccuracy which ensures that there will be a few individ- 
uals to take advantage of the changed conditions of climate and locale. 
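
Thompson's argument can be put in rough numerical form. The sketch below is not taken from Thompson; it assumes, purely for illustration, that return dates are normally distributed about the mean date of the race and that survival is possible only within a fixed window of days. The function name, the window boundaries, and the two spreads are hypothetical.

```python
import math


def fraction_in_window(mean, sd, lo, hi):
    """Fraction of a normally distributed run that arrives between day lo and day hi."""
    if sd == 0:
        # A run fixed narrowly by inheritance arrives entirely on one day.
        return 1.0 if lo <= mean <= hi else 0.0

    def cdf(x):
        return 0.5 * (1.0 + math.erf((x - mean) / (sd * math.sqrt(2.0))))

    return cdf(hi) - cdf(lo)


# Hypothetical seasons, in days relative to the mean date of return.
usual_season = (-14, 14)
very_late_season = (20, 48)

for sd in (0, 10):
    usual = fraction_in_window(0, sd, *usual_season)
    late = fraction_in_window(0, sd, *very_late_season)
    print(f"spread {sd:>2} days: usual season {usual:.3f}, very late season {late:.3f}")
```

Under these assumed numbers the run with no spread loses every member in the very late season, whereas the run with a spread of ten days keeps a small fraction of deviants that arrive inside the window.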

There is in sexual reproduction a natural provision for the intro- 
duction of variety through the “shuffling” of chromosomes. Mendelian in- 
heritance can be predicted for the distribution of traits among large 
numbers but not individual cases in more than a few instances of single 
dominant traits. 

It is possible that natural provision for variability, and so for a 
limitation on the prediction of individual instances, is made in behavior. 
In efforts to subject laboratory animals to repeated exposures to the
same situation over and over, we are foiled not only by incomplete con- 
trol of the situation but also by the fact that on each occasion the animal 
is changed by virtue of its previous experience and by certain char- 
acteristics of attention, which has, as in the case of the continuous 
irregular movements of the eyes, its own “reshuffling” mechanism which 
alters beyond our complete control the features of the physical stimulus 
situation available as cues. Hull covered this by allowing for a factor
he called “oscillation,” but this term does not give full recognition to the
introduction of the unpredictable in behavior which is not covered by 
assuming a normal distribution of error in results. Hull’s treatment is 
based on the fundamental assumption that response is a variable 
measured in units; but it is rather the occurrence or nonoccurrence that 
is of importance to psychology. “Oscillation” is not about a point on a 
dimension but from presence to absence of the response presumably 
dependent on the presence or absence of the cue, assuming all else the 
same. 

The attainment of high prediction in the case of a single association 
should be possible following the lead of Pavlov’s experience that better 
control of extraneous stimuli in his new soundproof laboratory with the 
experimenter not visible reduced the number of pairings required for 
uniform response from about fifty to the order of ten. If we take
precautions to have the subject in the same posture or movement and to
ensure attention at the proper moment, it is possible that we will reach
a practical certainty of response with one association. This cannot be 
achieved in maze learning which leaves both factors uncontrolled. It was 
not attainable by Pavlov’s methods which used physical stimuli extended 
over long periods during which changes in attention were inevitable. 

The high predictability of human behavior referred to earlier in this 
chapter attaches not to the elementary event of the formation of an 
association but to the complex results of associative learning that has 
already taken place. Habits involve associations by the score or by the 
thousand. Once they have been established they are extremely stable. The 
making and breaking of habits involves reassociation and new associa- 
tions. It is these elements that are so hard to deal with in the laboratory; 
but it is quite possible that if we undertake a line of careful observation 
and control we shall be in a position to accept or reject or to reformulate 
the rule of learning.


INTRODUCTION


The plan of this paper is to begin with a rather detailed consideration 
of some hypotheses about approach-avoidance conflict behavior as a con- 
crete example of theory construction. I believe that the main problems 
of theory construction are more meaningful if discussed in the context of 
a real example. This more formal section of Part I will be followed by 
some informal remarks on the strategy for investigating significant aspects 
of conflict behavior not covered by the hypotheses. 

In Part II I shall comment freely on a potpourri of topics that have
especially interested me. First I shall direct attention to a number of
changes in conventional S-R concepts which considerably increase their 
power and range. Then I shall consider certain controversial problems 
which need additional theoretical and experimental analysis—e.g., 
creative thought, flexible cybernetic behavior, new trails for research 
on motivation, and theories of reinforcement. I shall touch briefly on a 
few of the main points of my collaborative work with John Dollard on 
social learning, personality, and psychotherapy. Working toward the 
ultimate goal of a basic science of human behavior, we have tried to 
integrate certain contributions of the laboratory, the clinic, and the 
socio-anthropological field study. Finally, I shall deal with the experi- 
mental design required but seldom used to justify intervening variables, 
with implications for psychopharmacology and brain stimulation, and 
with the strategy of a rigorous though qualitative approach to problems 
of quantification. 


Part I. Approach-Avoidance Conflict Behavior*


BACKGROUND FACTORS 


My attitudes towards psychological theory have grown out of an 
early interest and training in physics, chemistry, and biology. These atti- 
tudes were further molded by study with Smith and Guthrie at the 
University of Washington, and especially, by extensive work with Hull 
at Yale. The general method of analysis of approach-avoidance conflict 
and the contents of the assumptions involved come directly from Hull. 

The interest in the topic of conflict behavior is part of a general line 
of development started by courses in personality with Terman at 
Stanford, encouraged by the general atmosphere of the Institute of 
Human Relations under Mark May at Yale, greatly facilitated by a 
brief psychoanalysis with Heinz Hartman in Vienna, and sharpened
and implemented by intensive collaborative work with John Dollard at
Yale.

At present the systematization is at the purely behavioral level, but 
there is no reason why variables at the physiological and sociological 
levels should not be introduced. I believe that, in general, the most 
profitable level of analysis will vary with the specific phenomena being
studied and also with the particular contemporary state of development
of the disciplines involved. In other words, it is unsafe to make sweeping
generalizations about the most desirable level of analysis since a
technological or theoretical breakthrough can completely change the picture.

* Another exposition of a similar topic has been presented elsewhere [63].
Rather than try to paraphrase certain statements which I have already labored
to clarify, I have taken the liberty of plagiarizing a few sentences and even
paragraphs from this earlier article of mine.

Specific attitudes toward theory. Pure empiricism is a delusion. A 
theorylike process is inevitably involved in drawing boundaries around 
certain parts of the flux of experience to define observable events and in 
the selection of the events that are observed. Since multitudinous events 
could be observed and an enormous number of relationships could be 
determined among all of these events, gathering all the facts with no 
bias from theory is utterly impossible. Scientists are forced to make a 
drastic selection, either unconsciously on the basis of perceptual habits 
and the folklore and linguistic categories of the culture, or consciously 
on the basis of explicitly formulated theory. 

Nature of scientific theory. Scientific theory is only an elaboration 
and formalization of common-sense processes of perception and
communication. Theory is trying to make sense out of observations—to
abstract and generalize. The goal of theory construction is to produce a
parsimonious system of symbols (in either verbal or other forms) that 
can be used to make rigorous deductions about some of the consequences 
of different sets of conditions. 

As I see it, there is a continuum from extremely simple to complex 
examples of theory construction. A simple example is a smooth curve
which summarizes a set of empirically determined values. In this case the 
deduction consists of following certain simple rules to read the curve in 
order to predict that a given value on the abscissa will yield a certain 
value on the ordinate. If an empirical equation is fitted to the curve, the 
solution of that equation constitutes the deduction. At a considerably 
higher level of development, the equation may be rational instead of 
empirical. High-school geometry is another familiar example. It consists 
of a set of definitions and axioms, or in other words, terms and rules for 
manipulating the terms. This relatively limited number of basic terms 
and rules can be used to deduce the consequences of the great number 
of different conditions. Every one who has mastered the system will 
agree, for example, on the deduction of the Pythagorean theorem. Such 
agreement is the goal of rigor. 
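
The simplest case described above, an empirical equation fitted to a set of values and then solved to yield a prediction, can be sketched as follows. This is only an illustration: the data points are hypothetical, a straight line is assumed as the form of the equation, and the function names are not drawn from any source.

```python
def fit_line(xs, ys):
    """Least-squares straight line through the points; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(slope, intercept, x):
    """The 'deduction': solve the fitted equation for a given value on the abscissa."""
    return slope * x + intercept


# Hypothetical empirically determined values.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.1, 3.9, 6.2, 7.8]

slope, intercept = fit_line(xs, ys)
print(predict(slope, intercept, 5.0))  # predicted ordinate for a new abscissa value
```

Anyone who follows the same rules on the same values arrives at the same fitted equation and the same prediction, which is the sense of rigor intended in the text.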

Being able to state any orderly relationship, or in other words, to 
summarize a set of data parsimoniously by any method is quite an 
achievement which leads the scientist to believe that he is on a good 
track. The greater the parsimony (e.g., ratio of facts to assumptions), 
the greater the achievement and the more likely the scientist is to con- 
clude intuitively that he is on the track of a fundamental regularity in 
nature. 

The principles of lower-level theories may often be deduced from
the higher-level ones until finally we reach something like the Maxwell
electromagnetic wave theory or the General Theory of Relativity. There 
is obviously an enormous difference between these higher-level theories 
and the simple empirical curve, but in at least certain respects, the dif- 
ference seems to be more one of degree than of kind. 

Function of definitions. For example, in order for any of the fore- 
going types of abstract statements to be useful, the scientist must have 
some relatively unambiguous way of relating the terms in the theory to 
the phenomena that interest him. In order to use the simplest empirical 
curve, he must have some way of relating the values on the abscissa and 
on the ordinate to relevant observables. This process is not nearly as 
simple as it seems, since many assumptions may be implicit in the process 
of observing and defining events and ordering them into even primitive 
scales. The orderliness of the curves and the generality of their applica- 
tion may depend upon the particular type of measurement used—for 
example, whether one uses the expansion of a column of water or of 
hydrogen as the measure on the base line against which to plot other ob- 
servations. The water will yield complex discontinuities at its boiling and 
freezing points. The hydrogen will yield simpler, more orderly curves 
over a much wider range, and hence, is selected as a better measure of 
temperature. 

The linking of the antecedent conditions and consequent deductions 
of the theory to observable phenomena is accomplished by definitions. 
Carnap [19] has called such definitions reduction sentences because they 
reduce the terms of the theory to observables. An operational definition
is a special kind of reduction sentence. Often psychological theories fail 
to be useful because there is no practical way of relating both the ante- 
cedent conditions and the deduced consequences of the theory with 
events that can be identified unambiguously and publicly. 

Carnap [19] has pointed out that many of the definitions which 
scientists use to connect theoretical terms with observables are incomplete.
They may be called partial definitions. The advantage of the partial 
definition is that it does not completely limit the meaning of the term 
once and for all but allows room for expanding the meaning, step by 
step, on the basis of accumulated knowledge. For example, temperature 
may be defined by the expansion of a column of mercury. This definition 
is rigorous in that the appropriate use of a mercury thermometer is
sufficient to determine the temperature of a body; it is partial in that many
other effects, in addition to the expansion of mercury, may be linked 
with temperature and either used as measures of it or deduced as conse- 
quences of it. Examples of other phenomena linked with temperature are 
changes in the electrical resistance of a wire, the voltage produced by 
a thermocouple, and the spectrum of emitted light. As long as any room
is left for adding such other effects as may be usefully related to
temperature, the definition is not complete but partial.

In short, the use of a scientific theory involves three steps: (1) the 
unambiguous connection of a series of observable antecedent conditions 
with the terms of the theory by means of definitions, (2) the rigorous 
derivation of deductions by manipulating the terms according to specific 
rules, and (3) linking the terms in the deductions with observable
phenomena by means of definitions.* When an observable consequence of
a certain set of conditions is deduced in advance, it is called a prediction; 
when it is derived after the fact, it is called an explanation [35]. 

The foregoing three steps are involved in all the examples of theory 
from the simple use of empirically fitted curves to the most advanced 
theories of modern physics. It is to emphasize this similarity that these 
diverse levels of theory have been placed together. 
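
The three steps can be traced in a minimal sketch that uses the Pythagorean theorem, the example mentioned earlier. The ruler readings and the dictionary keys are hypothetical and serve only to mark where definitions link the terms of the theory to observables.

```python
import math

# Step 1: definitions connect observable antecedent conditions with the terms
# of the theory. The readings are hypothetical ruler measurements in centimeters.
readings = {"base_cm": 3.0, "height_cm": 4.0}
base = readings["base_cm"]
height = readings["height_cm"]

# Step 2: rigorous derivation by manipulating the terms according to the rules.
hypotenuse = math.hypot(base, height)

# Step 3: a definition links the deduced term back to an observable, namely the
# reading expected when the same ruler is laid along the hypotenuse.
predicted_reading_cm = hypotenuse
print(predicted_reading_cm)
```

Made before the measurement, the last line is a prediction; made afterward, it is an explanation.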

Limitations of scientific theory. Usually laws formulated by specify- 
ing one observable phenomenon directly as a function of other observ- 
able conditions are found to have less generality than formulations in- 
volving a higher level of theoretical abstraction. The great power of 
good theoretical constructs comes from their generality. Though science 
strives towards generality, it is not necessary to have one all-embracing 
theory. Physics made a great deal of progress while it was still divided 
into separate, more limited theories such as those of mechanics, hydro- 
statics, heat, electricity, and optics. 

Although different theories should not be scrambled illogically to- 
gether in one grand eclectic hash, there is nothing to prevent the scien- 
tist from using entirely different models to deal with different aspects 
of his theoretical and practical problems. In short, rigorous but limited 
models can be extremely useful. 

All scientific formulations are highly selective and relatively limited. 
Thus the laws of falling bodies deal only with masses and forces and 
have nothing to say about the socially important facts of whether the 
falling body is a hydrogen bomb that will wipe out a city or a package of 
supplies that will mean survival to a stranded party. Although the latter 
distinctions are enormously important, they are not dealt with by that 
particular set of scientific principles. Similarly, the following hypothesis 
about approach-avoidance conflict behavior will be limited to only a 
few aspects of the conflict behavior. 

* It should be noted that the same term may appear in both the antecedent
conditions and the consequent deductions. Thus, in the example of the Pythagorean
theorem, the hypotenuse and the base are both measured in the same units of
length. Furthermore, the length of the hypotenuse may be predicted in one
situation but used as an antecedent condition in a different situation.

Science is limited in another respect. In order to make predictions,
one must not only know the principles but also the specific conditions.
Thus an engineer might have a complete knowledge of all of the relevant 
principles of physics but be unable to predict how much it would cost to 
build a bridge until he knew all the relevant conditions—the load to 
be borne, the length of the span, the velocity of winds, and whether the 
foundation was bedrock or quicksand. Most of the physicist’s exact pre- 
dictions deal with quite specific and precisely defined conditions, such 
as a freely falling body in a vacuum with accurate instruments available 
to measure distance and time. No physicist will even attempt to predict 
where a given snowflake will fall in a blizzard, although snowflakes 
are much more common and of much greater practical importance 
than freely falling bodies in a vacuum. Often the engineer apply- 
ing physical science allows himself factors of safety of more than 200 
per cent to take care of uncontrolled variations in the conditions. He 
may also have to supplement theory with those less precisely verbalized 
but highly significant results of experience which may be described as 
art or skill. 

Nevertheless, these relatively limited activities of the scientist are of 
enormous practical value. They have been the essential basis for giving 
the average man of today medical care, access to the world’s greatest 
music, swift travel, the ability to talk to loved ones at a distance, and 
myriads of other advantages beyond the dreams of the most powerful
princes of the past. 

Sophistication of theory and simplification of experiments. In line 
with the need to control conditions as well as to know principles, it is 
my impression that, as our theories of learning and behavior are becom- 
ing more complex and sophisticated, we are being driven to simpler ex- 
perimental situations. As Spence [103] has so well illustrated, the old 
labyrinthian mazes and complex problem-boxes have been found to be 
hopelessly complex and confusing; we are forced into simpler situations 
such as the straight runway and the Skinner box. The classical condi- 
tioning experiments are one such simplified pure case and the principles 
derived from them are believed to be a good first approximation to cer- 
tain useful laws of behavior. The formulation to follow is an example of 
extending these laws to a new situation and testing deductions derived 
from such an extension. 

Plan of the work on conflict. In a modest way, the example to follow 
will illustrate a type of experimental-theoretical work which I believe is 
desirable. Principles which have been abstracted from the results of
simple, controlled experimental situations, and which have been found 
to have a considerable degree of generality, are applied to a new aspect 
of behavior. As needed, additional assumptions are made. First, very 
simple deductions from these principles are tested in very simple
experimental situations. Then, attempts are made to apply the joint action of a
greater number of principles to more complex situations, with additional 
checks at each successive stage of development. 

Role of experiments on animals. It is assumed that the basic mech- 
anisms of learning and conflict are similar enough in man and the other 
mammals so that work on the latter will often generalize to man. More 
specifically, the assumption is that all the psychological processes found 
in other mammals are likely also to be present in man. It should be 
noted that this assumption does not deny the possibility that man may 
have additional capacities which are much less well developed or absent 
in the lower mammals. All that is assumed is that what is found in lower 
mammals will probably be found also in man. 

For certain (but not all) problems it may be desirable to work out 
the laws first in more rigorously controlled experimental situations with 
animals. Although these laws will have to be checked later at the human 
level, it will be easier to check them once they have been precisely formu- 
lated so that one knows exactly what to look for than it would to dis- 
cover them in the complex human situations which are less subject to 
experimental simplification and control. Furthermore, one’s confidence 
in the conclusions from the less-controlled clinical situation is increased 
if the same relationship is found also in the better-controlled animal 
experiments. 

Animal experiments seem to be an especially strategic way to in- 
vestigate certain simple aspects of conflict behavior. On the one hand, 
the variables that appear to be involved (approach, avoidance, pain, 
fear, appetitive drives) seem to be common to both species. On the other 
hand, it is especially hard to subject human subjects in the laboratory 
to carefully controlled, deliberately elicited, strong conflicts of a vital 
nature, but relatively easy to do this with experimental animals. 


FIRST STAGES OF DEVELOPMENT OF SYSTEM 


In the course of an experiment on reaction formation [56], hungry 
albino rats were first trained to run down a short alley for food, then 
given electric shocks at the goal. Such animals showed obvious conflict 
behavior—they would start, go part way toward the goal, stop, and vac- 
illate. The vacillations were characterized by increasingly slower ap- 
proaches interrupted by sudden withdrawals. 

The behavior of these animals seemed worthy of systematic study 
for the following reasons: 

1. The behavior in this experimental situation appeared to be typical 
of conflict behavior in a wide variety of other situations: squirrels in
conflict about taking a nut from a child’s hand, children in conflict about
petting strange animals, bashful boys in love with a girl but afraid to
ask for a date, and various phenomena observed in the clinic. 

2. Evidence from both clinical observations and so-called experi- 
mental neuroses indicated the importance of conflict in producing ab- 
normal behavior. 

3. The experimental situation allowed a high degree of control of 
relevant variables in a conflict that was vitally serious to the subject 
animals. 

4. It seemed feasible to devise a little harness device to record in 
detail an important aspect of the behavior, namely, the responses of ap- 
proach and avoidance. 

5. It was possible to deduce a number of facts about the conflict 
behavior in terms of assumptions, such as Hull’s [36] goal gradient, 
which had already been found useful in dealing with other aspects of 
learned behavior. The fact that the observations seemed likely to fit into 
a meaningful pattern was particularly attractive. 

For the foregoing reasons, it seemed desirable to initiate a program 
of theoretical and experimental analysis of conflict behavior. The new 
approach of trying to analyse the laws of conflict behavior instead of 
attempting to produce allegedly neurotic behavior, seemed likely to lead 
to increased scientific understanding, and to help to bridge the gap be- 
tween the laboratory and the clinic. 

The postulates. It seemed possible to account for a number (but not 
all) of the significant aspects of the conflict behavior in terms of the 
following assumptions: 


(A) The tendency to approach a goal is stronger the nearer the subject 
is to it. This is an application of Hull’s [36] principle of the goal gradient 
and will be called the Gradient of Approach. 

(B) The tendency to avoid a feared stimulus is stronger the nearer the 
subject is to it. This was an extension of the general idea of the gradient 
of reinforcement [80] to avoidance learning. It will be called the Gradi- 
ent of Avoidance. 

(C) The strength of avoidance increases more rapidly with nearness 
than does that of approach. In other words, the gradient of avoidance is 
steeper than that of approach. This was a new assumption necessary to 
account for the behavior of going part way and then stopping. 

(D) The strength of tendencies to approach or avoid varies directly 
with the strength of the drive upon which they are based. In other words, 
an increase in drive raises the height of the entire gradient. This assump- 
tion was necessary to explain the fact that stronger shocks stopped the 
animals whereas weaker shocks did not and also to explain the intuitively 
expected result that stronger shocks would be necessary to stop hungrier
animals. This assumption was a specific application of the general notion
that response strength varies with relevant drive. 

(E) Below the asymptote of learning, increasing the number of rein- 
forced trials will increase the strength of the response tendency that is 
reinforced. 

(F) When two incompatible responses are in conflict, the stronger one
will occur.

Fig. 1. Simple graphic representation of an approach-avoidance conflict.
(Vertical axis: strength of tendency to approach or avoid.) The tendency
to approach is the stronger of the two tendencies far from the goal,
whereas the tendency to avoid is the stronger of the two near to the goal.
Therefore, when far from the goal, the subject should tend to approach
part way and then stop. In short, he should tend to remain in the region
where the two gradients intersect.

It is only for the sake of simplicity that the gradients are represented
by straight lines in these diagrams. Similar deductions could be made on
the basis of any curves that have a continuous negative slope that is
steeper for avoidance than for approach at each point above the abscissa.
From Miller [59].


The last two assumptions also are ones which have been found useful 
in accounting for other types of data. Therefore only one of the assump- 
tions, namely, C, is new. 
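
The postulates can be written out as a small model. The sketch below is only one possible rendering: it assumes straight-line gradients, as the diagrams do for simplicity, and every number in it is hypothetical. The names `Gradient`, `raise_height`, and `dominant_response` are introduced here for illustration.

```python
from dataclasses import dataclass


@dataclass
class Gradient:
    height_at_goal: float  # strength at zero distance; raised by drive (D) or training (E)
    slope: float           # loss of strength per unit of distance from the goal (A, B)

    def strength(self, distance: float) -> float:
        # The gradient is defined only above the abscissa, so it is clipped at zero.
        return max(0.0, self.height_at_goal - self.slope * distance)


def raise_height(gradient: Gradient, amount: float) -> Gradient:
    """Postulates D and E: more drive or more reinforced trials raise the whole gradient."""
    return Gradient(gradient.height_at_goal + amount, gradient.slope)


def dominant_response(approach: Gradient, avoidance: Gradient, distance: float) -> str:
    """Postulate F: of two incompatible responses, the stronger one occurs."""
    a = approach.strength(distance)
    v = avoidance.strength(distance)
    if a > v:
        return "approach"
    if v > a:
        return "avoid"
    return "equilibrium"


# Postulate C: the avoidance gradient is steeper than the approach gradient.
approach = Gradient(height_at_goal=10.0, slope=1.0)
avoidance = Gradient(height_at_goal=16.0, slope=4.0)

for distance in (5.0, 2.0, 1.0):
    print(distance, dominant_response(approach, avoidance, distance))
```

With these assumed values the two lines cross at a distance of 2 units, so approach dominates beyond that point and avoidance dominates inside it.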

Some deductions. The first deductions following from these assump- 
tions are illustrated in Figs. 1-3. It should be noted that the gradients 
cross each other in all these figures. All of the following deductions hold 
only within the range of conditions which allow the gradients to intersect.
It is also assumed that the subject is free to move, is started away from
the goal, and has had enough time to reach his point of equilibrium. 

In the first figure, it can be seen that the approach is stronger than
the avoidance at a distance from the goal, so that the subject should 
start out by approaching, but when he goes beyond the point at which 
the gradients intersect, avoidance is stronger than approach so he should 
stop. The first deduction is: 

1. The subject should approach part way and then stop. It should 
be noted that this behavior of going part way and then stopping can be 
deduced only from the particular assumptions used. If the gradients are
parallel, the subject will be expected either to go all the way up to the
goal or to retreat completely away from it, depending on which tendency
is stronger. If the approach is steeper than the avoidance, subjects
started near to the goal will be expected to go all the way up to it;
subjects started farther from the goal will be expected to retreat until the
strength of avoidance becomes negligible.

Fig. 2. Graphic representation of how an approach-avoidance conflict is
affected by increasing the strength of approach. From Miller [59].
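
The three cases just distinguished (steeper avoidance, parallel gradients, steeper approach) can be checked with a toy simulation. It is a sketch under stated assumptions: distances are whole units in a hypothetical alley 100 units long, the gradients are straight lines with made-up heights and slopes, and the subject moves one unit at a time in the direction of the stronger tendency.

```python
def line(height_at_goal, slope):
    """Straight-line gradient, clipped at zero strength."""
    return lambda distance: max(0, height_at_goal - slope * distance)


def settle(start, approach, avoidance, alley_length=100):
    """Step toward the goal while approach is stronger, away while avoidance is stronger.

    Returns the distance from the goal at which the subject ends up.
    """
    distance = start
    visited = set()
    while 0 < distance < alley_length and distance not in visited:
        visited.add(distance)
        a, v = approach(distance), avoidance(distance)
        if a > v:
            distance -= 1      # approach wins: move nearer the goal
        elif v > a:
            distance += 1      # avoidance wins: move away from the goal
        else:
            break              # equal strengths: the subject stops here
    return distance


# Avoidance steeper than approach: the subject goes part way and stops.
print(settle(80, line(100, 1), line(160, 4)))
# Parallel gradients: all the way to the goal, or complete retreat.
print(settle(80, line(100, 1), line(60, 1)), settle(10, line(100, 1), line(140, 1)))
# Approach steeper than avoidance: the outcome depends on the starting point.
print(settle(10, line(160, 4), line(100, 1)), settle(30, line(160, 4), line(100, 1)))
```

Only the first arrangement yields a stopping point between the start and the goal; the other two send the subject to one end of the alley or the other.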

Figure 2 shows that raising the height of the approach gradient
should move the point of intersection nearer to the goal. The height of
the approach gradient can be raised by increasing the strength of hunger
(D) or by increasing the number of reinforced training trials (E). The
next two deductions follow:

2. Increasing the strength of hunger should cause subjects to
approach nearer to the goal.

3. Increasing the number of reinforced training trials (below the
asymptote) should cause subjects to approach nearer to the goal.

Similarly, as Fig. 3 shows, raising the height of the gradient of 
avoidance should cause the subject to remain farther away from the goal. 
The height of the gradient of avoidance can be raised by increasing the 
strength of fear (D) or by increasing the number of reinforced avoidance 
trials (E). Hence, the next two deductions follow: 

4. Increasing the strength of fear should cause the subjects to remain
farther away from the goal.

5. Increasing the number of reinforced avoidance trials (below the
asymptote) should cause the subjects to remain farther away from the
goal.

Fig. 3. Graphic representation of how an approach-avoidance conflict is
affected by increasing the strength of avoidance. From Miller [59].
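
For straight-line gradients, deductions 2 to 5 reduce to a one-line formula for the point of intersection. The sketch below assumes linear gradients and uses hypothetical heights and slopes; `equilibrium_distance` is a name introduced here, not one from the original work.

```python
def equilibrium_distance(approach_height, approach_slope, avoid_height, avoid_slope):
    """Distance from the goal at which two straight-line gradients intersect.

    Solves approach_height - approach_slope * d == avoid_height - avoid_slope * d.
    """
    if avoid_slope <= approach_slope:
        raise ValueError("avoidance must be steeper than approach (postulate C)")
    d = (avoid_height - approach_height) / (avoid_slope - approach_slope)
    if d < 0 or approach_height - approach_slope * d < 0:
        raise ValueError("gradients do not intersect above the abscissa")
    return d


baseline = equilibrium_distance(10.0, 1.0, 16.0, 4.0)
more_hunger = equilibrium_distance(13.0, 1.0, 16.0, 4.0)   # approach gradient raised
more_fear = equilibrium_distance(10.0, 1.0, 22.0, 4.0)     # avoidance gradient raised

print(baseline, more_hunger, more_fear)
```

Raising the approach gradient brings the stopping point nearer the goal, and raising the avoidance gradient moves it farther away, in agreement with the deductions. The error cases correspond to the restriction that the deductions hold only while the gradients intersect.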

It should be noted that, for convenience of exposition, the gradients 
of approach and avoidance are drawn as straight lines. Exactly the same 
deductions, however, would follow from any types of curves that have a 
negative slope throughout that is steeper for avoidance than approach at 
each value on the abscissa. 

The advantage of the graphic type of exposition and deduction that 
we have employed is that it is simple and readily understood. The dis- 
advantage is that one must draw some particular type of curve, thus im- 
plying a more specific assumption concerning the exact shape of the 
function than is necessary for immediate purposes or desirable on the 
basis of available evidence. Then, this implication has to be denied by a
somewhat awkward disclaimer. It would be possible to make more
rigorous and elegant deductions using either the algebra of inequalities or
symbolic logic. Such a deduction, however, would consume more space 
and be harder for the average reader to follow. 
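
As an indication of what a deduction by the algebra of inequalities might look like, the following is a sketch and not a derivation given in the original work. The symbols $A$, $V$, $D$, $x$, and $c$ are introduced here for illustration, and the raising of a gradient by drive or training is assumed, for simplicity, to add a constant amount $c > 0$ at every distance.

```latex
Let $A(x)$ and $V(x)$ be the strengths of approach and avoidance at distance
$x \ge 0$ from the goal, and let $D(x) = A(x) - V(x)$.

Postulate C (avoidance falls off faster than approach): for $x_1 < x_2$,
\begin{align*}
V(x_1) - V(x_2) &> A(x_1) - A(x_2) \\
\Longrightarrow\quad D(x_2) &> D(x_1),
\end{align*}
so $D$ is strictly increasing in $x$.

Deduction 1. If the gradients intersect at $x^{*}$, so that $D(x^{*}) = 0$, then
\begin{align*}
x > x^{*} &\Longrightarrow D(x) > 0 \quad\text{(approach occurs, by F)}, \\
x < x^{*} &\Longrightarrow D(x) < 0 \quad\text{(avoidance occurs, by F)},
\end{align*}
and a subject started far from the goal approaches as far as $x^{*}$ and stops.

Deductions 2 and 3. Raise the approach gradient by $c > 0$ (postulates D, E):
$D_c(x) = D(x) + c$. Then $D_c(x^{*}) = c > 0$. If $D_c(x') = 0$, then
$D_c(x') < D_c(x^{*})$, and since $D_c$ is strictly increasing, $x' < x^{*}$:
the subject stops nearer to the goal.

Deductions 4 and 5. Raise the avoidance gradient by $c > 0$:
$D_c(x) = D(x) - c$, so $D_c(x^{*}) = -c < 0 = D_c(x')$, hence $x' > x^{*}$:
the subject stops farther from the goal.
```

Nothing in this argument depends on the gradients being straight lines; it requires only that avoidance decline more rapidly than approach at every distance within the range considered.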

Some partial definitions. In order to test the deductions, it was nec- 
essary to relate the terms of the theory to specific experimental conditions 
by partial definitions. Actually, these definitions were intuitively as- 
sumed at first; the following formal statement was made much later. 
Note that whereas the postulates are labeled by capital letters, the par- 
tial definitions are labeled by small ones. 

a. Nearness can be measured by spatial distance in the experimental 
alley. 

b. The animals running to food are being trained to approach under 
the motivation of hunger. 

c. Each trial running to food when hungry is a reinforced training 
trial for approach. 

d. Greater amounts of food deprivation, up to a limit of at least 48 
hr under the particular conditions used, produce greater strengths of 
hunger drive. 

e. Animals running away from electric shock are being trained to 
avoid under the motivation of fear. 

f. Each trial on which the animals receive electric shock is a rein- 
forced trial for avoidance. 

g. Greater strengths of electric shock, within the limits used, produce
greater strength of fear drive. 

h. Within the limits of the animal’s capacity, the strength of the 
response tendency is positively related to the speed with which the
subject will start to run.

i. Within the limits of the animal’s capacity to pull, the strength of 
the response tendency is positively related to the strength of pull exerted 
by an animal temporarily restrained.

Verification of deductions about separate effects. The first experi- 
ments actually performed had to do with developing the recording tech- 
nique and confirming deductions concerning the effects of strength of 
shock and of hunger on conflict behavior [55]. It was only after these
experiments were under way that we thought of studying the avoidance
and the approach tendencies separately in still simpler experimental
situations. It often happens that the simplest ideas are not the ones that
occur first.

For clarity of exposition, we shall first describe the experiments 
studying approach and avoidance separately in very simple situations.
In the first of these, Bugelski and Miller [14] trained albino rats to run 
away from a brief electric shock received at the distinctive end of a
straight alley. On test trials without shock, they confirmed a simple
deduction from the gradient of avoidance (B) and the partial definition
(h) relating strength of tendency to starting speed: 

6. Subjects placed nearer the end of the alley at which they had re- 
ceived shocks started running faster than those placed farther from the 
shock end. 

Fig. 4. Gradients of approach and avoidance. The approach gradient
represents the force with which rats under a 48-hr hunger drive pulled
against a restraining harness at different distances from the point at
which they had been fed. The avoidance gradient shows the force with
which rats pulled away from the point at which they had received a
strong shock on the previous trial. From Brown [12].

Meanwhile, our group was casting about for better ways to study
separately the effects of the gradients of approach and avoidance.
Eventually, we evolved the strength-of-pull technique, which involved
another partial definition (i). Using this technique, Judson Brown [12]
secured evidence summarized in Figs. 4 and 5, verifying the following
simple deductions:

7. Approach animals pulled harder when stopped near to the goal 
than they did when stopped far from the goal. 

8. Avoidance animals pulled harder when stopped near to the goal 
than when stopped farther from the goal. 

9. Distance from the goal reduced the strength of pull of the avoidance
animals more than that of the approach ones. It should be noted
that, on the basis of exploratory work, the experimental conditions were
arranged so that the strength of pull for the avoidance animals was
stronger than that of the approach animals in the test near the goal and
weaker than that of the approach animals in the test far from the goal.
Since this kind of reversal (i.e., crossing of the gradients) could not
possibly be produced by any distortion in the units at different levels of
the scale on the ordinate, the demonstration of greater steepness of
avoidance did not involve any assumptions about the equality of the units
on the ordinate. I believe that it is very important clearly to recognize
that at present most of our measurements are made with scales that do not
necessarily involve psychologically equal units. Therefore, we should try
to design tests that do not depend on the dubious assumption of equal
interval scales.

Fig. 5. Effect of strength of drive upon height of gradient. The two
avoidance gradients represent the strength of pull on near and far
nonshock tests of two groups of rats that had received shocks of
different strengths. The approach gradient represents the strength of
pull at near and far points of animals tested after 48 hr of food
deprivation; weak approach represents a single test at the near point of
another group of animals with 1 hr of food deprivation. From Brown [12].

10. Increasing the number of hours of food deprivation increased the
strength of pull of the approach animals. 

11. Increasing the strength of electric shock increased the height of
the gradient of avoidance measured by strength of pull.
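
The argument attached to deduction 9, that a crossing of the gradients cannot be produced by distortion of the units on the ordinate, can be illustrated with a small sketch. All numbers, names, and distortion functions below are hypothetical illustrations and are not Brown's data. The sketch assumes only that the true scale is some strictly increasing transformation of the recorded one. Under that assumption the crossing survives every transformation, whereas a comparison of steepness between two gradients that do not cross can be reversed.

```python
import math

# Hypothetical mean pulls (arbitrary units); illustrative only, not Brown's data.
approach = {"near": 50.0, "far": 40.0}
avoidance = {"near": 70.0, "far": 10.0}

# A second hypothetical pair in which the gradients do NOT cross.
no_cross_app = {"near": 90.0, "far": 80.0}
no_cross_avo = {"near": 30.0, "far": 15.0}

# Strictly increasing distortions of the measurement scale (assumed forms).
distortions = {
    "identity": lambda y: y,
    "log": lambda y: math.log(y),
    "square": lambda y: y ** 2,
}


def gradients_cross(app, avo, f):
    """Avoidance above approach near the goal and below it far from the goal."""
    return f(avo["near"]) > f(app["near"]) and f(avo["far"]) < f(app["far"])


def avoidance_steeper(app, avo, f):
    """Compare the near-to-far drop of the two gradients on the distorted scale."""
    return (f(avo["near"]) - f(avo["far"])) > (f(app["near"]) - f(app["far"]))


# The crossing is preserved under every distortion tried.
crossing_preserved = {
    name: gradients_cross(approach, avoidance, f) for name, f in distortions.items()
}

# Without a crossing, the steepness comparison depends on the scale chosen.
steeper_without_crossing = {
    name: avoidance_steeper(no_cross_app, no_cross_avo, f)
    for name, f in distortions.items()
}

print(crossing_preserved)
print(steeper_without_crossing)
```

In the second pair the avoidance gradient looks steeper on the raw and logarithmic scales but flatter on the squared scale, which is why a test built on a crossing is preferable to one built on a difference of slopes.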

It should be noted that even in these simple experiments, the
assumptions were not tested directly. Each test involved the partial definitions
of experimental conditions, such as nearness and strength of hunger.
Each test also involved the assumption contained in the partial definition
(h or i) on which the technique of measurement was based.

Verification of deductions about joint effects. Having seen that the
simple deductions involving the separate effects of approach and
avoidance were verified, let us now turn to the deductions involving the joint
effect of approach and avoidance in a conflict situation. In order to test
these deductions, Miller, Brown, and Lewis [75] trained albino rats to
run down a short alley to food while wearing a harness attached to a
device which traced on moving paper a record of their runs. Then they
were given electric shocks at the goal. Different groups were tested
after different strengths of shock and with different degrees of
hunger. Deductions 1, 2, and 4 were confirmed. The rats tended to go
part way to the goal and then stop. As can be seen in Fig. 6, either
decreasing the strength of hunger or increasing the strength of
electric shock caused the rats to stop farther from the goal.

Fig. 6. Effect of strength of shock and of hunger on farthest run
toward goal in an approach-avoidance conflict. (Abscissa: shock
intensity, plotted to log scale.)

At a considerably later time, Kaufman and Miller [42] confirmed
deduction 3 by showing
that in groups given progressively more approach-training trials, pro- 
gressively more of the animals reached the goal in spite of having 
received electric shock. Incidental observations appear to confirm the 
analogous deduction 5 with respect to the effect of increased number 
of avoidance trials reinforced by shock, but that deduction has never 
been systematically tested. 

In short, 10 out of 11 deductions have been confirmed, and inciden- 
tal evidence suggests that the eleventh is probably correct. The ratio of 
10 deductions to 6 assumptions (and 9 partial definitions) would not be 
impressive if each assumption and definition were made post hoc to fit 
the data from these particular experiments. Since only one assumption 
was new, and the partial definitions were determined (although not
explicitly stated) before the experiments on the basis of other experience,
the ratio of new facts to new assumptions is high and the verification is 
quite convincing. 

Deduction of greater steepness of avoidance gradient. While testing 
the foregoing deductions, Judson Brown and I had been speculating on
the basis for the greater steepness of the gradient of avoidance. It did
not seem reasonable that this was a function merely of the difference
between going toward and away. Although we realized that the slope of
such gradients could be modified by special discrimination training [59],
such training did not seem to be involved in our experiments. Finally,
we agreed on an explanation based on the following two additional
assumptions, which are commonly made in stimulus-response learning
theory:


(G) In the conflict situation (anticipatory to shock) fear is a learned 
drive elicited primarily by situational cues but hunger is more dependent 
on internal physiological factors. 

(H) The strength of learned drives, like that of other learned responses, 
varies inversely with distance from the point of reinforcement.


The essence of the deduction is as follows. Since fear is a learned 
drive, it will be most strongly aroused by the cues originally most closely 
associated with reinforcement. Therefore, when the subject is confronted 
with cues at a distance, the gradient of reinforcement will have a double 
effect—it will weaken not only the specific responses involved in with- 
drawal, but also the fear motivating these responses. This double effect 
will cause the avoidance to fall off rapidly. On the other hand, since the 
hunger motivating approach is more dependent on internal physiological 
factors, it will be less influenced by distance from the point of rein- 
forcement so that its strength will remain relatively constant. The gra- 
dient of reinforcement will operate only on the responses involved in 
approach; it will not also affect the drive. Therefore, one can deduce:

12. The gradient of approach should fall off less steeply than that of 
avoidance. 
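
The double effect behind deduction 12 can be made concrete with a rough numerical sketch. The text does not state how drive and habit combine, nor the shape of the gradient. The sketch therefore ASSUMES that they multiply and that a learned response falls off exponentially with distance. The decay rate, the test distances, and the drive values are all hypothetical. Under those assumptions, distance weakens approach once (habit only) and avoidance twice (habit and learned fear).

```python
import math

DECAY_PER_CM = 0.02  # hypothetical fall-off rate of any learned response


def learned_strength(distance_cm):
    """Assumed exponential gradient of reinforcement (assumption H)."""
    return math.exp(-DECAY_PER_CM * distance_cm)


def approach_tendency(distance_cm, hunger=1.0):
    # Hunger depends mainly on internal factors (assumption G), so it is
    # treated as constant; distance weakens only the approach habit.
    return hunger * learned_strength(distance_cm)


def avoidance_tendency(distance_cm, fear_at_goal=2.0):
    # Fear is a learned drive, so distance weakens the fear itself ...
    fear = fear_at_goal * learned_strength(distance_cm)
    # ... and also the withdrawal habit that the fear motivates.
    return fear * learned_strength(distance_cm)


def proportional_drop(tendency, near_cm, far_cm):
    """Fraction of the near-point strength lost at the far point."""
    return 1.0 - tendency(far_cm) / tendency(near_cm)


NEAR_CM, FAR_CM = 30.0, 170.0  # hypothetical test points in the alley
approach_drop = proportional_drop(approach_tendency, NEAR_CM, FAR_CM)
avoidance_drop = proportional_drop(avoidance_tendency, NEAR_CM, FAR_CM)

print(f"approach loses {approach_drop:.3f} of its near-point strength")
print(f"avoidance loses {avoidance_drop:.3f} of its near-point strength")
```

With the hypothetical fear value chosen here, avoidance is the stronger tendency at the near point and the weaker one at the far point, so the two curves cross in the manner described for the strength-of-pull experiments.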

As far as the current experiments on conflict were concerned, adding 
the two new assumptions (G and H) in order to replace assumption C 
by deducing it from them, reduced rather than increased the parsimony 
of the system. However, these assumptions had been found to be ex- 
tremely useful in dealing with a considerable amount of experimental 
data in other areas of learning and behavior theory [64] and, as will 
subsequently be seen, suggested new deductions which were experimen- 
tally confirmed. 


SUMMARY OF VARIABLES IN FIRST STAGES OF SYSTEM 


The variables used to account for the results of the experiments in 
the conflict alley may now be listed and discussed according to the cate- 
gories suggested in the outline of Project A. A diagrammatic summary 
of these variables is presented in Fig. 7.

Variables I through IV are the intervening variables. It will be
noticed that, on the antecedent side, each of these is the point of
convergence of more than two arrows, and on the consequent side, the
source of several arrows either directly (in the case of III and IV) or
via another variable (in the case of I and II). I believe that there is
no justification for using an intervening variable unless there is reason
to expect the variable to be affected by at least two antecedent conditions
and have at least two consequents. A later section will discuss the
experimental design needed, but seldom employed, to make intervening
variables useful.

Fig. 7. Simplest formulation to account for results of experiments completed in
the conflict alley. The arrows representing the functional relationship assumed
between the terms are labeled with letters indicating the definitions or assumptions
involved. There are a considerable number of additional unspecified variables,
such as amount of food used as a reward, size of bars in shock grid, and individual
differences among rats, which are either held constant or randomized by the
experimental design and hence may be omitted for the present purpose. The
response of approach (or avoidance), 10 on the consequent side, changes the
distances 5 and 5’ on the antecedent side. Thus, there is an effect, like that of negative
feedback, which tends to produce movement toward a point of equilibrium. See
the text for further discussion.

Relativity of systematic vs. empirical distinction.* Variables 1 through
7 are independent; 8, 9, and 10 are dependent. Although these variables
are closely related to specific experimental operations, it should be noted
that they are stated in somewhat general terms. Therefore, according to
the usage of this book, they should probably be called systematic
independent and dependent variables.

* The discussion in this section takes off from a distinction made by Koch in
the original outline of these volumes and further specified in a personal
communication from him.

For example, “hours of food deprivation” does not specify the exact 
proportion of various elements in the diet used to maintain the animal. 
It is assumed that the same general qualitative type of relationship— 
namely, that, within limits, greater number of hours of deprivation pro- 
duces stronger hunger and stronger pull—will hold for different kinds 
of diets. This assumption could turn out to be false. Therefore, one 
should call “hours of deprivation” a systematic independent variable. 
The more complete specifications of the maintenance schedule would be 
the empirical independent variable. 

The distinction between systematic and empirical variables, however, 
is relative. Suppose one were to specify the proportion of carbohydrates, 
fats, and proteins in the diet. These might turn out to be satisfactory 
empirical independent variables, or it might turn out that various carbo- 
hydrates were not interchangeable so that one would have to descend to a 
still lower level. One might even specify flour made of wheat from a 
given source, only to find that an as yet unknown vitamin was essential 
so that the particular vintage or conditions of storage of the wheat were 
crucial.

It is possible to proceed down the scale to an almost infinite number 
of possible empirical independent variables; even the most detailed 
operational description of procedure involves assumptions about the 
general applicability of the terms used. It is impossible for a theorist to 
be completely certain in advance whether or not these assumptions are 
justifiable. It is equally impossible for a person who believes himself to 
be a pure empiricist to avoid such assumptions. For example, some of the 
early quantitative work on the combustion of gases was confused by the
fact that the chemists of those days did not know that hydrogen and 
carbon monoxide were two different gases [21]. 

Dynamic relationships. Returning to Fig. 7, it will be noted that 5’ 
(Distance from Point of Reinforcement for Avoidance) has arrows in- 
dicating that it affects both II (Strength of Fear) and IV (Strength 
of Avoidance). This dual effect for avoidance, compared with the single 
effect for approach, is the basis for the greater effect of distance on 
avoidance than on approach. But it can also be seen that the Response of 
Approach or Avoidance on the consequent side of the diagram will 
change the Distance in centimeters from the Point of Reinforcement on 
the antecedent side. Because such changes in distance affect the strength 
of avoidance more than that of approach, the response of approaching 
tends to increase the relative strength of avoidance until approach is 
stopped, whereas the response of avoidance tends to decrease the relative
strength of avoidance until it is stopped. It is this phenomenon (analogous
to negative feedback) which leads to the deduction of a point of stable 
equilibrium represented diagrammatically by the point at which the two 
gradients cross. 
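
The feedback relationship just described can be sketched as a small simulation. Straight-line gradients, the particular heights and slopes, the gain, and the rule that the animal moves in proportion to the net tendency are all assumptions made for illustration. The text asserts only that avoidance is the steeper gradient and that responding changes the distance. Under those assumptions the subject settles at the crossing point whether it starts far from the goal or close to it.

```python
def approach_strength(distance_cm, height=100.0, slope=0.2):
    """Hypothetical linear approach gradient (shallow)."""
    return max(0.0, height - slope * distance_cm)


def avoidance_strength(distance_cm, height=160.0, slope=1.0):
    """Hypothetical linear avoidance gradient (higher at the goal, steeper)."""
    return max(0.0, height - slope * distance_cm)


def crossing_point(a_height=100.0, a_slope=0.2, v_height=160.0, v_slope=1.0):
    """Distance at which the two straight-line gradients are equal."""
    return (v_height - a_height) / (v_slope - a_slope)


def settle(start_cm, gain=0.5, steps=100):
    """Move toward the goal when approach dominates, away when avoidance does."""
    distance = start_cm
    for _ in range(steps):
        net = approach_strength(distance) - avoidance_strength(distance)
        # Positive net tendency shortens the distance; negative lengthens it.
        distance = max(0.0, distance - gain * net)
    return distance


equilibrium = crossing_point()
from_far = settle(200.0)
from_near = settle(10.0)

print(f"gradients cross at {equilibrium:.1f} cm")
print(f"started far:  stops at {from_far:.1f} cm")
print(f"started near: stops at {from_near:.1f} cm")
```

Each step toward the goal raises avoidance more than approach, and each step away lowers it more, so every displacement from the crossing point is corrected. That correction is the sense in which the effect resembles negative feedback.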

We have just pointed out that Distance from the Point of Reinforce- 
ment for Avoidance has arrows indicating that it affects both Strength of 
Fear and Strength of Avoidance. It would be consistent with general 
S-R learning theory to assume similar dual effects (indicated by similar 
double arrows) also for 6 (Strength of Shock) and 7 (Number of
Shocks). Such assumptions have not been made specific, however, since 
they are not involved in any of the experiments to date. Indeed, it would 
be difficult to devise an appropriate test comparing the effect of strength 
of shock with that of strength of hunger until we can discover some 
independent means of equating the strength of these two drives in the 
same way that a meter stick can be used to equate the distances of ap- 
proach and avoidance. 

In Fig. 7, each of the arrows representing functional connections be- 
tween terms is labeled with letters designating the partial definitions or 
assumptions involved in the relationship. It will be noticed that some 
terms are linked only by definitions (represented by small letters), some 
are linked only by assumptions (represented by capital letters), and 
others are linked by both. This emphasizes the fact that the distinction 
between a major assumption (postulate or law) and a partial definition is 
often a relative and somewhat arbitrary one. For example, the laws of 
physics involve definitions and the usefulness of the definitions is de- 
pendent on the particular uniformities in nature expressed in certain 
laws. Thus, various laws, such as Ohm’s law, are involved in the con- 
struction of the meters commonly used to measure volts and amperes. 


Sample of General Issues in Behavior Theory Not Included in System 


The arrows from two of the independent variables, Reduced Body 
Weight and Habituation to Feeding Cycle, are only dotted lines and are 
not labeled by any letters. In this particular series of experiments, these 
variables were merely held constant; therefore no special assumptions 
were made about their effects. It would be easy to make such as- 
sumptions, however, and to test the deductions resulting from them. 
These particular variables were put into the diagram to illustrate the 
belief that strength of hunger depends on a number of factors. A con- 
siderable number of other variables, such as amount and palatability of 
food used as a reward, size of bars in the grid, and method of controlling 
electric current, are omitted from the diagram because these variables 
were either held constant or randomized by the design of the present 
series of experiments. It is frequently desirable to restrict one’s problem
in this way, but one cannot be certain in advance whether the particular
restrictions will increase or decrease the fruitfulness of the work. 

The present formulation deals with strength of hunger during per- 
formance, or in other words, at the time the test is made. One could also 
ask: “What is the effect of strength of hunger during training?” In 
other words, one could transpose to the conflict situation the problem 
of the effect of drive on learning vs. on performance. Going to the logical 
extreme of running animals to food when not hungry and testing them 
when hungry raises the problem of latent learning. Similarly, an in- 
vestigation of the effects of changing either the strength or the type of 
drive between training and testing would get one into the problem of 
drive generalization. One could also try to determine whether the effects 
of drive and of number of reinforced approach trials combine multi- 
plicatively or in some other fashion. Massed vs. distributed trials could 
be studied or forgetting curves plotted, or one might start off from an 
early experiment of mine [53] and analyze the conditions under which 
a rat placed directly into the goal box, which was concealed by a curtain, 
and shocked there would, without any further training, show foresight 
by not running down an alley leading to that goal box. Work of this kind 
would enter the realm of the higher mental processes. In short, most of 
the manifold problems of learning and behavior theory are potentially 
involved in this miniature system. 

As a point of departure, it would be simplest to assume that all the 
variables involved in these problems have effects on conflict behavior 
similar to those observed in other experimental situations. But that
assumption is by no means certain. Ultimately, the effect of all these
variables could and should be investigated in a conflict situation. If these 
variables function in the same way in the conflict and in other situations, 
this finding will increase our faith that we are “carving nature to the 
joint.” If they function in different ways, we shall be faced with new
and interesting problems. 

With such a large number of possibilities available, and only limited 
resources, the investigation of all these variables was deferred and no 
specific assumptions were made concerning their effects. The decision,
then, was to hold such variables constant in the studies while expanding 
the theoretical and experimental program in a different direction. 


INCLUSION OF STIMULUS SIMILARITY AND EXTENSION 
TO DISPLACEMENT 


Now that the early stages of the development of the miniature system 
have been described, some of its extensions may be briefly sketched. 
From almost the beginning, we thought of distance in space as a
special case of stimulus similarity. According to this hypothesis, the
spatial gradients of approach and avoidance could be derived from the 
more general principle of a gradient of stimulus generalization. Further- 
more, the notion of a gradient of stimulus generalization allowed us to 
incorporate into the system conflicts induced by difficult discriminations. 
The theoretical analysis and experimental verification have been described 
elsewhere [10, 59]. 

Tests of simple deductions. We shall only mention here an ex- 
periment in which Brown [11] trained animals to approach an illu- 
minated ground-glass panel to secure food. Then he measured the rat’s 
strength of pull when temporarily restrained as a function of the similarity 
of the test illumination to that during training. The effects of absolute 
brightness were balanced out by groups originally trained to approach 
either dim or bright panels, respectively. 

He found that the animals tested with stimuli more similar to those 
used in training pulled harder and thus verified a simple deduction from 
the assumption that there is a gradient of stimulus generalization of the 
approach responses. Furthermore, animals tested with food deprivation 
pulled harder on both the similar and dissimilar tests, thus verifying a
simple deduction from the assumption that increases in drive raise the 
height of the entire gradient. (See Fig. 16.) Incidentally, I believe that 
this latter assumption may account for a considerable range of phe- 
nomena outside the realm of conflict behavior—for example, apparent 
reductions in S-R thresholds under increased drive, effect of drive on 
responses in certain projective test situations, and certain aspects of 
hallucinatory behavior. 

More complex deductions. If one assumes that gradients in space 
and in other dimensions of stimulus similarity are basically the same, one 
will expect the gradient of stimulus generalization of avoidance motivated 
by a learned drive, such as fear, to be steeper than that of approach 
motivated by a primary drive such as hunger. This assumption, along 
with the preceding ones, allows us to extend the miniature system to 
include phenomena of the type that have been described clinically as 
displacement. In an analysis presented elsewhere [61], I have used this 
extension to derive the following eight deductions describing specific 
ways in which the phenomena of displacement should be affected by 
various conditions: 

1. When the direct response to the original stimulus is prevented by 
the absence of that stimulus, displaced responses will occur to other 
similar stimuli, and the strongest displaced response will occur to the 
most similar stimulus present. For example, a girl who is prevented from 
marrying her sweetheart by his death, and who is completely recovered 
from her grief and other possibly negative factors, will be expected to 
prefer the suitor who is most similar to him.

2. When the direct response to the original stimulus is prevented by
conflict, the strongest displaced response will occur to stimuli which have 
an intermediate degree of similarity to the original one. Thus a girl who 
is prevented from marrying her sweetheart by a violent quarrel would 
be expected to prefer someone not completely similar, but not completely 
different. 

3. If the relative strength of the inhibitory response is increased, 
the point of strongest displacement, and hence object choice, will shift 
in the direction of stimuli which are less similar to the original one 
eliciting the direct response. In other words, the more unhappy the 
girl’s experience with her previous sweetheart, the less similar should be 
her choice of a second love object. 

4. If the strength of the drive motivating the direct response to the 
original stimulus is held constant, the strength of a displaced response 
will be weaker than the direct response to the original stimulus would 
have been. 

5. If the strength of the drive motivating the direct response to the 
original stimulus is held constant, the strength of the displaced response 
will be greater when the direct response to the original stimulus is pre- 
vented by the absence of that stimulus (provided other very similar stim- 
ulus objects are present), and progressively weaker the stronger the in- 
hibition involved. 

Corollary: If the inhibition is strong enough so that the two gradients 
do not cross, no displaced response will occur. 

6. If the drive motivating the direct response to the original stimulus 
is increased, the strength of all displaced responses will be increased. 

7. If the strength of the drive motivating the direct response to the 
original stimulus is increased, it will be possible for increasingly dissimilar 
stimuli to elicit displaced responses. This might give the appearance of 
lowering the threshold of response to such stimuli. 

8. In situations in which the direct response to the original stimulus 
is prevented by conflict, increasing the strength of the drive to the in- 
hibited response will shift the point of strongest displacement, and hence 
object choice, in the direction of stimuli which are more similar to the 
original one eliciting the direct response. 
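
Several of these deductions can be traced in a simple numerical sketch. The sketch ASSUMES straight-line generalization gradients that are cut off at zero, a net tendency equal to approach minus avoidance, and arbitrary heights and slopes. None of these specifics comes from the text, which requires only that the inhibitory gradient be the steeper one. With these cut-off linear forms the location of the peak does not depend on the height of the approach gradient, so the sketch is not meant to illustrate deduction 8.

```python
def approach(dissimilarity, drive=10.0, slope=0.5):
    """Hypothetical generalization gradient of the direct (approach) response."""
    return max(0.0, drive - slope * dissimilarity)


def avoidance(dissimilarity, inhibition, slope=2.0):
    """Hypothetical, steeper generalization gradient of the inhibiting response."""
    return max(0.0, inhibition - slope * dissimilarity)


def net_tendency(dissimilarity, inhibition, drive=10.0):
    return approach(dissimilarity, drive) - avoidance(dissimilarity, inhibition)


def strongest_displacement(inhibition, drive=10.0):
    """Return (dissimilarity, net strength) of the strongest response on a grid."""
    grid = [i * 0.5 for i in range(41)]  # dissimilarity from 0 to 20
    best = max(grid, key=lambda x: net_tendency(x, inhibition, drive))
    return best, net_tendency(best, inhibition, drive)


no_conflict = strongest_displacement(inhibition=0.0)       # deduction 1
conflict = strongest_displacement(inhibition=12.0)         # deduction 2
stronger_inhibition = strongest_displacement(16.0)         # deductions 3 and 5
higher_drive = strongest_displacement(12.0, drive=14.0)    # deduction 6
overwhelming = strongest_displacement(inhibition=50.0)     # corollary

for label, result in [
    ("no conflict", no_conflict),
    ("conflict", conflict),
    ("stronger inhibition", stronger_inhibition),
    ("higher drive", higher_drive),
    ("overwhelming inhibition", overwhelming),
]:
    print(f"{label}: peak at dissimilarity {result[0]}, net strength {result[1]}")
```

When the net strength at the best point is not positive, as in the last case, the gradients never cross within the range where approach is present, and no displaced response is expected.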

General clinical evidence seems to confirm these deductions. Further- 
more, other investigators have extended this system to predictions of be- 
havior in projective doll-play situations and have secured verifications 
of a number of deductions [109, 97]. 

Additional experimental tests. After seeing that the more general 
assumptions would help to explain clinical phenomena of displacement, 
an attempt was made to test deductions in experimental situations. 
Murray and Miller [89] used the strength-of-pull technique to study 
separately the generalization of approach and avoidance to new stimulus
situations, e.g., from a narrow black to a wide white alley. They
confirmed the simple deductions by showing (1) that there is a gradient of
generalization for both habits, and (2) that the avoidance habit is 
weakened more by generalization than the approach one. 

Miller and Kraeling [79] verified the deduction from the joint 
action of the two types of generalization by showing that when an 
approach-avoidance habit is established in one situation and generalized 
to another somewhat similar one, albino rats are more likely to approach 
the goal in the new situation than in the original one. 

Murray and Berkun [88] applied the model to a situation involving 
both the dimensions of distance and similarity and confirmed the follow- 
ing deductions: After an approach-avoidance conflict had been estab- 
lished in an original alley, rats left it and entered other somewhat dif- 
ferent ones. The more different the new alley, the nearer they approached 
to the food end. The second of these two deductions has been confirmed 
in a somewhat different experimental situation by Berkun [6] who also 
confirmed a number of additional deductions concerning the “thera- 
peutic” effects of various types of trials in the original and in the displace- 
ment situation. 

Implications for projective tests. The extension of conflict theory to 
displacement has certain implications for projective tests. When a re- 
sponse to an original stimulus is inhibited by conflict, we might expect 
it to be more likely to occur to a somewhat similar but somewhat dif- 
ferent stimulus than to a more direct question about, or picture of, the 
original situation. Furthermore, the stronger the subject’s motivation, and 
the weaker his inhibition, the more likely the inhibited response should be 
to occur. 

But other factors must be considered also. One of these is the nearness 
(or similarity along a variety of culturally modified dimensions) of the 
response elicited by the test situation to the primary tabooed one. Com- 
bining this dimension with the one of stimulus similarity, one might ex- 
pect pictures depicting a violent form of aggression against an insect to 
be roughly equivalent to those depicting a mild form of aggression 
against a man. 

Other dimensions are the ambiguity of the eliciting stimulus which 
will determine the range of responses that are not in too direct conflict 
with it, and the generally permissive or prohibitive context of the test 
situation. To these we must add the subject’s responses to certain 
thoughts which may be elicited but not reported. For example, the lower 
avoidance in the projective situation might release a sexual or aggressive 
thought that would be frightening enough to cause the subject to “re- 
treat” and fail to express even relatively innocent thoughts. 

From the foregoing incomplete analysis we can see why the method
of the projective test should have certain advantages, but we can also see
that the factors determining the net outcome of the conflicting tendencies 
are by no means simple. 

Why avoidance gradient is steeper. The extension of the assumptions 
about spatial distance to include all dimensions of stimulus generaliza- 
tion made it easier to test a crucial prediction from the assumptions used 
in deducing the greater steepness of the gradient of avoidance. Applied 
to the situation of stimulus generalization, the essence of the original 
deduction runs as follows. When the subject is tested for avoidance in a 
stimulus situation different from the one used in training, the gradient 
of stimulus generalization has a double effect—it will weaken not only 
the specific responses involved in avoidance, but also the fear motivating 
these responses. When a similar test is given to other subjects who have 
been trained to approach, the [...]

Fig. 8. [...] avoidance motivated by the learned drive of fear and by
the primary drive of pain. From Miller and Murray [82].

[...] one group of animals is taught to run down an alley in
order to escape from a strong electric shock and then is tested
without shock in both the original and a different alley, that group
should show a steep gradient of stimulus generalization because both 
the fear motivating their behavior and the habit of running will be 
weakened by the change to the new alley. If another group of animals 
is trained in exactly the same way but tested in both alleys with 
the shock on, the change in the stimulus situation will not be ex- 
pected to affect the drive (pain and fear from shock) but only the 
habit of running. Therefore, this group should show a less steep gradient 
of stimulus generalization. 

The results of an experiment by Miller and Murray [82] testing 
this deduction are shown in Fig. 8. It can be seen that, exactly as pre- 
dicted, the group trained and tested with a 310-volt shock showed a 
relatively flat gradient of stimulus generalization (actually sloping up, 
but not reliably so), while the fear group, trained with a 310-volt shock 
and tested without shock, showed a gradient that sloped downward. The
difference in slope is highly reliable in the direction confirming the
deduction.

It can be seen also that administering the tests with the shock on 
not only greatly flattened the gradient of stimulus generalization but also 
raised the height of the entire gradient. Therefore, one could say that 
the gradient was flatter merely because the animals in the different alley 
were pulling as hard as they could so that further increases in strength 
of avoidance could not increase the strength of pull. This would be the 
same thing as saying that the units of measurement were greatly com- 
pressed toward the top of the ordinate so that the upper curves were 
spuriously flattened. One check on this was further to increase the 
strength of shock. As can be seen, this increased the height of the 
gradient, indicating that we were not running entirely off the scale. 

Another much more convincing check was running another group 
trained and tested with a weaker shock (100 volts). As can be seen, the 
gradient of this group actually crossed that of the one trained with the 
stronger shock but tested without shock. Since the relationship of crossing 
cannot be an artifact of the units of measurement, our confirmation did 
not depend on any assumptions concerning the equality of the units on 
the ordinate. 
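Why a crossing is immune to distortions of the scale can be shown with a short sketch. The two gradients and the compressive transformation below are invented for illustration and are not the experimental data. Any strictly increasing rescaling of the ordinate preserves which curve is higher at each point, so a crossing survives it, whereas an apparent difference in slope between curves lying at different heights need not.

```python
import math

# Hypothetical strengths of pull at increasing stimulus dissimilarity.
strong_shock_tested_without = [9.0, 6.0, 3.0]  # steep gradient
weak_shock_tested_with = [5.0, 5.5, 6.0]       # flat gradient that crosses it


def compress(value):
    """A strictly increasing transformation that squeezes the top of the scale."""
    return math.log1p(value)


def order_pattern(curve_a, curve_b):
    """For each point, record whether curve_a lies above curve_b."""
    return [a > b for a, b in zip(curve_a, curve_b)]


raw = order_pattern(strong_shock_tested_without, weak_shock_tested_with)
rescaled = order_pattern(
    [compress(v) for v in strong_shock_tested_without],
    [compress(v) for v in weak_shock_tested_with],
)
```

Here `raw` and `rescaled` are identical, and the order of the two curves reverses between the first and last points in both, so the crossing is present on either scale.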

Can the approach gradient ever be steeper than that for avoidance? 
We have just seen evidence supporting the deduction that the greater 
steepness of the gradient of avoidance in our experimental situations was 
produced by the fact that distance (or stimulus dissimilarity) had a 
double effect on that gradient—weakening not only the strength of the 
avoidance habit, but also that of the learned fear motivating that habit. 
This type of analysis raises an interesting question: what would happen 
if the drive motivating avoidance were elicited chiefly by internal cues, 
which remained relatively constant in different external stimulus con- 
texts, whereas the drive motivating approach was primarily aroused by 
external cues which changed with the stimulus context? 

In such a case, the gradient of approach should be steeper than that 
of avoidance. Then, as has already been explained, we would expect the 
subject either (a) not to approach at all, if the intersection of the 
gradients was between him and the goal, or (b) to go completely up to 
the goal, if he was between it and the point of intersection. The subject’s 
behavior might vary markedly from situation to situation, but we would 
not see that pattern of approaching part way and then stopping and 
vacillating which is so easy to identify as a symptom of conflict. It is con- 
ceivable that such behavior occurs under certain circumstances, but has 
been generally unreported because it is less conspicuous than the other 
pattern. 
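The contrast between the two cases can be sketched as follows. The sketch assumes linear gradients, with distance 0 taken as the goal; the function name, its parameters, and the numerical values are hypothetical and serve only to make the argument concrete.

```python
def predicted_behavior(start, approach_at_goal, approach_slope,
                       avoidance_at_goal, avoidance_slope):
    """Classify behavior for linear gradients; distance 0 is the goal."""
    def net(distance):
        approach = approach_at_goal - approach_slope * distance
        avoidance = avoidance_at_goal - avoidance_slope * distance
        return approach - avoidance

    if net(start) <= 0:
        return "does not approach"
    if net(0) >= 0:
        return "goes all the way to the goal"
    return "approaches part way, then stops"


# Approach steeper than avoidance (gradients cross about 2.7 units from the goal).
steep_approach = dict(approach_at_goal=10, approach_slope=2,
                      avoidance_at_goal=6, avoidance_slope=0.5)
far_start = predicted_behavior(5, **steep_approach)
near_start = predicted_behavior(2, **steep_approach)

# Avoidance steeper than approach (the usual experimental case).
steep_avoidance = dict(approach_at_goal=6, approach_slope=0.5,
                       avoidance_at_goal=10, avoidance_slope=2)
usual_case = predicted_behavior(5, **steep_avoidance)
```

With the steeper approach gradient, the subject starting beyond the intersection does not approach and the subject starting inside it goes all the way; only the steeper avoidance gradient yields approaching part way and then stopping.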


Summary of Variables in the System Expanded to Include Dis- 
placement and Additional Independent and Dependent Variables 


A somewhat expanded formulation to account for experiments per- 
formed on conflict and displacement and for future experiments involv- 
ing certain additional variables is summarized in Fig. 9. The intervening 
variables are I through IX; the systematic independent variables are 1 
through 21; the systematic dependent variables are 22 through 28. Be- 
cause the distinctions among such variables are not absolute, we see 
again that some of the variables that are described as systematic in- 
dependent might, on some interpretations of the theory, be classified as 
empirical independent. Distance in centimeters from point of reinforce- 
ment of approach is a case in point. Similarly, a few variables that have 
been classified as intervening might be considered to be systematic in- 
dependent, examples being amount of reinforcement for approach, 
stimulus similarity, and strength of noxious stimulus. In any event, the 
exact classification of the variables is not nearly as important as the way 
they are assumed to function. 

One of the most important changes in the diagram is that the inter- 
vening variables are made the focus of more arrows. For example, hunger 
is assumed to be affected by six different independent variables and to 
affect the rate of bar pressing reinforced on a variable-interval schedule 
as well as the strength of approach measured by speed of response and 
strength of pull. These assumptions suggest a large amount of additional 
experimental work. To give a single example, any combination of factors 
1 through 6 which produces an increase in rate of bar pressing will be 
expected also to produce an increase in speed of response, strength of 
pull, and distance of approach toward the goal. If the results of a variety 
of experiments testing such expectations show the type of consistent inter- 
relations demanded by the assumptions, the status of hunger as a useful 
unitary intervening variable will be greatly increased. But there is a 
very real possibility, indeed a probability, that such experiments will 
show that this simple formulation has to be modified. This general topic 
will be pursued further in a later section on the type of experimental 
design required, but seldom used, to make intervening variables useful. 
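The kind of consistency demanded here can be stated as a simple ordinal check. The sketch below is only an illustration: the function name is hypothetical and the numbers are invented, not experimental results. It asks whether every dependent measure ranks every pair of experimental conditions in the same way, using rank order alone.

```python
from itertools import combinations


def orderings_agree(measures):
    """True if every measure ranks every pair of conditions the same way.

    `measures` maps a measure name to one value per condition, all listed
    in the same condition order. Only ordinal information is used.
    """
    columns = list(measures.values())
    n_conditions = len(columns[0])
    for i, j in combinations(range(n_conditions), 2):
        # Sign of the difference between conditions i and j for each measure.
        signs = {(col[i] > col[j]) - (col[i] < col[j]) for col in columns}
        if len(signs) > 1:
            return False
    return True


# Hypothetical values for three conditions produced by factors 1 through 6.
consistent = {
    "bar_pressing_rate": [4, 9, 15],
    "speed_of_response": [0.8, 1.1, 1.9],
    "strength_of_pull": [20, 35, 50],
}
inconsistent = dict(consistent, strength_of_pull=[20, 50, 35])
```

A set of results like `consistent` would support treating hunger as a unitary intervening variable; a set like `inconsistent`, in which one measure reverses the order of two conditions, would be the kind of outcome requiring the formulation to be modified. Note that this strict version also counts a tie on one measure as a disagreement.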

Many additional experimental problems are implicit in this ex- 
panded formulation. We shall discuss only a few examples. The dotted 
arrow with the circled minus sign between hours of water deprivation 
and strength of hunger is meant to raise the problem of the possible inter- 
actions between hunger and thirst. After the various interactions have 
been studied, will the results fall into a simple pattern so that a given 
strength of thirst, no matter how it is produced, has a certain effect on 
the strength of hunger? Or will it be simpler to assume that specific 
operations such as water deprivation, or eating dry food, have effects on 
both hunger and thirst? 

Turning our attention for the moment to IX on the right-hand side 
of the diagram, the representation there is meant to imply that speed of 
response and strength of pull are a function of the algebraic difference 
between the two response strengths. This assumption, however, is by no 
means certain. It is entirely possible that administering at the goal shocks 
that are too weak to stop the animal from approaching and eating will 
be found to have the dynamogenic effect of increasing speed of running 
or strength of pull instead of reducing them as would be expected from 
algebraic summation. 

It can be seen that in this expanded formulation of the system, there 
is a relative decrease in the proportion of instances in which the theorist 
is assuming that a relationship which has already been found useful in 
integrating a large number of observations in other situations will also 
be applicable to a new situation. There is a corresponding increase in 
the proportion of instances in which the theorist is simply making a more 
or less plausible guess in order to pose a problem clearly. This change 
is largely the result of attempting to tie down the intervening variables 
by a greater number of experimental operations on the antecedent side 
and measures on the consequent side. The attempt to extend the system 
to the types of experimental situations that will really test the general 
utility of the intervening variables spotlights difficult problems which are 
latent in many of our efforts to construct psychological theories. 


Effects of Alcohol and Barbiturates 


Conger [22] has extended the theory in a different direction by 
adding the assumption that alcohol produces a greater reduction in fear 
motivating avoidance than in hunger (and presumably other primary 
drives) motivating approach. He has confirmed the simplest deductions 
from this extension by measuring approach and avoidance separately in a 
controlled experimental situation and showing that the alcohol produces 
a greater reduction in the strength of pull of frightened rats avoiding the 
place where they had previously received electric shocks than that of 
hungry rats approaching food. 

He has tested also the more complex situation in which approach 
and avoidance are operating simultaneously. He trained hungry rats to 
approach food and then threw them into an approach-avoidance con- 
flict by giving them electric shocks at the goal. He verified the deduction 
by finding that they were more likely to go back to the food after re- 
ceiving injections of alcohol than of normal saline. 

He also verified another deduction, namely, that it was easier to 
train rats to avoid food (and electric shock) on the trials on which they 
were sober, and to go to food (without shock) whenever they were drunk, 
than it was to teach them the opposite discrimination of going to food 
when they were sober and avoiding it when drunk. Thus, Conger’s 
assumption was verified by deductions involving three relatively different 
techniques: strength of pull, conflict behavior, and the learning of a 
discrimination. 

This extension of the theory has been used to explain a number of 
the perplexing social effects of alcohol by Conger [22], Dollard and 
Miller [25], and Miller [68]. 

Bailey and Miller [3] have extended the theory along similar lines 
by assuming that barbiturates, such as sodium amytal, produce a greater 
reduction in the strength of the fear motivating avoidance than in that 
of other drives motivating approach. A deduction from this assumption 
has been used to explain an observation reported by Masserman [50] and 
has been confirmed in a similar but simpler experimental situation in 
which Bailey and Miller [3] found that an injection of sodium amytal 
caused cats in a simple approach-avoidance conflict to resume eating at 
the place where they had received electric shocks. 


EXTENSIONS TO PSYCHOTHERAPY 


Miller [59] and Dollard and Miller [25] have further extended the 
miniature system to cover some of the phenomena commonly observed 
in psychotherapy. This extension involved the following additions to the 
partial definitions: 

j. The definition of nearness is extended to apply to any situation 
in which the subject can be said to be coming nearer to a goal in space, 
time, or some dimension of qualitative or culturally defined similarity of 
cues. 

k. The definition of avoidance is extended to apply to the responses 
producing inhibition and repression. 

l. The definition of approach is extended to the responses that are 
inhibited or repressed. 

The meaning of the foregoing three definitions is illustrated by the 
example of a patient who is trying to remember the traumatic battle- 
field event that has produced a functional amnesia. His goal is to 
remember the traumatic event. According to l, the chain of thoughts 
leading up to the inhibited memory of that event will function in the 
same way as the approach responses in the conflict alley. According to k, the 
responses producing inhibition and repression of the memory will 
function like the avoidance responses in the conflict alley. According to j, 
remembering events more closely associated with the traumatic memory 
will be analogous to coming nearer to the goal in a conflict alley. There- 
fore, as the subject approaches nearer to his goal of remembering the 
traumatic event, we will expect the strength of the repression to increase 
more rapidly than the strength of the memory, so that the subject will be 
able to remember more remotely associated events, but not the nearer 
ones. Similarly, we will expect barbiturates, such as sodium amytal, to 
reduce the gradient of strength of the repression so that the subjects will 
be able to remember events nearer to the traumatic ones, and if the 
repression is not too strong, will be able to remember the traumatic 
event itself. 

It should be noted that the foregoing definitions are only a con- 
venient, first approximation; it may be necessary to modify them. For 
example, we have already suggested that the gradients of all responses 
now classified as approach may not fall off more steeply than all those 
now classified as avoidance. The analysis that we have made of the 
reasons why the gradient of avoidance is steeper than that of approach 
suggests that it might be better to classify responses on the basis of the 
way the motivation involved is aroused. Responses primarily motivated 
by drives that are dependent on external cues in the environment should 
show steeper gradients of generalization than those motivated by drives 
dependent on physiological factors or internal cues that the organism 
carries with it from situation to situation. 

Need for better definition or scaling of empirical variables. It is 
obvious that the definitions stated above are a considerable distance from 
the empirical level. Therefore, it frequently will be more difficult to 
secure complete agreement on the application of these definitions to 
clinical phenomena than it has been to secure agreement on the ap- 
plication of the preceding ones to simple experimental situations. The 
chief roadblock to rigorously testing the applicability of the conflict 
system to clinical phenomena is the lack of more exact definitions of the 
empirical variables. I believe that this barrier can be overcome by 
creating various scales. At first, these will probably have to be very 
simple ordinal scales, such as the following tentative scale of distance 
from a young man’s goal of marriage: being present at a social gathering 
with girls, coming near to a girl, talking to a girl about impersonal sub- 
jects, talking about more personal subjects, asking for a date, dancing, 
kissing, going steady, becoming engaged, getting married. These scales, 
like any others, will have to be validated by their usefulness in establishing 
relatively simple but general scientific laws. 
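Such an ordinal scale can be represented very simply. In the sketch below the ordering is the tentative one given in the text, while the function name and the use of a list are illustrative choices. Only the rank order is treated as meaningful; differences between ranks are not interpreted as distances.

```python
# Tentative ordinal scale from the text, ordered from farthest to nearest the goal.
MARRIAGE_SCALE = [
    "being present at a social gathering with girls",
    "coming near to a girl",
    "talking to a girl about impersonal subjects",
    "talking about more personal subjects",
    "asking for a date",
    "dancing",
    "kissing",
    "going steady",
    "becoming engaged",
    "getting married",
]


def is_nearer_to_goal(event_a, event_b, scale=MARRIAGE_SCALE):
    """True if event_a is ranked nearer the goal than event_b.

    Only rank order is meaningful; rank differences are not distances.
    """
    return scale.index(event_a) > scale.index(event_b)
```

For example, `is_nearer_to_goal("kissing", "dancing")` is true on this scale. The `scale` argument is included so that a differently ordered list could be supplied if the ranking had to be changed.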

The task will not be easy and the fact that we probably will be deal- 
ing with acquired equivalence of cues [76] leads to the distressing pos- 
sibility that the events may have to be ranked in a somewhat different 
order for different individuals. We may have to start with cases in which 
the cultural training is quite uniform for all individuals in our society, or 
group together individuals who have had similar experiences. 

In a recent monograph, one of my students, Murray [87], has 
described some of his attempts to make and use simple scales based on a 
quantitative content analysis of recorded psychotherapeutic interviews. 
Murray and Berkun [88] have used such a content analysis in an at- 
tempt to test deductions from an application of the conflict theory to a 
psychotherapeutic case and have checked the results against those of an 
experiment on rats. Auld and Murray [1] have summarized some of the 
attempts at scaling by other people with different theoretical orientations. 
Meanwhile, Dollard, Auld, and White have been working for some 
time on a monumental effort to scale psychotherapeutic materials and to 
test the reliability and the descriptive adequacy of their categories. 

Of course, the usefulness of such scales is not limited to the present 
miniature system. The development of any program of linking any type 
of theory more rigorously to the empirical facts of psychotherapy depends 
upon the innovation of better techniques for securing public agreement 
upon these facts, or in short, upon evolving better definitions of the 
empirical variables. 

Tentative verification of selected deductions concerning psycho- 
therapy. In the absence of more rigorous definitions of the empirical 
variables, any attempt to check deductions from our miniature system 
against the phenomena of psychotherapy will necessarily be somewhat 
informal and limited to situations which seem to be clear-cut enough 
so that it is hoped that one can achieve general agreement. Such selec- 
tion of situations, however, makes it easier to ignore phenomena that do 
not clearly fit the theory. With these limitations in mind, one can say 
that, as a first approximation, a number of significant deductions seem 
to be tentatively confirmed by the clinical observations that are available. 
Going part way and then stopping, or in other words, being unable to 
achieve or leave the goal, seems to be characteristic of patients in an 
approach-avoidance conflict. Furthermore, weakening the strength of the 
drive motivating avoidance or increasing the strength of the drive 
motivating approach seems to cause patients to go nearer to their 
goals. 

At this point, we come to some new deductions which may be made 
with the assumption H, that fear, like other learned responses, varies in- 
versely with distance from the point of reinforcement. In other words, 
we will expect the strength of fear elicited at any given distance from a 
feared stimulus to be a function of the height of the avoidance gradient 
at that particular distance. Turning back to Figs. 2 and 3, the following 
relationships can be seen to hold for the range of distances within which 
the two gradients intersect: 

1. Increasing the strength of approach should cause the subject to go 
nearer to the feared goal, and at this point, stronger fear should be 
elicited. 

2. Decreasing the over-all strength of avoidance should cause the 
subject to go nearer to the goal, and at this point stronger fear should 
be elicited. 

3. The increase in fear should be greater when the same distance of 
approach toward the goal is produced by raising the gradient of ap- 
proach than when it is produced by lowering the gradient of avoidance. 
It should be noted that this deduction is dependent on the fact that the 
gradient of avoidance is steeper than that of approach.* 

4. In each of the above cases, greater distances of approach toward 
the goal should produce greater increases in fear. In order not to be 
dependent on the assumption of linearity, this deduction must be re- 
stricted to those cases in which the greater distance of approach includes 
the smaller one. 

5. After the goal is reached, further increases in the strength of 
approach should not produce additional increases in the fear elicited, but 
further reductions in the strength of avoidance should produce additional 
reductions in the fear elicited. 
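These deductions can be checked in a small numerical sketch. It assumes linear gradients, with distance 0 as the goal and the avoidance gradient steeper than the approach gradient, and it takes the fear elicited to be the height of the avoidance gradient at the point where the subject stops. The function names, slopes, and heights are hypothetical values chosen for illustration.

```python
def stopping_distance(approach_at_goal, avoidance_at_goal,
                      approach_slope=1.0, avoidance_slope=3.0):
    """Distance from the goal at which the two linear gradients intersect.

    Clipped at 0, which means that the subject reaches the goal.
    """
    crossing = ((avoidance_at_goal - approach_at_goal)
                / (avoidance_slope - approach_slope))
    return max(0.0, crossing)


def fear_at_stop(approach_at_goal, avoidance_at_goal,
                 approach_slope=1.0, avoidance_slope=3.0):
    """Height of the avoidance gradient at the point where the subject stops."""
    d = stopping_distance(approach_at_goal, avoidance_at_goal,
                          approach_slope, avoidance_slope)
    return avoidance_at_goal - avoidance_slope * d


# Both changes below move the stopping point from 3 units to 2 units from the goal.
baseline = fear_at_stop(approach_at_goal=4.0, avoidance_at_goal=10.0)
stronger_approach = fear_at_stop(approach_at_goal=6.0, avoidance_at_goal=10.0)
weaker_avoidance = fear_at_stop(approach_at_goal=4.0, avoidance_at_goal=8.0)

# After the goal is reached, only changes in avoidance alter the fear elicited.
at_goal = fear_at_stop(approach_at_goal=12.0, avoidance_at_goal=10.0)
at_goal_more_approach = fear_at_stop(approach_at_goal=14.0, avoidance_at_goal=10.0)
at_goal_less_avoidance = fear_at_stop(approach_at_goal=12.0, avoidance_at_goal=8.0)
```

With these values, fear at the stopping point rises from 1 to 4 when approach is strengthened and from 1 to 2 when avoidance is weakened, although the subject advances the same distance in both cases. This is the pattern stated in deductions 1 through 3. At the goal, raising approach further leaves the fear at 10, while lowering avoidance reduces it to 8, as in deduction 5.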

The evidence supporting these deductions has been summarized in 
somewhat more detail elsewhere [25]; it can only be suggested here. The 
first of these deductions is in line with the clinical evidence that in- 
creasing the patient’s motivation to approach seems to increase his fear 
and conflict. As would be expected from the last two deductions, such 
increases are practicable without producing intolerable fear only when 
the patients are relatively near to the goal, either because the initial 
conflict is weak or because they are approaching the end of successful 
therapy. Attempting to force patients ahead does not seem to be prac- 
ticable in the face of the strong inhibitions and repressions of the severe 
neurotic. 

*If we take the greater steepness of the gradient of avoidance as a postulate, 
and assume that the strength of fear elicited is a function of the height of the 
gradient, this deduction follows immediately. If we attempt to deduce the greater 
steepness of the gradient of avoidance, the deduction concerning the strength of 
fear is not so simple and unequivocal. It would be simplest to assume that the 
fear generalized with distance in exactly the same way as does any other habit. 
This would give us a gradient of fear parallel to the gradient of approach, and 
the greater steepness of avoidance would emerge only when the strength of the 
fear drive along its gradient potentiated (e.g., was multiplied by) the strength of 
the avoidance habit along its gradient. On the other hand, one could assume that 
the hunger drive would provide relevant cues for approach which would make 
the near and far situations more similar for that habit and hence make the gradient 
of approach flatter than that of fear. 

The second deduction offers an explanation for the paradoxical 
negative therapeutic effect. After the therapist has succeeded in diminish- 
ing the patient’s exaggerated idea of the dangerousness of the goal so that 
the patient moves nearer to it, one frequently observes an increase in the 
amount of fear and conflict elicited. Even more striking results appear 
where the conditions are better known and more clear-cut, namely, in the 
use of the barbiturates to treat amnesia produced by traumatic conditions 
of combat. In such cases, we have seen that the hypothesis that this drug 
reduces the fear motivating avoidance leads us to expect a decrease in 
the repression producing amnesia for the traumatic events. But, as the 
amnesia is lifted and the subject approaches nearer the goal of recover- 
ing his memory of the traumatic incident, one should observe an obvious 
increase in the amount of fear elicited. According to the vivid descriptions 
of Grinker and Spiegel [28, 29] this is exactly what happens. 

The third deduction seems to be confirmed by the general experience 
of therapists that it is much better to concentrate on reducing avoidance 
(in other words, analyzing resistance) than on trying to increase ap- 
proach. As would be expected from the last two deductions, this pro- 
cedure seems to be required especially during the early treatment of 
severe neurotics. 

It should be possible to make rigorous tests of the foregoing five de- 
ductions in simple experimental situations with animals provided one 
could find some measure (other than distance of approach) that will 
vary with fear and can be applied to the freely moving animal in the 
conflict alley. Our group has spent considerable effort trying to develop 
such measures (23 and 24 in the dotted box in Fig. 9) on rats. We have 
not succeeded to date and are planning to shift to dogs which we hope 
may be better for this purpose. 


GENERAL EVALUATION 


Range of application. The range of application of this miniature 
system is potentially quite broad; its boundaries are not sharply defined. 
The same assumptions that are used in it can explain a number of the 
characteristics of avoidance-avoidance conflict and the difference be- 
tween such conflict and pure approach-approach competition; they are 
also useful in dealing with double approach-avoidance conflict (i.e., 
choice between two goals eliciting ambivalent responses), and with dis- 
crimination conflict. These additional applications have been described 
elsewhere [59]. 

The concepts in the system presumably apply wherever the relevant 
conditions appear, be it in a simple animal experiment, a projective test, 
child rearing, a neurosis, psychotherapy, combat, economics, politics, or 
international relations. So far, attempts at applications have been made 
only in the areas of experimental work on conflict and displacement with 
animals, effects of alcohol and barbiturates on conflicts, and conflicts 
occurring in child rearing, neurosis, and psychotherapy. In all these areas, 
except the animal experiments, rigorous testing of the application of the 
theory is severely limited by the difficulty in specifying the relevant condi- 
tions and in measuring the relevant responses, or in other words, defining 
precisely the empirical data variables. Therefore, the applications to these 
other areas are at present somewhat programmatic. 

Evidence for the system. The main studies which this miniature sys- 
tem has mediated have already been described. Almost all of them have 
been experiments on rats running in straight alleys. These experiments 
were specifically designed to test deductions from the assumptions made 
in the system. They have involved developing new techniques, such as 
recording the distance of approach and the strength of pull, and have 
resulted in the gathering of new data. 

Because of the systematic, step-by-step program of research, the evi- 
dence from the foregoing animal studies is relatively complete and con- 
vincing for the part of the system that is summarized in Fig. 7, and for 
the part of Fig. 9 that deals with the extension to displacement. 

The extension of the system summarized in the part of Fig. 9 that 
is relevant to experiments designed to test the unitary character of the 
intervening variables remains programmatic. However, the problem of 
determining whether hunger, for example, functions as a single unitary 
intervening variable, or as a cluster of loosely associated variables, is a 
somewhat separate issue. Specific hungers and/or satiation variables 
could be substituted in the system without changing its essential char- 
acter. Furthermore, it is quite possible that the unity of the intervening 
variables can be tested more economically in situations that do not nec- 
essarily involve approach-avoidance conflict [e.g., 68, 72]. Again we see 
how a theory of conflict behavior can potentially involve all the contro- 
versial issues of behavior theory. Carving out a particular miniature 
system is largely an arbitrary matter of expediency. 

Data that do not fit. We have confined ourselves to only one aspect 
of conflict behavior—the maximum distance of approach toward a goal. 
Some other significant aspects of behavior, which are not so far included 
in this system, will be mentioned in the next section on plans for the 
future. Furthermore, we have picked experimental situations which mini- 
mize the difficulty of defining the response and the stimulus. For example, 
we chose stimulus situations (narrow black, intermediate gray, and wide 
white alleys) which were relatively easy to arrange in an ordinal series 
of similarity. 

The most relevant single observation not accounted for by the present 
system is the fact that, instead of stopping and remaining at a given 
point as might be expected from the intersection of the gradients, the 
animals tend to oscillate in a general region. As has been pointed out 
elsewhere [59] the existence of such oscillations can be accounted for by 
an assumption which has already been found useful in explaining other 
types of behavioral data, namely, that the initiation of a given response 
of either approach or avoidance produces stimuli which provide addi- 
tional cues for continuing that response. In other words, a certain 
amount of positive feedback is involved which functions like psychologi- 
cal momentum and causes the subject to overshoot. This hypothesis will 
account for the general shape of the most characteristic and regular 
oscillations, but will not by itself account for the fact that the retreat is 
so much more rapid than the approach. Other oscillations appear to be 
more random. 

A second regular pattern of behavior observed in the conflict alley is 
a stepwise one in which an increasingly hesitant approach is terminated 
by a period of crouching which is followed by another approach, etc. 
It seems probable that this pattern can be accounted for by extinction of 
fear during the period of crouching. Furthermore, the assumption con- 
cerning response-produced cues allows one to deduce the conditions that 
should favor oscillations and those that should favor stepwise approach. 

It can be seen that the response-produced cues, if indeed they pro- 
duce an effect analogous to positive feedback, introduce an element of 
unstable equilibrium, or all-or-nothing tendency, into the behavior. This 
effect is exactly opposite to that of the crossing of the gradients, which 
functions like negative feedback and produces stable equilibrium. And 
indeed, the behavior of animals in approach-avoidance conflict does ex- 
hibit these two opposing attributes. The animal placed at the start of the 
alley may go part way up and then stop and vacillate in a reasonably 
narrow region as though oscillating around a point of equilibrium. Then 
he may retreat completely back to the start of the alley and remain 
there for a while, as though some other source of positive feedback, such 
as the anticipatory goal response, had been removed. Then he may sud- 
denly start out on another approach episode. 

This all-or-nothing aspect of the behavior is completely obscured in 
the process of securing the averages plotted as the curve in Fig. 6. In 
fact, it would be possible to secure such a curve if all the animals either 
remained crouching at the start or went completely to the goal, with an 
increasing proportion reaching the goal as the strength of hunger is in- 
creased or that of electric shock decreased. Actually, Fig. 6 represents 
two effects: an increasing proportion of animals shifting all the way from 
the start to the goal, as well as shifts toward the goal by animals that go 
part way and then stop. 
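The point about averaging can be made concrete. The following sketch uses invented numbers, not the data of Fig. 6, and the function names are hypothetical. It compares a group in which every animal either stays at the start or runs the whole alley with a group in which every animal goes the same part of the way and stops.

```python
def mean(values):
    return sum(values) / len(values)


def all_or_nothing_group(n_animals, n_reaching_goal, alley_length):
    """Every animal either stays at the start (0) or runs the whole alley."""
    return ([float(alley_length)] * n_reaching_goal
            + [0.0] * (n_animals - n_reaching_goal))


def graded_group(n_animals, distance):
    """Every animal goes the same part of the way and then stops."""
    return [float(distance)] * n_animals


ALLEY_LENGTH = 8   # hypothetical units
N_ANIMALS = 10

# Each entry: (animals reaching goal, mean of all-or-nothing group, mean of graded group)
comparison = []
for n_reaching in (2, 5, 8):
    aon = all_or_nothing_group(N_ANIMALS, n_reaching, ALLEY_LENGTH)
    graded = graded_group(N_ANIMALS, ALLEY_LENGTH * n_reaching / N_ANIMALS)
    comparison.append((n_reaching, mean(aon), mean(graded)))
```

In every row of `comparison` the two groups have the same mean distance of approach (up to floating-point rounding), although no animal in the first group ever stops part way. The averaged curve alone therefore cannot distinguish the two kinds of behavior.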

But the shorter the alley, the greater the probability that the gradients 
will not intersect within its length, and also that chance fluctuations in the 
level of motivation and/or the effects of the mechanisms producing posi- 
tive feedback will shift the point of intersection from beyond one end of 
the alley to beyond the other end. In such cases, the behavior will appear 
to be solely of the all-or-nothing type; it will be impossible to demon- 
strate going part way and then stopping. 

In order to minimize possible artifacts produced when the point of 
intersection oscillates beyond the end of the alley and to secure more 
clear-cut effects which can be observed in single animals without artifacts 
of averaging, Coons, Trapold, and I have studied conflict behavior in a 
runway over 100 ft long. We observed that a given animal might oscil- 
late over a considerable portion of this distance. However, we found that, 
when hungrier, the animals tended to advance nearer to the goal and 
retreat less far from it, than when they were less hungry. Since these 
comparisons did not include trials on which an animal either failed to 
start or ran completely to the goal, the general results summarized in Fig. 
6 are confirmed without any possibility of the trend’s being an artifact of 
averaging a number of all-or-nothing changes. 

Comparison with competing formulations. As far as behavior in 
space is concerned, Hull [38, 40] has made a formulation that is so 
similar to the present one that it is supplementary rather than competing. 
Hull has focused his attention more on developing certain implications of 
shrewd guesses concerning the exact quantitative shape of functions. On 
the other hand, I have tended to feel that, for the moment, it is better to 
concentrate on qualitative relationships that can be tested with the 
ordinal scales at our disposal and to test the general applicability and 
functional unity of the intervening variables before devoting time to 
further quantification that assumes cardinal scales. Instead of choosing 
a single quantitative curve and exploring its consequences, I have ap- 
proached quantification by approximation and correction, determining 
what kinds of results, obtainable with the qualitative scales that we now 
possess, would narrow the range of possibilities to curves with certain 
general characteristics [61, p. 171]. 

With respect to behavior in space, Lewin’s [48] insightful formulation 
is similar in its main functional attributes, though couched in quite dif- 
ferent language. His system has been less thoroughly worked out and is 
especially deficient in the attempt to formulate the partial definitions re- 
quired to bridge the gap between the theoretical formulations in terms of 
psychological life space and the actually observed empirical independent 
and dependent variables. Perhaps that is one of the reasons why it has 
been responsible for fewer experimental studies. Because of the functional 
parallelism of the basic assumptions involved in the two, the choice be- 
tween Lewin’s formulation of conflict and mine does not hinge specifi- 
cally on agreement with empirical data on conflict behavior but rather on 
the general advantages of a life-space vectorial system vs. a liberalized 
stimulus-response one. 

Neither Hull nor Lewin has attempted to extend his analysis 
from distance in space to stimulus similarity in order to include phenom- 
ena such as displacement and discrimination conflict. Furthermore, they 
have not dealt with the effects of drugs, with neurosis, or psychotherapy. 
There is nothing inherent in their formulations, however, which obviously 
excludes adding the assumptions that would be necessary to include such 
phenomena.

The chief advantages of the present formulation are that it has been 
worked out in more detail, applied to a greater variety of phenomena, 
and accompanied by a systematic program of experimental verification. 

Methodological value beyond specific context. The present miniature 
system introduces no innovation in general scientific method or theory 
construction. Outside of its specific content, its general methodological 
value lies only in demonstrating that certain well-known procedures of 
science can be used to integrate a variety of psychological data ranging 
from experimental results on animals to clinical observations on neurotic 
patients. My general strategy has been to use animal experiments to 
formulate and verify principles applicable to neurotic behavior rather 
than to attempt to duplicate in animals the spectacular aspects of neu- 
rotic symptoms. I have tried to deduce with reasonable rigor a variety 
of phenomena from a limited set of principles and the specific conditions 
involved. My tactics have been to proceed step by step, verifying simple 
deductions from the principles in simple situations and then attempting 
to apply the principles and test the deductions in more complex situa- 
tions. I believe that these features of the program, which are by no means 
unique, can be applied successfully to a considerable number of other 
areas in psychology. I would caution, however, that the details of scien- 
tific strategy must vary with the current conditions of the specific prob- 
lem being investigated. 


INTERMEDIATE AND LONG-RANGE STRATEGY 


Most of my research strategy in this area is relatively short-range, 
guided by the results of the immediately preceding experiments and by 
the interests of my students and assistants. 

Development of new measures. One set of problems has to do with 
the development of new measures. As has already been pointed out, a 
variety of measures is needed to test and make meaningful the interven- 
ing variables. New measures are also required for testing deductions in 
those situations, such as psychotherapy, where suitable measures are not 
already available. 

For some time, I have been interested in trying to develop some
independent measure of fear, or of the general state of activation or
disturbance of the animal, in the conflict situation. Such a measure would
allow tests of deductions concerning the effect of strength of approach 
and avoidance drives on the intensity of fear and conflict elicited (the 
double arrows in Figs. 2 and 3); it might also reveal unexpected relation- 
ships. Furthermore, such an independent measure would be useful in 
experiments on avoidance learning, getting the theorist out of the awk- 
ward position of using fear in his explanation of avoidance and avoid- 
ance as his measure of fear. 

So far, a number of attempts to find a convenient physiological meas- 
ure, which might be used in rats with appropriate experimental controls, 
have been unsuccessful. Dr. Arlo Myers has spent considerable time suc- 
cessfully developing beautiful techniques for recording accurately a num- 
ber of physiological measures in the freely moving rat—d-c potential and 
resistance of the kidney, internal temperature of various organs, muzzle 
temperature, and heart rate. To date, none of these has turned out to be 
promising as an index of rapid changes in fear or general activation. We 
shall probably shift to the muzzle temperature and heart rate of the dog 
since other investigators have reported encouraging results with these 
measures [102, 27]. 

I am also interested in recording the psychogalvanic response in 
human subjects and in the scaling of psychotherapeutic interviews, but 
have not yet participated directly in such activities. As has already been 
pointed out, the development of better scales is a necessary part of long- 
range strategy. 

Some applications or extensions of present concepts. Another set of 
problems involves rather straightforward extensions of the present system. 
Conflict behavior should be studied in two-dimensional space to deter- 
mine whether the subjects would circle as one would predict, and 
whether or not the opportunities for lateral movement would reduce the 
extent of oscillations back and forth from the goal. Will the gradient of 
avoidance be steeper than that of approach in time as it is in space? For 
example, when asked to write a paper for a symposium at some distant 
date, the advantages seem to outweigh the disadvantages so that one 
often accepts, but as the date draws nearer, the effect of the disadvan- 
tages increases more rapidly than does the prospect of reward, so that 
one often regrets having accepted. Can such temporal factors be studied 
by rigorous empirical methods? 
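
The symposium example can be made concrete with a small numerical sketch. The linear form of the two gradients and every number below are assumptions chosen only for illustration; they are not values from any experiment. The one feature taken from the text is that the avoidance gradient is steeper than the approach gradient.

```python
def tendency(strength_at_goal, slope, distance):
    """Strength of a response tendency at a given distance from the goal.

    Assumes a linear gradient that cannot fall below zero.
    """
    return max(0.0, strength_at_goal - slope * distance)


# Hypothetical parameters: (strength at the goal, loss of strength per unit distance).
APPROACH = (10.0, 0.5)   # weaker at the goal, but a flat gradient
AVOIDANCE = (14.0, 1.5)  # stronger at the goal, but a steep gradient


def net_approach(distance, approach=APPROACH, avoidance=AVOIDANCE):
    """Positive values favor approaching; negative values favor withdrawing."""
    return tendency(*approach, distance) - tendency(*avoidance, distance)


def crossing_distance(approach=APPROACH, avoidance=AVOIDANCE):
    """Distance at which the two linear gradients intersect."""
    (a_goal, a_slope), (v_goal, v_slope) = approach, avoidance
    return (v_goal - a_goal) / (v_slope - a_slope)


# Treat "distance" as time remaining before the symposium (arbitrary units).
for time_remaining in (8, 4, 1):
    print(time_remaining, net_approach(time_remaining))
```

With these assumed values the net tendency favors accepting when the date is far off, is balanced at the crossing point, and favors regret when the date is near. Whether real temporal gradients behave this way is exactly the empirical question raised above.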

The possible therapeutic effect of displacement, or in other words, 
stimulus change, has been opened up by Berkun [6] but needs further 
investigation. All the types of “therapy” described by Masserman [50]
involve changing the stimulus situation. According to our displacement 
theory, such stimulus changes would be expected to produce some
therapeutic effect because the gradient of generalization of avoidance is
steeper than that of approach. A concrete example of the effects of such 
stimulus change comes from a recent experiment by Myers [90]. In the 
course of demonstrating that unpredictable electric shock produces more 
generalized fear than does predictable shock, he used the suppression of 
drinking as his measure of fear. He found that, if drinking had been 
suppressed by unpredictable shocks in an apparatus, changing the stim- 
ulus situation by introducing a strong flashing light would tend to cause 
the animals to resume drinking. Another experiment [79] has shown
that a change from a narrow black to a wide white alley (or vice versa) 
will cause animals inhibited by an approach-avoidance conflict to resume 
eating at the goal. Are the therapeutic effects in Masserman’s experi- 
ments the results of similar types of stimulus change? When this effect 
of stimulus change is controlled by a suitable experimental design, does 
any additional therapeutic effect remain to be explained? To what extent 
is the variable of stimulus change involved in the effect of drugs or of 
electroconvulsive shocks on fear and conflict? 

Is the effect of alcohol which has been studied by Conger [22] partly 
or wholly a function of the fact that the habits of approach were older 
than those of avoidance? How can experimental techniques be devised 
for studying approach-avoidance conflict with children or adult human 
subjects? How well can the concepts be applied to new areas such as 
installment buying in economics and appeasement in international 
relations? 

Significant conflict phenomena not covered by present hypotheses. 
In addition to relatively straightforward extensions of the system, some 
of the foregoing problems are likely to involve a natural-history type 
of observation and data collection in the new types of conflict situations, 
the addition of new assumptions to explain the new phenomena observed, 
and the development of new techniques to measure the phenomena and 
to test the deductions from the new assumptions. The investigation of 
other problems will lead to more radical alterations in the assumptions. 

We have already mentioned the attempt to check on the deductions 
of oscillation from the assumption of a positive feedback created by 
response-produced stimuli. We have also designed experiments to com- 
pare the role in approach-avoidance conflict behavior of cues produced 
by different drives with those of cues produced by different anticipatory 
goal-responses. 

In my opinion [59], an important area to investigate is the possible 
role of conflict in inducing drives over and above the original ones 
motivating the conflict behavior, or in producing more stress and dis- 
ruption than would be predicted from the mere sum of the drives in- 
volved. I am attempting to study conflict-induced drives by building 
up difficult discriminations, similar to those involved in the experimental
neurosis described by Pavlov [93]. But the apparatus is designed so that 
a specific response, such as pulling back on a leash or pressing a bar, 
will remove the discrimination problem from the subject. The advantage 
of this technique is that it allows other conditions, such as degree of 
monotonous confinement, to remain constant. 

The goal of the initial study is to demonstrate conflict-induced drive 
by comparing the learning and performance of the “escape from 
discrimination” response under the experimental condition of difficult 
discrimination with that under the control condition of an easy 
discrimination arranged so that the animal is rewarded at exactly the 
same rate. No electric shock or other punishment will be used in these 
discriminations and we are starting with a technique analogous to 
Pavlov’s classical conditioning. 

If a phenomenon of “escape from discrimination conflict” can be 
demonstrated, it will be important to determine its laws. In order to
determine these laws, we plan to try to put the response that removes
the discrimination problem on a variable-interval schedule. Using the
rate of responding as a measure of drive induced by discrimination con- 
flict, we shall try to find whether this rate is a function of variables such 
as the degree of restraint, monotony, strength of hunger, rate of presenta- 
tion of stimuli, rate of increase in difficulty of discrimination, or drugs. 
Though guided by theoretical hunches, the first stage of such an in- 
vestigation will necessarily involve a considerable amount of systematic, 
empirical exploration. 

Other families of problems have been suggested at the end of a 
previous paper [59]. Can ways be found to study the various types of 
incompatibility: mechanical, neural, chemical, perceptual, and acquired?
How do complex habits interact? Specifically, when competing patterns 
of response have some subunits that are obviously mutually exclusive and 
others that are identical, how do these subunits interact in the production 
of compromise responses? What is the role of symbolic mediation in 
determining whether or not the complex response occurs all-or-none as a 
whole, or is fractionated by conflict into subunits some of which are in- 
hibited while others are not? 

General policy. In surveying the foregoing types of problems, it 
seems to me that the best strategy is to attack first those problems which 
are most likely to open up new areas of investigation. Two such problems 
are the development of an independent measure of fear or activation and 
the attempt to demonstrate conflict-induced drive. But if these problems 
turn out to be too difficult with the means currently at our disposal, it 
will be prudent to turn to other problems. A large number of alternatives, 
all of reasonably equal a priori fruitfulness, are possible. My preference 
is to mix studies verifying more complex deductions of the present
assumptions with those aimed at testing such new assumptions as are
indicated by the unexplained behavior observed in the experiments on
conflict. The choice of such assumptions will depend on the degree to which
they have already been found useful in integrating data from other types 
of experiments and the degree to which they seem likely to integrate new 
realms of data. 

It will be noted that my description of strategy is clearer and more 
logical when I am looking backward at the work that has already been 
accomplished than it is when I am attempting to peer into the future. 


Part II. Liberalization of S-R Theory 


The preceding analysis of conflict behavior represents a concrete 
example of a type of activity in which I have been deeply involved— 
trying to cast the first slender rope of a bridge across the chasm 
separating the laboratory from the clinic and other sources of examples 
of complex social behavior. Such work, which is fortunately becoming 
much more conventional, has been greatly facilitated by the general 
context of the Institute of Human Relations under Mark May and owes 
a unique debt to intensive collaboration with John Dollard. 

As a result of trying to take the conventional S-R concepts out of 
the laboratory and apply them to concrete examples of social behavior, 
a number of changes have evolved which significantly increase their 
range and power. This part of the paper will briefly describe the 
liberalization of certain S-R concepts; it will consider certain contro- 
versial problems which require additional theoretical and experimental 
analysis, recapitulate a few key points deriving from collaborative work 
with John Dollard on problems of social learning, personality, and
psychotherapy, and analyze some methodological issues of critical
importance to the development of behavior theory.


CHANGES IN CERTAIN BASIC S-R CONCEPTS 


Functional behavioral definitions. Hull [39, 40] and many other 
stimulus-response psychologists have either omitted definitions of basic 
concepts or emphasized physical energy and anatomical location. I have 
been one of those who have believed that it is more useful, and also more 
in accord with our actual practice, to begin with functional behavioral 
definitions of basic concepts, and then subsequently to try to correlate 
the phenomena so defined with physical energies, geometric patterns, 
anatomical locations, or other events that can be independently specified.
The general method is illustrated by the following quotation from Miller
and Dollard [76, p. 59]: 


It is obvious that “response,” as here used, is not restricted to the con- 
ventional usage, in which a response is defined as a muscular contraction or 
a glandular secretion. It is also obvious that “stimulation” is not restricted 
to the conventional usage, in which a stimulus is defined as an energy 
change activating receptors. According to the present usage, a response is 
any activity by or within the individual which can become functionally 
connected with an antecedent event through learning; a stimulus is any 
event to which a response can be so connected. This definition is not cir- 
cular for cases in which the events referred to have been empirically iden- 
tified. Thus, “sitting up” in dogs is known to be a response because it can 
be functionally connected by reward with the antecedent command, “Sit 
up!”; the command is thereby known to be a stimulus. Stimulus a may be 
used to determine that 1, 2, 3, and 4 are responses; response 1 may be used 
to determine that a, b, c, d are stimuli. Then a prediction may be made 
that the connections b-2, c-3, d-4, etc., are learnable. 


In the procedure just illustrated it is obvious that the first step, 
identifying a specifiable condition as a stimulus because a specific re- 
sponse can be connected to it by learning, is purely empirical, circular, 
and safe. The second step, however, is a prediction from the assumption 
that an event which is the stimulus for the learning of one kind of re- 
sponse can also serve as a stimulus for other responses. This second step 
is not circular because the assumption could, in principle, be disproved. 
In fact, the second step is far from safe; the assumption may actually 
turn out to be wrong. 
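
The two steps can be sketched as a small bookkeeping procedure. This is a minimal illustration under stated assumptions: the observed connections are simply the ones named in the quotation (stimulus a with responses 1 to 4, response 1 with stimuli a to d), and the function names are hypothetical.

```python
def classify(observed):
    """Step 1 (empirical, circular, safe): anything that has taken part in a
    learned connection is classified as a stimulus or as a response."""
    stimuli = {s for s, _ in observed}
    responses = {r for _, r in observed}
    return stimuli, responses


def predicted_connections(observed):
    """Step 2 (risky): assume that every classified stimulus can be connected
    to every classified response, and list the pairs not yet tested."""
    stimuli, responses = classify(observed)
    return {(s, r) for s in stimuli for r in responses} - set(observed)


# Connections assumed to have been established by learning, as in the quotation.
observed = {("a", 1), ("a", 2), ("a", 3), ("a", 4),
            ("b", 1), ("c", 1), ("d", 1)}

predictions = predicted_connections(observed)
print(sorted(predictions))
```

The untested pairs include b-2, c-3, and d-4. Each one is a prediction that an experiment could refute, which is what keeps the second step from being circular.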

The foregoing type of definition has the advantage that “Any 
specifiable attribute of the environment, which the gestalt psychologists 
or other students of perception discover as a consistent basis for dis- 
criminations, can be used in the stimulus, or cue, position of a stimulus- 
response formula” [25, p. 34]. Additional advantages will become ap- 
parent in our discussion of thinking, drive-producing, and cue-producing 
responses. 

Furthermore, I believe that a definition in this general form is the 
only type feasible at present. For example, suppose we want to know 
whether polarized light or radio waves can be stimuli for bees. There 
is no way to decide the matter from a priori considerations of the physical 
energies involved. The only thing that we can do is to set up a dis- 
crimination experiment to determine whether or not physically specified 
changes in the polarization of light, or the onset of radio waves, can be 
correlated with changes in the animal’s behavior. It is conceivable that 
electrical recordings from end organs, sensory nerves, or sensory projec- 
tion areas could be substituted for the learned response. But I doubt 
whether most of us would want to accept such results if they were
consistently contradicted by discrimination experiments. Similarly, as a first
approximation, we might substitute for the learned response an innate 
reaction, but I doubt if we would want to call the event a stimulus if, in 
an exhaustive series of experiments, we found it impossible to make any 
learned response contingent on it. In the latter case, we would be forced 
at the very least to designate this as a special subclass of stimuli. 

Stimulus similarity is defined in the same general way. If, in a test 
experiment, a response learned to stimulus A generalizes more to I than 
to 1, we say that I is more similar to A than is 1. Thus far, the procedure 
is perfectly circular. Then, we predict that if we use a new response, the 
same stimuli will order themselves in the same sequence on the new 
gradient of stimulus generalization. This is not a world-shaking pre- 
diction, but it could be wrong. Then we go on to other predictions about 
these stimuli, e.g., that it will be easier to learn a discrimination between 
A and 1 than between A and I. We also predict that certain results 
will be secured in experiments on displacement or on paired-associates 
learning of nonsense syllables, etc. 
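
The logic of this definition can be illustrated with a short sketch. All of the data are invented, and the stimulus labels X, Y, and Z, the two response measures, and the function names are hypothetical. The first ordering defines similarity; the second ordering is the prediction that could fail.

```python
def similarity_order(generalization):
    """Order test stimuli from most to least similar to the training stimulus,
    using the amount of generalized responding as the defining measure."""
    return sorted(generalization, key=generalization.get, reverse=True)


def prediction_holds(defining_response, new_response):
    """The testable step: a new response should order the stimuli identically."""
    return similarity_order(defining_response) == similarity_order(new_response)


def easiest_discrimination(generalization):
    """Further prediction: the least similar stimulus is the easiest to
    discriminate from the training stimulus."""
    return similarity_order(generalization)[-1]


# Invented generalization scores for a response trained to stimulus A.
first_response = {"X": 0.8, "Y": 0.5, "Z": 0.2}
second_response = {"X": 30, "Y": 18, "Z": 7}       # same ordering
disconfirming_response = {"X": 5, "Y": 9, "Z": 1}  # different ordering

print(similarity_order(first_response))
print(prediction_holds(first_response, second_response))
print(prediction_holds(first_response, disconfirming_response))
```

The third data set shows what a failure of the prediction would look like: the stimuli order themselves differently when a different response is measured.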

If we wanted to start differently, we could use the results of the 
discrimination experiment or of verbal ratings of similarity as the initial 
basis for the definition. But the general pattern of a functional definition 
would remain essentially the same. 

In the same way, I have defined rewards as events empirically found 
to have the effect of strengthening cue-response connections, and have 
pointed out that this definition is not circular as long as the fact that 
an event that has been found to strengthen one connection is used to pre- 
dict that it will strengthen others [76, p. 30]. This kind of definition of 
reward has been discussed in illuminating detail by Meehl [51]. 

Following the same pattern, I would tentatively say that any 
specifiable conditions may be defined as increasing a drive when they 
specifically increase the performance of responses rewarded by the offset 
of those conditions, or by the goal objects that produce satiation. In using 
the word “specific,” I do not mean to imply that the increase in drive 
cannot also increase the performance of other responses, but that it 
should produce a greater increase in the responses that have been 
specifically rewarded by the reduction in, or the goal objects of, that 
drive. Again the definition is circular with respect to the defining re- 
sponse, but the assumption is made that, if a specifiable set of conditions 
can be used to motivate the performance of a given response, it will 
motivate any other response that is reinforced in the same way. 

I am more dissatisfied with the foregoing definition of drive than 
with the others. One of the reasons is that it is tied in too closely with 
a specific hypothesis of reinforcement, drive reduction. Other hypotheses, 
however, are independent of this definition; for example, the hypothesis
that all drives have cue function and that any cue will become a drive 
if intense enough. 

All of the foregoing definitions begin with defining the class member- 
ship in terms of a functional property and are circular with respect to 
the specific defining operation. Then they assume that this functional 
property is general so that it may be predicted in new situations. For 
example, if an experiment shows that a reaction of the organism can be 
attached to a new cue and hence is learnable, we classify it as a response. 
Then we assume that it will be possible to attach this reaction to other 
new cues. We can think of many examples in which this assumption 
seems to apply, but we are not certain that it is universally true. For 
instance, it is probably easier to attach the rat’s approach responses to the 
odor of food, and crouching to the sound of a loud buzzer, than it is to 
attach crouching to the odor and approach to the buzzer. We express this 
by saying that approach is higher in the initial hierarchy of responses to 
the odor and crouching in the hierarchy of responses to the buzzer. 
If we can demonstrate that such differences occur in the absence of 
previous learning, we substitute the term “innate” for “initial.” But in 
principle, if we work long enough and are clever enough, we should 
be able to attach the crouching to the odor and the approach to the 
buzzer. If we cannot do this, our theoretical system will have to be more 
complicated. 

The complications may not be great provided we can show that our 
failure was attributable to the presence of a strong response incompatible 
with the one we were trying to establish. If, as actually may turn out to 
be the case, we find other exceptions to the generality of functional 
properties, so that an event may serve as a cue for some responses but 
not for others, our theoretical system, or any other one, will have to be 
distressingly complex, unless simple laws can be found to describe the 
exceptions. 

We are confronted by exactly the same thing with respect to all of 
the other categories. That is, for example, our system becomes much
more complicated if the effects of certain types of reinforcement tend to 
be specific to certain types of responses. The difficulty is not a major 
one if we can find simple laws to describe such specificities, but if they 
exist and we cannot find such simple laws, any theory of learning be- 
comes much more complicated. 

So far, we have described operations which can be used to classify 
a specifiable event as a member of a specific category. I believe definitions 
of this kind are fundamental, but it obviously will be very advantageous 
if we can find relationships between our lists of events in different 
categories and any other independently specifiable attributes of the
events. For example, it would be immensely helpful if we could say that
any movement of the striped muscles was learnable and hence could be 
described as a response, or that certain geometrical or gestalt properties 
defined a continuum of stimulus similarity. Nevertheless, I believe that 
we need our functional definitions in order to determine the laws essential 
for the subsequent more powerful and parsimonious definitions. We shall 
discuss this problem in more detail in connection with the drive-reduction 
hypothesis of reinforcement. 

Even after we get over the hurdle of the generality of our functional 
classification, many other problems remain. For instance, having defined 
response, we are confronted with the problem of response patterns or 
units. One aspect of this problem has been raised in Logan’s [49] dis- 
cussion of the micromolar approach to behavior theory. We are also 
confronted with the problem of determining the compatibility and in- 
compatibility of various responses [59]. 

In general, stimulus-response psychologists have tended to bypass 
problems of the type we have just been raising. By intuition and trial 
and error, they have concentrated on experimental situations in which 
the stimulus and response were so simple and manageable that the lack 
of more precise definitions or laws concerning these variables was not a 
practical problem. Using such situations, stimulus-response psychologists 
have concentrated on determining the laws governing the connections of 
responses to stimuli. Thus, stimulus-response psychologists may be said 
to know and care relatively little about either stimuli or responses; they 
are specialists on the hyphen between the S and R and could more aptly 
be called “hyphen psychologists,” or to use Thorndike’s term,
“connectionists.” While admitting our ignorance, I trust it is not necessary
to point out that we have made considerable progress on this basis. Un- 
fortunately, it is also true that other theorists are not far ahead of us in 
solving these particular problems. 

Thinking: central cue- and drive-producing responses. One of the 
most important advantages of functional definitions of stimulus and 
response is that such definitions can be applied to central as well as to 
peripheral events. Instead of emphasizing anatomical location, our 
definitions direct attention toward the more significant problem of 
functional laws. These definitions free the S-R theory of thinking from 
being restricted to proprioception, allowing the theory to exploit images, 
response-produced drives and rewards, perceptual responses, perceptual 
learning of acquired distinctiveness or similarity, and the possibility that 
central responses can contribute to the focusing of attention [76, pp.
54-90].*

* Recent work on the reticular formation may give us the first suggestion of
the types of central response that might be involved in the focusing of attention.

Adding to the peripheral responses, the possibility of images and
other central responses made possible by myriad potential connections in 
the brain greatly increases the number of different distinctive cue-pro- 
ducing responses that are available and that do not interfere with overt 
activity. Therefore, the new functional definitions make it easier for 
stimulus-response theory to handle the more complex, higher mental 
processes. In fact, one might deduce that the animals with a greater 
capacity to make central responses, and especially with a greater capacity 
to make a number of them simultaneously, should be superior on prob- 
lems such as delayed response and reasoning. 

It is obvious that the postulation of central responses, such as per- 
ception and imagery, reduces the gap between S-R and cognitive theory. 
But there is still a difference in that we clearly assume that these central 
processes follow the same laws as do peripheral stimuli and responses. 
By contrast, cognitive theories tend to leave the laws governing the 
central processes rather vague, while implying that they are different 
from those governing peripheral responses [106]. 

It is also obvious that our cue- and drive-producing responses serve 
much the same function as Hebb’s cell assemblies and phase sequences 
except that we have not made specific neurological hypotheses, have 
applied them to somewhat different problems, and have assumed that 
contiguity alone, without reinforcement, is not sufficient for producing 
learning or maintaining performance. Though Hebb has attempted to 
deal with the empirical phenomena of reinforcement and experimental 
extinction, I have not been able to convince myself that his treatment is 
adequate. To cite two examples: I am skeptical of his explanation of 
why a cell assembly should grow to a certain size and then stop, instead 
of continuing to capture new neurons by contiguity until the whole brain 
was fired in one magnificent convulsion; nor do I see why an animal that 
was maintained under chronic moderate hunger should not have his 
sequences disturbed, and hence display motivation, when his metabolism 
was changed by satiation. 

It is easier to determine the laws of peripheral stimuli and responses 
because they can be independently manipulated and objectively meas- 
ured. After determining these laws, we explore the consequences of 
assuming that internal processes can be described in terms of
“stimulus-response” sequences following exactly the same laws.* This
assumption may turn out to be incorrect, but at least it focuses attention
squarely on the problem of trying to formulate the laws governing the
central processes and gives one a point of departure. It seems worthwhile
to begin with the parsimonious assumption that the relatively hidden
central processes follow the same laws as the observable peripheral ones,
see how far one can go on this basis, and then add different assumptions
if necessary.

* This is the key assumption. For example, if further experimental work
should indicate that contiguity is sufficient for forming associations
between external cues and responses, we would attempt to apply the same
law to central processes.

Same laws can produce different outcomes. It can be seen that I 
am tentatively assuming that the greater intellectual capacities of animals, 
such as man, which have much more highly developed central processes 
along with their anatomically better developed brains, are attributable 
to the potentialities of compounding more similarly functioning “units” 
rather than to a completely new type of function. To use a simple analogy, 
a calculating machine with many banks of keys and many dials may in- 
volve exactly the same kinds of parts and functions as one with only two 
banks of keys and two dials. But as a result of more replications of 
functionally similar units, one is able to do completely new things with 
the larger machine. For example, one could not use the smaller machine 
to apply the machine formula for computing the constants for a cor- 
relation (the sum of squares and cross products according to the formula 
(A + B)² = A² + 2AB + B²) because one would not have enough
dials to keep the two sums of squares and the cross products from 
running into each other. Merely because it has more units of the same 
kind, the larger machine can keep these different sums separated and 
can use this new and much more economical method which is completely 
beyond the capacity of the smaller machine. 
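
The calculating-machine analogy can be worked through numerically. The sketch below is one plausible reading of the machine formula, offered as an assumption rather than as a description of any particular machine: each pair of scores is entered as a single number with A in the high-order columns and B in the low-order columns, the number is squared, and the squares are accumulated. The parameter `shift` (hypothetical) stands for the number of columns separating the fields, that is, for how many dials the machine has.

```python
def accumulate_packed(pairs, shift):
    """Square each packed entry (A in the high columns, B in the low columns)
    and accumulate the squares, as a single running total."""
    base = 10 ** shift
    return sum((a * base + b) ** 2 for a, b in pairs)


def unpack(total, shift):
    """Read the three fields back off the total.

    Because (A*base + B)^2 = A^2*base^2 + 2AB*base + B^2, the total holds the
    sum of A^2, twice the sum of AB, and the sum of B^2 in separate fields,
    provided each field is smaller than base.
    """
    base = 10 ** shift
    sum_b2 = total % base
    twice_sum_ab = (total // base) % base
    sum_a2 = total // base ** 2
    return sum_a2, twice_sum_ab // 2, sum_b2


pairs = [(3, 4), (5, 2), (6, 7)]  # invented scores

# Direct computation for comparison: sum of A^2, sum of AB, sum of B^2.
direct = (sum(a * a for a, _ in pairs),
          sum(a * b for a, b in pairs),
          sum(b * b for _, b in pairs))

large_machine = unpack(accumulate_packed(pairs, shift=4), shift=4)
small_machine = unpack(accumulate_packed(pairs, shift=2), shift=2)
print(direct, large_machine, small_machine)
```

With four columns per field the three sums come out cleanly in one pass. With only two columns the cross-product field overflows into the field for the sum of squares of A, so the sums run into each other and the method fails, although every part of the smaller machine works in exactly the same way.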

From the foregoing example, it should be clear that assuming the
principles are the same does not necessarily mean that one must
predict that the outcome will be the same. The outcome is dependent
upon the conditions as well as the principles. In the foregoing example, 
changing the conditions by adding more units to the machine enabled 
it to perform a completely new task. Similarly, compounding S-R links 
in series and parallel leads to completely new deductions. 

Superiority of creative thought over symbolic trial and error. As I
have pointed out [25, p. 111], instrumental acts are limited by the en- 
vironment to one of the sequences leading from the start to the goal, 
while by contrast, an important attribute of cue-producing responses is 
that they are not limited in this way and can also proceed in the opposite 
sequence, working backward from the goal. Concretely, when one is at 
the office, one can get home only by walking in a given direction through 
all the intervening steps on one of the possible routes, but one can think 
of being home without thinking of all the intervening steps, and one can 
also think of walking in the opposite direction from home to the office. 
On the way home, one’s behavior can be guided by thoughts of the road- 
block near home. 

Similarly in trying to derive the formula for the area of a triangle, 
one can begin near the goal by thinking: “If I can make it into a 
rectangle, I’ll know how to get the area.” Then one can be guided by 
this thought in trying to think of various ways of making a triangle into 
a rectangle. 

The extreme importance of the distinction between mere symbolic 
trial and error and thinking guided by (or working backward from) the 
goal should not be underestimated. Figure 10 illustrates the situation 
in which there are four alternatives at the first choice point, each of 
which leads on to four additional alternatives, etc. By the time we reach 
the third stage of choices, there are 64 alternatives, and if we carried 
the figure out four additional stages, there would be 16,384, i.e., 
$4^7$. To canvass all these alternatives by symbolic trial and error would 
be a considerable task. However, to the extent that the behavior may be 
guided by the goal, the difficulty is reduced. To take the extreme case, 
if the goal is clearly in sight, or if one works backward from the goal, 
it is not necessary to canvass a single wrong alternative. 

Fig. 10. Difficulty of canvassing all the alternatives by 
symbolic trial and error contrasted with efficiency of 
being guided by, or working backward from, the goal. 
(Diagram labels: Start, Goal.) 
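
The counts in this paragraph can be checked with a short sketch. It
assumes a uniform tree with four alternatives at every choice point and a
single correct sequence; the function names and the particular goal path
are hypothetical.

```python
from itertools import product

BRANCHING = 4  # alternatives at each choice point


def alternatives_at_stage(stage):
    """Number of distinct alternatives open at a given stage of choices."""
    return BRANCHING ** stage


def forward_trials(goal_path):
    """Trial and error: try complete sequences in lexicographic order and
    return how many are tried up to and including the correct one."""
    sequences = product(range(BRANCHING), repeat=len(goal_path))
    for tried, sequence in enumerate(sequences, start=1):
        if sequence == tuple(goal_path):
            return tried


def backward_steps(goal_path):
    """Working backward from the goal: each stage has exactly one
    predecessor, so only the links on the correct path are examined."""
    return len(goal_path)


goal_path = (3, 3, 3, 3, 3, 3, 3)   # the sequence an exhaustive search meets last
```

For this worst case, `forward_trials(goal_path)` examines every one of the
16,384 sequences, while `backward_steps(goal_path)` examines only the seven
links that actually lead to the goal.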

But the foregoing example is extremely simple compared with the 
task of canvassing all the conceivable moves in a chess game, or the suc- 
cessive stages of alternatives in many real-life problems. Thus, it be- 
comes obvious that there are definite limitations to mere symbolic trial 
and error, and that goal-directed thinking of the type we have sketched 
here and analyzed elsewhere [25, pp. 110-115] is enormously more 
effective. 

We can see that some of the earlier writers who treated thinking as 
mere symbolic trial and error missed its most important attribute. Its 
ability to be guided by the goal (the definition of the problem), so that 
the dice are loaded in favor of combinations of habits likely to succeed, 
is one of the key characteristics of creative thinking. 

Do central responses require less reinforcement? Let us conclude with 
a far simpler example of how the same laws might produce different re- 
sults with central processes. Incidental observation suggests that the learn- 
ing and maintained performance of external responses require less rein- 
forcement the less effortful they are. If we experimentally verified this 
principle and made the plausible assumption that central responses re- 
quire minimal effort, we would predict that central responses could be 
learned and maintained with minimal reinforcement. Relatively weak 
secondary reinforcements should be much more likely to produce the 
incidental learning of such responses. Therefore, organisms in which 
central responses play a larger role should show more apparently latent 
learning. 

Studies stemming from modern S-R analysis of higher mental 
processes. The foregoing type of analysis has led directly to a number of 
experiments on the role of cue-producing responses. I shall summarize 
only those with which I have been directly connected. Miller [53] 
showed that providing rats with a distinctive anticipatory goal-response 
increased their ability to show “foresight,” and in another experiment 
[54] that human galvanic skin response could be conditioned to the cue of 
a word spoken aloud, would then generalize to the cue of thinking the 
word, and thereby could be transferred to any new stimulus arousing the 
thought of that word. Grose [30] secured additional evidence demonstrat- 
ing almost complete generalization of the conditioned galvanic skin 
response from the cue of saying a word aloud to the cue of thinking it, 
and also from the cue of thinking to that of speaking the word. In ex- 
periments on children, Birge [9] showed that a distinctive label could 
mediate the transfer of an instrumental response from one situation to 
another. She also showed that similar instrumental responses tended to 
mediate the transfer of similar labels. Finally, Lawrence [45, 46, 47] 
performed a series of experiments showing how training animals to pay 
attention to certain classes of cues can influence subsequent learning. 

The same type of analysis has led to an understanding of the im- 
portant role of learned drives and rewards, or in other words, emotional 
factors in thinking. It has indicated that in order to be creative one needs 
a nice balance between the courage to rebel against conventional formula- 
tions and the discipline to submit to the restraints of logic and evidence. 
This type of analysis has suggested the advantages of alternating between 
phases of uncritical production of ideas and critical selection of them. 
It has led to an explanation of why repression should make behavior 
more primitive, childlike, and stupid. It has also shown why psycho- 
therapy should meet resistance, but if successful, lead to more intelligent, 
farsighted, adaptive behavior [25]. 

Finally, the notion of response-produced strong stimulation has been 
applied to studies of learnable drives and rewards [64]. 

Some unsolved problems. Many problems remain, however. Most 
theories of intelligent behavior have surprisingly little to say about the 
difference in capacity responsible for different levels of intellectual per- 
formance in animals or man. There is a striking contrast between the 
success of psychologists in devising tests to measure intelligence and their 
failure to devise adequate theories of the capacities responsible for dif- 
ferences in intelligence. To date, most theorists have largely ignored the 
problem of capacity, merely listing traits or assuming differences in such 
things as verbal responses, unspecified parameters of equations, emergent 
levels of learning, or an ability to reorganize bad gestalts into good ones. 
Although such hunches are all right as a point of departure, they do not 
constitute a useful theory of the obvious intellectual differences between 
toads and men. For example, although verbal responses certainly are 
involved in thinking, other capacities are also required, as I predict any- 
one will discover who tries to teach a talking bird to solve problems in 
algebra. 

In the foregoing discussion and elsewhere [76, pp. 72, 89], I have 
suggested a few factors which might be involved. One of the factors 
which make the mental processes of man so much “higher” than those 
of animals may be a greater capacity to respond selectively to more subtle 
aspects of the environment as cues—especially to aspects such as triangu- 
larity, numerosity, and other relations and patterns which are commonly 
referred to as “abstract.” Another factor may be a capacity to make a 
greater variety of distinctive central cue-producing responses, and 
especially a greater capacity to respond with a number of different cue- 
producing responses simultaneously so that further responses may be 
elicited on the basis of a pattern of cues representing several different 
units of experience. But many of the most interesting problems remain 
untouched. 

For instance, what is the “filing system” of the brain? To take a 
simple example, one might ask the question: “What mammal flies at 
night?” The answer, “Bat,” comes immediately. One does not have to 
canvass either a mental list of all mammals to see which ones fly (or 
search a list of all flying creatures to see which ones are mammals) and 
then eliminate the creatures that are not nocturnal. The prompt elicitation 
of thoughts on the basis of such multiple requirements (i.e., cues) is 
obviously an enormous aid to efficient problem solving. How does the 
brain accomplish this? To what extent is it a matter of mere algebraic 
summation? How much patterning is involved? Under what conditions 
is some sort of scanning required? What kinds of training are necessary? 
I venture the hypothesis that such a response to multiple cues (i.e., re- 
quirements) originally is a somewhat inefficient matter of simple sum- 
mation, but that with specific training in responding only to correct 
combinations, patterning leads to more precisely selective behavior. Such 
problems could be analyzed experimentally. 
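
The contrast between simple summation and patterning can be made concrete
with a toy sketch. Everything in it is an illustrative assumption rather
than data: the list of animals, the cue labels, the unit weights, and the
function names `summation` and `patterned`.

```python
# Hypothetical associations between cues and responses (illustrative only).
ASSOCIATIONS = {
    "bat":     {"mammal", "flies", "nocturnal"},
    "owl":     {"flies", "nocturnal"},
    "sparrow": {"flies"},
    "dog":     {"mammal"},
    "raccoon": {"mammal", "nocturnal"},
}


def summation(cues):
    """Each cue present adds one unit of excitation to every response
    associated with it; responses are scored by simple summation."""
    return {name: len(features & cues) for name, features in ASSOCIATIONS.items()}


def patterned(cues):
    """After training on correct combinations only, a response is elicited
    solely when the whole pattern of cues is matched."""
    return [name for name, features in ASSOCIATIONS.items() if cues <= features]


question = {"mammal", "flies", "nocturnal"}
scores = summation(question)
strongest = max(scores, key=scores.get)
```

Under summation the correct response is strongest, but "owl" and "raccoon"
are also partly excited; under patterning only "bat" is elicited. No list
is scanned in either case.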

Relational responses to relational cues: flexible cybernetic behavior. 
Stimulus-response psychologists, who tended to use the old definition of 
the stimulus as a specific energy (or energy change) and of the response 
as a specific muscular contraction or glandular secretion, had difficulty 
in explaining the flexibility of behavior exemplified by the appropriate 
modification of adaptive responses in new situations—a serious difficulty 
because no two situations are completely identical. Critics such as Lashley 
[44] and Tolman [105] have long since emphasized these difficulties. 
Furthermore, the emphasis of the gestalt psychologists on total patterns 
did not solve the problem of how the stimulus pattern was translated 
into the response one. Hull’s [37] habit-family hierarchy was an attempt 
to solve this problem. But Hull, rightly, was dissatisfied with his deduction 
of this mechanism and, hence, also advanced the habit-family hierarchy 
as an independent postulate, a procedure which takes account of, but 
does nothing to explain, the flexibility of behavior. It seems probable that 
a mechanism of the type Hull tried to deduce in this paper does operate 
in certain cases and that the assumption of central perceptual and 
imageal cue-producing responses would greatly increase the power of the 
mechanism. However, I believe that there also is a simpler and more 
radical solution to at least part of the problem. 

The essentials of a new approach are contained in an analysis of 
copying [76] which turns out to be functionally similar to the engineering 
developments recently highlighted by Wiener under the name “cyber- 
netics.” This type of analysis depends on guidance by feedback and on 
the assumption of relational responses to relational stimuli. These “new” 
types of stimuli and responses fit easily into our functional definitions. 
There is considerable evidence for their existence. 

In the original example [76], a novice struggling to master singing 
on key first learned to respond to the cue of his own note being below 
that of the model’s by tensing his vocal cords to raise the pitch of his 
voice—i.e., he responded to the direction of a difference with a direction 
of adjustment. Similarly, he learned to slide the pitch down when he was 
singing too high. With practice, he learned to respond to larger 
discrepancies with larger adjustments and smaller discrepancies with 
smaller ones. Concurrently, the cue of difference of pitch was becoming 
a learned drive and the cue of sameness a learned reward so that sub- 
sequent behavior could be shaped by response-produced drive induction 
and reduction. 

The assumption of certain elements of similarity among different in- 
stances of being flat (as contrasted with being sharp or on key) allowed 
these responses of approximation and correction to generalize to new 
pitches. In these new situations, the same relational response, e.g., 
tensing, produced the appropriate correction even though the absolute 
level of pitch was different. 
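
A minimal sketch of such a relational, feedback-guided response follows.
It assumes a simple proportional rule (adjustment equals a fixed fraction
of the heard difference); the gain, the tolerance, and the pitch values
are hypothetical and are not taken from the original analysis.

```python
def sing_toward(model_pitch, start_pitch, gain=0.5, tolerance=0.5, max_steps=50):
    """Adjust the voice by relational feedback until it matches the model.

    The cue is the signed difference between model and voice; the response
    is an adjustment in the same direction, proportional to the discrepancy.
    """
    voice = start_pitch
    history = [voice]
    for _ in range(max_steps):
        difference = model_pitch - voice     # > 0 means the singer is flat
        if abs(difference) <= tolerance:     # "sameness": the learned reward
            break
        voice += gain * difference           # raise if flat, lower if sharp
        history.append(voice)
    return history


flat_singer = sing_toward(model_pitch=220.0, start_pitch=200.0)
sharp_singer = sing_toward(model_pitch=440.0, start_pitch=470.0)
```

The same rule brings the voice onto the model at both pitch levels, even
though the absolute pitches and the sequence of specific adjustments differ
between the two runs.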

In the same way, a person might learn to respond to the cue of 
seeing his hand to the left of a pencil by moving to the right until his 
path of approach was directly lined up with the pencil. On different 
occasions, the same relational response, moving to the right, might be 
made to the same relational cue, too far to the left, although the specific 
patterns of absolute stimulation and movement might be quite different. 

The same type of analysis might be applied to the general orienting 
responses of the head and eyes, with the subject learning to turn until 
his eyes were lined up correctly, and then to approach or avoid. It is easy 
to design a simple mechanism that will approach a light in this manner, 
always following the light although never repeating the exact pattern of 
specific movements. In fact, a simple phototropic organism, such as the 
single-celled Euglena, can respond only in this way. It is obvious that 
cybernetic responses of this type are much more flexible than specific 
ballistic responses to specific push-button cues, and that an S-R analysis 
does not need to be limited to the latter. 

In other publications [25, p. 87; 64, p. 466], we have pointed out 
how such responses might function to produce goal-oriented learned 
drives. 

A more rigorous and comprehensive attempt to apply this cybernetic 
type of analysis should involve an experimental program determining 
what types of difference, direction of difference or magnitude of dif- 
ference, can serve as cues, and how a response learned to such a cue at 
one level of a continuum such as pitch, will generalize to other different 
levels and even conceivably to other continua. For example, Miller [52] 
found that children readily use “the box-of-a-different-color” as a cue 
and that a response to this cue will generalize to new combinations of 
colors; similarly, Harlow et al. [33] have found that monkeys can be 
taught to respond to the cue of “oddity,” or in other words “the one 
object that is different in any group of three.” 

The addition of other features, such as the habit of responding to 
a barrier by backing up and moving at a tangent, would be needed 
to round out the analysis. If such a detailed analysis were made and 
experimentally tested, we might be able to account for the goal-oriented, 
flexible quality of behavior rather than merely assuming it. We might 
also account for failures to adjust immediately to certain changes. There 
is no reason why such an analysis cannot be made in modern stimulus- 
response terms. 

Role of proprioceptive feedback. There are many additional problems 
which could be the basis for experimental programs. Often, a response 
of a given absolute magnitude must be made without any chance for 
guidance all of the way to the goal by concurrent feedback—e.g., to sink 
a putt, the golf ball must be hit in the right direction with the right 
force. It seems highly probable, however, that such responses can be 
guided by proprioceptive feedback, with the subject learning to increase 
the force until he produces a certain proprioceptive stimulus and then to 
stop. In such cases, the problem of different response amplitudes will 
reduce to proprioceptive discrimination, and response generalization will 
be a special case of stimulus generalization. 

Responses governed by proprioceptive feedback will necessarily be 
somewhat slow; in fact precise, delicate responses usually are relatively 
slow. But is this the sole mechanism, or is there some cruder, faster 
mechanism whereby responses of grossly different amplitude can be made 
without waiting for control by proprioceptive feedback? 

Learning to correct for distortions—man’s capacity as a computer. 
A whole different set of problems is raised by man’s enormous capacity 
to adjust rapidly to various forms of perceptual and motor distortion. 
It is obvious that relational responses to relational cues, guided by con- 
tinuous feedback, will not be greatly affected by distortions unless the 
relationships are actually reversed as in mirror drawing. However, with 
a response like the golf putt, that cannot be guided by continuous feed- 
back all of the way to the goal, the correction has to be made in advance. 
We automatically “compute” such corrections when we adjust to a faster 
or sloping green, a longer or heavier club, or allow for the wind. If one 
looks through prisms which displace everything to the left, one does not 
have to practice on every separate point before one can look, close one’s 
eyes, and then touch objects reasonably accurately. Similarly, one rapidly 
adjusts to various mechanical systems that magnify or reduce one’s 
movements. 

How much practice of what kinds does one need to correct for dif- 
ferent kinds of distortion? How much transfer is there to new points 
in the distorted space? Are some kinds of distortion (i.e., curvilinear 
relationships or reversals) more difficult than others? The use of an 
analogue computer to introduce precisely specifiable changes into the 
S-R relationships in a simple psychomotor task, such as looking at a point, 
closing one’s eyes, and then moving a pointer to touch the point, ought 
to enable one to analyze man’s capacity as a computer to correct for 
various types of systematic distortion. Such experiments would give the 
theorist new problems and new data. 
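
As a rough sketch of what such an experiment might measure, consider the
simplest case of a uniform displacement. The learning rule below (add a
fixed fraction of each observed error to a single correction) is an
assumption introduced for illustration, as are the names `practice` and
`prism` and all numerical values.

```python
def practice(distort, points, rate=0.5, trials=20):
    """Learn a single additive correction from feedback at practice points.

    `distort` maps the true position of a target to where it appears.
    After each reach made without visual guidance the error is observed
    and a fraction of it is added to the correction.
    """
    correction = 0.0
    for _ in range(trials):
        for target in points:
            reach = distort(target) + correction   # aim at the seen position
            error = target - reach                 # observed after the reach
            correction += rate * error
    return correction


def prism(x):
    """Displace everything 10 units to the left."""
    return x - 10.0


correction = practice(prism, points=[0.0, 20.0])
novel_target = 55.0                                # never practised
reach_at_novel = prism(novel_target) + correction
```

Practice at two points transfers to a point never practised, because the
correction is learned for the relationship rather than for each point. A
single additive term of this kind would not suffice for curvilinear
distortions or reversals.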

Multiple cue and response potentialities. By now it should be ap- 
parent that I believe that most stimulus objects present the organism with 
a multiplicity of potential cues. Thus, the organism may learn to respond 
to the absolute position of a stimulus object, to its relative position, to its 
absolute brightness, to its relative brightness, to its color, to its form, to 
its being the one object that is different, etc. 

Some of these cues may be relatively direct products of end-organ 
stimulation, others may be the result of various levels of innate mecha- 
nisms for analyzing and processing such stimulation, and yet others may 
be the result of learned cue-producing responses. These cues may differ 
in innate perspicuousness, which in turn may be modified by learned 
factors such as end-organ orientation and the focusing of attention. 

Similarly, the organism has many responses in its repertoire. For 
example, stimulation of the spinal cat can elicit coordinated walking, 
running, or hopping responses which presumably are organized on an 
innate basis and vastly simplify the problem of learning locomotion. As 
we have already pointed out, responses can probably be made either on a 
relational or on a more absolute, ballistic basis. Furthermore, it is con- 
ceivable that the innate wiring diagram of the organism allows for the 
possibility of responses which, in effect, specify a given new posture as a 
“balanced” or “null” position so that deviations from that posture 
produce feedback inducing movement in the proper direction until the 
posture is restored. A simple example of a mechanism of this type is a 
recording instrument in which a signal unbalances one arm of a bridge 
circuit, causes a galvanometer to be deflected from a null position, and 
thereby actuates a motor which continues to run in the right direction 
until the other arm of the bridge has been changed enough to bring 
the circuit back into balance. 

Therefore, the same physical movement to the same physical stimulus 
object may in different instances be an example of a different type of 
response to a different part of the population of cues. The only way 
to determine for certain what response to what cue is the dominant one is 
to perform a series of critical tests in which different potential responses 
to different potential cues are pitted against each other. After a sufficient 
series of tests, one may predict the performance in additional test situa- 
tions. We do not yet have the laws to enable us to make many predictions 
directly from the experimental conditions but there is no reason why we 
can not begin to discover such laws if we abandon relatively sterile either- 
or controversies and devote ourselves to the determination of the laws. 
At present we are limited to the rather vague but general principle 
that differential reinforcement can cause organisms to learn to respond 
to a particular aspect of the population of potential cues with a particular 
type of potential response. In general the environment seems to reinforce 
responding to objects with acts that produce certain end results. 

Maturation of concepts—learning sets. I have just argued that most 
stimulus situations confront the subject with a multiplicity of potential 
cues. My first publication [52] showed that the relative dominance of 
such cues can vary with age. Using a convenient technique, the critical- 
choice direct delayed reaction, which required no verbal skills, I found 
that position was the dominant cue for the youngest children. But with 
increasing age, first a simple configuration and then color became dom- 
inant over position for most children in the situations used. 

The shifts in relative dominance of cues observed in these and other 
studies might be produced in two ways: (a) the mechanisms responsible 
for the ability to respond to these cues, and hence for their perspicuous- 
ness, may mature at different rates, and/or (b) the subject may be re- 
warded for learning to pay attention to different cues [76, p. 72]. 
Furthermore, it is possible that the mechanism for learning to pay 
attention to a specific cue is subject to maturation during childhood. 
Incidentally, recent studies of the functions of the reticular formation 
demonstrate considerable central control over afferent impulses and 
may supply a neurophysiological mechanism for the “focusing” of at- 
tention as well as new techniques for investigating this problem in ani- 
mals. 

It can be seen that learning to pay attention to specific types of 
cues could be the basis for the acquired distinctiveness which Lawrence 
[45, 46, 47] has studied in my laboratory and the learning sets which 
Harlow and his students have studied at Wisconsin. Furthermore, ma- 
turational (and learned) changes in the capacity to respond to cues 
and in the dominance of different cues should have an important effect 
on the concepts which children of different ages are likely to acquire. 
It seems reasonable to suppose that the more subtle and “abstract” cues, 
such as numerosity and circularity, which may depend on central mech- 
anisms for processing information, would be especially susceptible to 
effects of postnatal maturation and of learning. 

Similarly, the relative position of various members of a hierarchy of 
responses may be modified by both maturation and learning, and this 
relative position will be expected to affect the child’s readiness to learn 
specific types of skill or knowledge. 

It is obvious that empirical work sharpening up the definitions of 
functional units of cue and response would contribute to a better under- 
standing of the foregoing problems. 


COMMENTS ON SELECTED CONTROVERSIAL ISSUES 


This part of the paper continues the general discussion of the way 
in which I differ from certain other stimulus-response psychologists by a 
brief discussion of a number of controversial issues. 


Do Drives Fall into the Class of Stimuli? 


The notion that characteristic stimuli are associated with drive states 
is relatively old but not completely accepted. I have explored some of the 
consequences of carrying this notion to its logical conclusion by assum- 
ing that all drives are strong stimuli and that any stimulus becomes a 
drive if intense enough [76]. 

One deduction from this assumption is that drive generalization 
should occur as a special case of stimulus generalization. This deduction 
has been tested in a series of experiments, first briefly summarized [57] 
and then reported in more detail after World War II [61]. The first of 
these experiments showed that, if thirsty rats are trained to run down a 
short alley for water and then tested without thirst, they will not only 
run faster but also drink more if tested with the irrelevant drive of 
hunger than if tested without this irrelevant drive. Since hunger and 
thirst are so closely related, a second experiment was performed in which 
hungry rats were trained to take the correct arm of a T maze to secure 
food. Subsequent tests without hunger showed that they would both 
run faster and choose more correctly if tested with the irrelevant drives 
of either electric shock or fear than if tested without these additional 
drives. 

Additional evidence was subsequently secured in our laboratory by 
von Felsinger [107] who showed that induced estrus (presumably in- 
volving sexual motivation) increased the running speed and improved 
the error scores of two groups of satiated rats which had originally been 
trained to run a maze when hungry or thirsty respectively. Similarly, 
Webb [108] at Iowa found that thirst as an irrelevant drive increased the 
performance by food-satiated animals of a habit originally learned under 
hunger. Although Webb’s experiment had the advantage of using more 
than two levels of irrelevant drive, his thirsty animals may have stopped 
eating dry food and thus been somewhat hungry when tested. 

Hull [39], who was familiar with the results of the first experiments 
in this series, interpreted them in a different way. Instead of a gradient 
of drive-stimulus generalization, he assumed a completely general 
energizing effect, possibly mediated by substances in the blood. Aside from 
the fact that the blood-substance subhypothesis would be too slow to 
account for rapid response to pain, the chief difference between Hull's 
hypothesis and mine is that I would predict the type of relationships 
that would be expected from a gradient of stimulus generalization. For 
example, if irrelevant drive A had more effect on a habit based on drive 
B than drive C, then a habit based on drive A should be more affected 
by B as an irrelevant drive than by C. By contrast, Hull’s 1943 postulate 
predicts completely equal effects, although his 1952 one is more guarded 
in this respect. The experiments required to decide between these two 
main alternatives have not yet been performed. 

Response-produced drives and rewards. By combining the hypoth- 
esis that strong stimuli serve as drives with the notion of response-pro- 
duced stimulation, one arrives at the hypothesis that learnable drives are 
the product of responses that produce intense stimulation. One also 
arrives at the hypothesis that secondary reinforcement is produced by 
responses that reduce strong stimulation. It should be remembered that 
these responses might occur within the central nervous system. These 
hypotheses, which have led to a considerable amount of significant work, 
have been presented in detail elsewhere [64, 25]. Some further thoughts 
will be presented in a subsequent section on learned elicitation vs. learned 
channeling. 

Drive stimulation vs. reduction in threshold. A number of workers 
[96, 4, 99] have assumed that, instead of acting as strong stimuli, drives 
serve to reduce thresholds. One of the original points of departure was 
Richter’s [96] observation that adrenalectomized rats would prefer 
saline to tap water, even with low saline concentrations that elicited no 
differential response from normal control rats. This better discrimination 
of the rats with great salt need was interpreted as a reduction in the 
threshold for the taste of salt, produced by the specific need or drive. 
Subsequent work has demonstrated that the threshold of the sense organs, 
measured electrophysiologically, remains the same [94], and that, if nor- 
mal rats are properly motivated by electric shock, they can discriminate 
just as well as adrenalectomized ones [34]. Thus the original support is 
pulled out from under the threshold hypothesis. 

Meanwhile, however, Beach and Holz-Tucker [4] were led to the 
threshold hypothesis by the observation that increasing the androgen 
level in the blood of male rats increases the range of stimulus objects to 
which they will exhibit copulatory responses. As we have already pointed 
out in the earlier part of this paper, such an observation could just as 
well be explained by assuming that increases in drive raise the level of 
the entire gradient of stimulus generalization. 

It should be clear that going through the experimental operations to 
demonstrate a reduction in threshold of response to a stimulus does not 
rule out the possibility that this reduction was produced by the presence 
of some other source of stimulation. The traditional physiological and 
psychological experiments on the positive summation of stimuli 
demonstrate that the threshold of response to the test stimulus is reduced by 
the presence of the one with which it summates. Conversely, if one deals 
with the central rather than the peripheral effects of stimulation, it is 
obvious that a genuine reduction in threshold might cause a stimulus to 
function as though it were intensified. It is actually quite difficult to dis- 
criminate between these two hypotheses: drive as a strong stimulus, or 
drive as a state which reduces the threshold of peripheral end organs or 
central synapses. 

Sheffield and Campbell [99] and Campbell and Sheffield [18] have 
presented evidence purporting to support the threshold hypothesis by 
disproving the drive-stimulus one. They show that, if a rat is kept under 
relatively soundproof conditions, his rate of activity does not go up when 
he is hungry, but that the introduction of a novel stimulus will greatly 
increase the activity of the hungry rat and still more increase it if the 
novel stimulus is regularly associated with feeding. It is assumed that, if 
drive were a stimulus, it would have to produce similar increases in 
activity. Since such an increase was not observed, it is concluded that the 
function of the drive must have been solely to decrease the threshold of 
responsiveness to the novel stimulus, an assumption functionally similar 
to Hull’s [39] postulate that drive serves to multiply whatever habit 
strength is present. 

Although these experiments show that drives do not inevitably goad 
the animal into activity, they do not rule out the possibility that drives 
are strong stimuli. An electric shock is a strong stimulus. Whether an 
animal shows increased activity by jumping or decreased activity by 
crouching to an electric shock depends on whether the reward of turning 
off the shock is administered for jumping or for crouching. It will be 
remembered that my definition of drive says nothing about activity, but 
deals rather with increasing the performance of responses rewarded by 
the offset of the drive or by the goal objects that produce its satiation. 

In Fig. 1 of the article by Sheffield and Campbell [99], it will be 
noticed that, whereas the activity in response to a stimulus that 
immediately precedes daily feeding increases during a series of trials, the 
activity to a control stimulus that does not precede feeding progressively 
decreases. It seems logical to me to assume that the nonrewarded 
responses to the latter stimulus gradually extinguish and that the 
nonrewarded responses to the gradually mounting drive stimulus also 
extinguished in the context of the other constant cues in the 
Sheffield-Campbell apparatus. 

To state the converse side of the argument, suppose that at unpredict- 
able intervals, without providing any stimulation, the experimenters had 
introduced food alternately at opposite sides of their activity apparatus 
throughout the test session. Then the animals would have been reinforced 
for shuttling on a variable-interval schedule. We would expect their 
shuttling, which would be recorded as activity, to increase with increas- 
ing hunger just as does the rate of bar pressing [67]. In short, we can 
only predict that subjects will be active to a pattern of stimulation in- 
cluding strong drive when activity has been reinforced to that pattern. 
The situation used by Sheffield and Campbell [99] involved long periods 
during which activity was not reinforced to the pattern of drive stimulus 
plus apparatus without the sound of the food delivery mechanism. 


The Nature of Reinforcement 


It seems to be a fact that some stimulus situations are much more 
effective than others in determining whether or not the responses that 
lead to them will be learned and performed. Thus, all theorists are forced 
to assign the empirical law of effect some role in their theories. A con- 
sistent application of the empirical law of effect would result in a long 
list of stimulus situations that can serve as rewards and of those that are 
relatively neutral. In practice, most experimenters limit themselves to 
relatively few situations which are known to function as effective rewards. 

The drive-stimulus-reduction hypothesis. Since a catalogue of re- 
wards is so cumbersome, some theorists have looked for a simple general 
principle, or principles, that will allow them to determine whether an 
event should be listed as a reward or not. One attempt at such a prin- 
ciple is the drive-reduction hypothesis of reinforcement. 

It is well known that turning off a strong motivational stimulus, such 
as an electric shock, will serve as a reward to reinforce whatever response 
the animal was making just before he escaped from the shock. It is also 
known that food will serve as a reward for a hungry animal, and that 
this same food, if given in sufficient quantities, will reduce the strength 
of the hunger drive. The drive-reduction hypothesis attempts to abstract 
a common element from observations of this kind. In its weak form, it 
states that the sudden reduction in the strength of any strong motivational
stimulus always serves as a reward, or in other words, is a sufficient
condition for reinforcement. In its strong form, it states that all reward 
is produced in this way, or in other words, that drive reduction is not 
only a sufficient but also the necessary condition for reinforcement. 

By defining drives as strong stimuli, I have sharpened the hypothesis 
into the assumption that it is the sudden reduction in the strength of 
intense stimulation that serves as a reinforcement. 

Not completely committed to drive-stimulus-reduction hypothesis. 
Although I believe that the foregoing hypothesis has a considerably less 
than 50 per cent chance of being correct, especially in its strong form, I 
do believe it is better at the present moment than any other single hy- 
pothesis. Therefore, I feel that it is worthwhile to try out applying it 
consistently, if only to highlight the obstacles and infuriate others into
devising superior hypotheses and the experimental programs to support 
them.

When one systematically explores a given hypothesis and points out 
the weaknesses of various theoretical and experimental attacks on this 
hypothesis, however, it is difficult to avoid the reputation (and after that, 
the fact) of being emotionally fixated on the hypothesis. Furthermore, 
the very controversial nature of the drive-stimulus-reduction hypothesis 
makes it conspicuous, so that it seems to be the cornerstone of one’s whole 
theoretical thinking. 

Let me try to destroy both illusions. The stimulus-reduction hypoth- 
esis of reinforcement could be discarded without having an appreciable 
effect on the rest of my theoretical formulations. I take this occasion to 
urge attempts to formulate and rigorously test competing hypotheses, and 
time permitting, may even join in that activity myself. However unsatis- 
factory, the drive-reduction hypothesis is not likely to be abandoned as 
long as it is the best thing of its kind that we have. The decisive way to 
kill it is with a superior alternative. 

Various possible bases for correlation between reinforcement and 
satiation. Let us start out with the original observation that there seems 
to be a correlation between the operations of satiation and those of re- 
inforcement. This is not surprising because animals that are not rewarded 
by substances that reduce their drives are likely to come from a long 
line of extinct ancestors. Furthermore, the process could also work the 
other way—individuals that do not eventually become satiated on a given
form of pleasure may starve to death while enjoying the thrill of tickling 
themselves. 

We should remember that a correlation does not necessarily indicate 
a direct causal relationship. The drive reduction could produce the re- 
inforcement, the reinforcement could produce the drive reduction, or 
natural selection could have built a correlation into the organism that did 
not involve any direct causal relationship. For example, an increase in 
pleasure could be the true basis for reinforcement and animals could have 
evolved by natural selection so that in general only nutritious substances 
taste good and the pleasantness of these is some function of the strength 
of hunger. 

Sample studies illustrating fruitfulness of hypothesis. If natural se- 
lection has produced a spurious correlation, we would expect this cor- 
relation to be more likely to disappear if we manipulated drives in un- 
usual ways which have not been encountered in natural selection. On the 
other hand, if it is a true causal relationship, it should survive these un- 
usual manipulations. 

In order to test the drive-reduction hypothesis in this way, and also 
to learn more about the mechanism of drives, I have been involved in a
series of experiments which have tried to induce and reduce drives in 
unusual ways [73, 7, 78]. These experiments have shown that food in- 
jected via a plastic fistula directly into the stomach of a rat promptly 
reduces hunger as measured either by the amount of food subsequently 
eaten or by the rate at which the animals will work at pressing a bar to 
secure food on a variable-interval schedule of reinforcement. But food 
taken normally by mouth has a greater effect on both of these two
measures. Similar results have been secured for thirst, which has also been
induced by injecting hypertonic saline into the ventricles of the brain 
and reduced by injecting water. In short, there seem to be a number 
of mechanisms—located in the mouth, stomach, and brain—for regulat- 
ing drive. 

Furthermore, exactly as the drive-reduction hypothesis would predict 
from the foregoing facts, food injected directly into the stomach will 
serve as a reward to cause rats to choose the correct side of a T maze, but 
food taken normally by mouth will serve as a stronger reward to produce 
more rapid learning. The rewarding effect of food injected directly into 
the stomach would not be expected from hypotheses that used either the 
pleasant taste or the consummatory response as the sole basis of rein- 
forcement. Conversely, the saccharine solution which Sheffield and Roby 
[100] proved was a reward for hungry animals has been found by us 
to reduce hunger [67]. 

Effects of electrical stimulation of the brain. Another series of studies 
has demonstrated the possibilities of motivating learning and performance 
by direct electrical stimulation of the brain [83, 23, 70]. In connection 
with these studies, I have discussed [71] the implications for theories of 
reinforcement of the reward effect discovered by Olds and Milner [92] 
and have shown how the drive-stimulus-reduction hypothesis can suggest 
significant new studies of central motivation and reward—studies which 
ultimately may lead to a superior hypothesis. A summary of my recent 
physiological studies of motivation has been published elsewhere [73]. 

Campbell’s psychophysical studies. The hypothesis that drives are 
strong stimuli and that sudden reductions in them produce reinforcement, 
has further demonstrated its usefulness by starting Byron Campbell on a 
series of studies which extend the psychophysical tradition to a new area 
by quantitatively studying the rewarding effects of sudden reductions in 
the strength of strong stimuli. Electric shocks and loud sounds have been 
used in these studies which show that, when the stimulation is reduced 
to zero, stronger initial strengths of stimulation produce greater learning 
and performance. But when the motivation is not reduced to zero, a 
given absolute amount of reduction is less effective the higher the initial 
level. Within the middle ranges, equal reinforcing effects seem to be
produced by equal ratios of reduction, a relationship like Weber’s law, which
breaks down at the extremes. Thus, what might be described as a “rein- 
forcement threshold” seems to show the same general type of curve as 
does the psychophysical threshold for a just noticeable difference, al- 
though much larger changes are required to produce a reinforcing effect 
[17, 15, 16].
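
The ratio relationship just described can be made concrete with a small numerical sketch. It is an illustration under stated assumptions, not Campbell's fitted model: the function name, the arbitrary units, and the starting levels are all hypothetical, and the sketch assumes that within the middle ranges the reinforcing effect depends only on the proportion of the stimulus that is removed. It does not attempt to represent the breakdown at the extremes or the separate finding for stimulation reduced to zero.

```python
def proportional_reduction(initial: float, final: float) -> float:
    """Return the fraction of the initial stimulus strength that is removed."""
    if initial <= 0:
        raise ValueError("initial strength must be positive")
    if not 0 <= final <= initial:
        raise ValueError("final strength must lie between 0 and initial")
    return (initial - final) / initial


# Hypothetical starting strengths in arbitrary units (not data from the studies).
LEVELS = (20.0, 40.0, 80.0)

# The same absolute reduction (10 units) is a smaller ratio at higher levels.
fixed_absolute = [proportional_reduction(s, s - 10.0) for s in LEVELS]

# Halving the stimulus gives the same ratio at every starting level.
fixed_ratio = [proportional_reduction(s, s / 2.0) for s in LEVELS]

for s, a, r in zip(LEVELS, fixed_absolute, fixed_ratio):
    print(f"initial={s:5.1f}  minus 10 units: {a:.3f}  halved: {r:.3f}")
```

On this assumption a fixed absolute reduction shrinks as a ratio when the initial level rises, which is one way to read the finding that a given absolute amount of reduction is less effective the higher the initial level.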

Relationship of drive to learning. A stimulus-response analysis can 
clarify some of the apparent confusion about the relationship of drive 
to learning [66]. In the practical teaching situation, everyone knows that 
poorly motivated students do not learn. This is chiefly because they do 
not expose themselves to the proper cues or perform the correct responses, 
for example, open their books and study. But most experiments on this 
topic have concerned themselves with a somewhat different problem: the 
relationship of drive to reinforcement. Therefore, they have tried to con- 
trol the factors of exposure to cues and number of responses by giving 
both groups the same number of trials in very simple situations. One
must be careful in generalizing from this kind of experiment to the prac- 
tical situation; a film of mine [77] yields a better analogy. 

When the drive is promptly reduced to zero after the correct response, 
there is a perfect correlation between the strength of drive and the 
amount of drive reduction. Thus, according to the stimulus-reduction 
hypothesis, stronger drives should produce more reinforcement and better 
learning. These conditions are met in most experiments on escape learn- 
ing and eyelid conditioning. The results of such experiments agree with 
the theory.* 

In the work of Campbell and his collaborators, we have just seen 
that when the drive stimulus is reduced a constant absolute value, the 
amount of reinforcement follows the same Weber's law principle as other 
stimuli; it is less at the higher levels of drive. Under these circumstances, 
the intense-stimulus-reduction hypothesis would not expect a positive 
relationship between the strength of drive and amount of learning. 

In experiments in which the correct response is reinforced by pellets 
of food or sips of water, it is obvious that, at the extreme low end of the 
continuum, there will be little possibility of drive reduction if there is 
almost no drive present. And indeed, experiments on so-called latent 
learning seem to agree that extremely low drives (whether all relevant
primary and secondary drives are zero is controversial) seem to produce 
less learning than does higher drive. 

*It should be noted, however, that all such experiments have involved noxious
stimuli so that one might claim that the positive results were produced by a
learned drive, such as fear, which persisted from training to testing.

When we start comparing moderate drives with strong ones, however,
we simply do not know what the relationship is between the strength of
drive and the relative amount of drive reduction produced by a single
pellet of food or sip of water. Therefore, it should not be too surprising 
that the results of such experiments are confusing. Even if the animal 
were fed to satiation at the end of a trial, the total amount of drive re- 
duction would be confounded with considerable delay. Perhaps tech- 
niques for manipulating hunger and thirst by electrical stimulation of the 
brain (or measuring them by central recording) will develop to the point 
where we can conduct decisive experiments on such drives. 

Spence [103] has clearly pointed out and documented an additional 
source of possible confusion, namely, that whenever an incorrect response 
is dominant at the beginning of learning, Hull’s [39] principle that drive 
multiplies the strength of habits predicts that increases in drive will in- 
crease its excitatory potential more than that of the subdominant correct 
responses. To this and other excellent points that he makes, I would add
only that one must consider (1) possible innate responses to different 
strengths of drive, and (2) the possibility that rates and amplitudes are 
specific responses as would be expected from a micromolar analysis of the 
type expounded by Logan [49]. 
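
Spence's point follows directly from the multiplicative principle and can be checked with simple arithmetic. The sketch below is only an illustration: it assumes the bare product rule (excitatory potential equals drive times habit strength) with no other terms, and the habit strengths and drive levels are hypothetical numbers chosen so that the incorrect response starts out dominant.

```python
def excitatory_potential(drive: float, habit_strength: float) -> float:
    """Multiplicative rule assumed here: potential = drive x habit strength."""
    return drive * habit_strength


# Hypothetical habit strengths at the beginning of learning.
H_INCORRECT = 0.6  # dominant incorrect response
H_CORRECT = 0.4    # subdominant correct response


def advantage_of_incorrect(drive: float) -> float:
    """Difference in potential favoring the incorrect response at a given drive."""
    return (excitatory_potential(drive, H_INCORRECT)
            - excitatory_potential(drive, H_CORRECT))


# Hypothetical drive levels in arbitrary units.
DRIVES = (1.0, 2.0, 4.0)
advantages = [advantage_of_incorrect(d) for d in DRIVES]

for d, adv in zip(DRIVES, advantages):
    print(f"drive={d:.1f}  advantage of incorrect response={adv:.2f}")
```

Because both habits are multiplied by the same drive, the absolute gap between them grows in proportion to drive, so under this rule a stronger drive initially favors the dominant incorrect response.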

Large areas in which lack of suitable measures makes hypothesis 
inapplicable. In experiments using extremely strong stimuli, such as 
electric shocks, intense lights, or sounds that are under external control, 
the stimulus-reduction hypothesis clearly applies and is routinely verified. 
But perhaps the most damning thing that can be said about this hypoth- 
esis is that there are so many situations in which, with our present tech- 
niques, we cannot tell for certain whether the net effect of a given ex- 
perimental operation is to produce an increase or a decrease in the moti- 
vating stimulus. Although such limitations are not unique to this hypoth- 
esis, a clear-cut possibility of disproof would be preferable to the present 
ambiguity. For example, if the change from complete darkness to weak 
illumination is reinforcing, it can be argued that the small increase in 
visual stimulation is more than overbalanced by a reduction in boredom 
or fear. Similarly, it can be argued either that tiny pellets of food produce 
brief reductions in drive, or that they do not appreciably reduce hunger 
and may even whet the animal’s appetite. Guthrie [31] can argue that a 
boy banging on a drum is seeking stimulation and Harlow [32] can say 
the same for the curiosity of his monkeys. On the other hand, a plausible 
case can be made that both boy and monkey are escaping from the in- 
tolerable tension of boredom. Such ambiguities have motivated me to the 
program of research aimed at trying to learn more about the mechanisms 
of drive induction and reduction. Perhaps there are physiological ways of 
securing independent measures of at least certain drives, or behavioral 
techniques for trying to extend the correlation between satiation and re- 
inforcement to other drives such as curiosity [73, 91]. 

Alternative hypotheses. Guthrie’s [31] version of the pure contiguity
hypothesis is enormously appealing because of its simplicity and because 
it would be relatively easy to imagine a physiological mechanism that 
worked in this way. I have been unable to see, however, why it should 
not predict that a thoroughly learned response to a momentary stimulus 
will never extinguish. After a given response is learned to the point where 
it regularly is the first and last response to that stimulus, should not more 
and more of the atypical members of the population of cues be condi- 
tioned to the response so that it would become progressively more certain 
to occur? 

There is also the problem of why the stimulus-change produced by 
the onset of a painful stimulus does not protect preceding responses from 
retroactive inhibition. One might modify Guthrie’s hypothesis to avoid 
these difficulties by assuming that only the termination of a stimulus (and 
not changes produced by adding new cues) functions to protect responses 
from retroactive inhibition, and that such protection is proportional to 
the number of cue elements that are removed. Then, the effect of ter- 
minating a weak stimulus would be negligible because the change in the 
total population of cue elements would be so small, but the termination 
of a strong stimulus would be more effective because many more cue 
elements would be removed. With such modifications, Guthrie’s applica- 
tion of the contiguity hypothesis would become functionally very similar 
to the hypothesis that reinforcement is produced by the prompt reduction 
in a strong stimulus. 
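
The proposed modification can be sketched numerically. Everything in the block below is a hypothetical illustration rather than part of Guthrie's formulation: the proportionality constant, the size of the background population of cue elements, and the number of elements assigned to a weak and to a strong stimulus are assumed values chosen only to show the contrast.

```python
def protection(elements_removed: int, k: float = 1.0) -> float:
    """Protection from retroactive inhibition, assumed proportional to
    the number of cue elements removed when a stimulus terminates."""
    if elements_removed < 0:
        raise ValueError("elements_removed must be non-negative")
    return k * elements_removed


def fraction_of_population_changed(elements_removed: int,
                                   background_elements: int) -> float:
    """Share of the total cue population that disappears at termination."""
    total = background_elements + elements_removed
    if total <= 0:
        raise ValueError("total cue population must be positive")
    return elements_removed / total


# Hypothetical counts of cue elements.
BACKGROUND = 1000  # constant cues supplied by the apparatus
WEAK = 10          # elements contributed by a weak stimulus
STRONG = 1000      # elements contributed by a strong stimulus

weak_change = fraction_of_population_changed(WEAK, BACKGROUND)
strong_change = fraction_of_population_changed(STRONG, BACKGROUND)

print(f"weak:   protection={protection(WEAK):7.1f}  change={weak_change:.3f}")
print(f"strong: protection={protection(STRONG):7.1f}  change={strong_change:.3f}")
```

With these assumed counts, terminating the weak stimulus alters about one per cent of the cue population while terminating the strong one alters half of it, which is the sense in which the modified contiguity account comes to resemble the strong-stimulus-reduction hypothesis.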

Various two-factor theories have considerable attractiveness. It is 
easy to imagine a simple contiguity mechanism being evolved at an early 
stage and then supplemented with some sort of booster effect from re- 
ward or the escape from punishment. Indeed, it is quite possible that 
careful research on primitive organisms, or on more complex ones primi- 
tivized by removing higher segments of the brain, would show that it is 
possible for them to learn several responses by simple conditioning but 
not to select among them by trial and error, even when the proper condi- 
tions for trial and error learning are realized. 

Actually, the shift from a single factor to a multifactor theory need 
not be very disruptive to the general features of such miniature systems 
as I have proposed as long as most of the laws—gradient of reinforce- 
ment, stimulus generalization, experimental extinction, spontaneous re- 
covery, etc.—remain the same [25, p. 42]. These laws seem to be the 
same for classically conditioned autonomic responses as well as for trial- 
and-error somatic responses reinforced by either escape from electric 
shock or rewarding a hungry animal with food. 

To use a simple analogy, to change the lighting in an apartment 
house from electricity to gas, would be a major undertaking because the 
two sources of energy follow completely different laws, and thus would
require changes such as those from wires to pipes. A change from a-c to 
d-c current would involve relatively minor alterations, since these two 
types of current have relatively similar functional properties. A shift from 
alternating current delivered by a private company to alternating current 
delivered by a municipal one would involve only a small change in the 
connections to the basement (which, indeed, would be highly significant 
to those who sell the current from different sources), but there would be 
no changes in the wiring and appliances throughout the apartment house 
since the current from these two sources follows identical laws. 

Proponents of multiple-factor theories of learning can attract more 
serious attention if they can rigorously specify different laws associated 
with supposedly different types of learning. 

Cognitive theories. S-R theorists are confronted with the problem of 
explaining man’s obviously intelligent behavior; cognitive theorists are 
confronted with the problem of explaining obvious stupidity. Although 
recognizing that much of man’s behavior involves cognitions, I have 
preferred the strategy of trying to explain such behavior as the outgrowth 
of simpler, noncognitive mechanisms. It is hard for me to conceive of 
cognitive insight as the sole means of acquiring maladaptive neurotic 
symptoms or those many motor skills that seem to be almost entirely 
unconscious. 

One obvious alternative taken by many cognitive theorists is to 
assume different levels of learning—a lower S-R one to explain stupid 
behavior and a higher cognitive one to explain intelligent behavior. This 
may turn out to be the most parsimonious explanation possible, but the 
theorist will have to be careful to specify under what conditions both 
occur or else he will be limited to post hoc explanations. Another alter- 
native is to adopt the general pattern of Tolman’s theory—using as- 
sociation as the basic element of learning, and the empirical law of effect 
as the mechanism of activation—but stating the laws in terms of auto- 
matic processes. Spence’s [103] theory is tending in this direction. Varia- 
tions of this general pattern certainly should be rigorously explored. But 
one must be careful that the theory does not predict that, if rats or 
earthworms show any learning with motivation and reward, they should 
show approximately as good latent learning. 


New Trails for Research on Motivation*


*Throughout this section I frequently use the term motivation to include the
effects of both drive and reward, using the latter terms if I want to distinguish the
two effects.

We are still a long way from a satisfactory understanding of motivation.
To take a simple example, much of our human social behavior is
overdetermined by many different sources of drive and reinforcement.
Thus, money is the focus of many needs; its possession the means to 
many rewards. But our experimental literature has scarcely begun [95] 
to tackle the problems of how different sources of drive and reward 
summate positively or negatively in the learning and maintenance of a 
given habit. 

Learned elicitation vs. learned channeling. Most of the motivation of 
adult human social behavior is either acquired by or profoundly modified 
by learning. I have called such motivations learnable drives and rewards 
to emphasize the fact that the basic physiological mechanism is probably 
innate, although the arousal or reduction in response to specific cues is a
product of learning [64]. In the case of fear, it seems possible for part of 
the response innately elicited by pain (loud sounds and perhaps a con- 
siderable variety of other situations) to be conditioned to new cues so 
that it can be elicited in the complete absence of any obvious, relevant 
primary drive. Let us call this learned elicitation. 

I have assumed that the elicitation of a number of social drives 
might be learned in the same way as fear, for example, as a result of 
thousands of instances in which feeding and other sources of infant 
gratification are associated with the appearance of an approving parent. 
However, I have repeatedly failed to establish experimentally any ap- 
preciable learned elicitation of drive on the basis of primary drives such 
as hunger or thirst [cf. 91].*

Although this failure has not caused me to abandon completely the 
attempt to apply the fear-elicitation paradigm to appetitive drives, it has 
caused me to shift the emphasis in a new direction. As pointed out earlier 
[64], even the primary drives themselves may be profoundly modified 
by learning, so that hunger becomes the desire for a particular kind of 
food prepared according to culturally determined standards of what is 
appetizing. Thus, one must learn to like snails or oysters. But the elicita- 
tion of the appetite for such foods never becomes completely independent 
of hunger, since even these delicacies have little appeal after complete 
satiation by a Thanksgiving dinner. Furthermore, although the taste for
snails and other exotic foods is learned, it is continually reinforced by 
eating them when hungry. Therefore, one does not need to worry about 
the problem of experimental extinction vs. functional autonomy. 


*It is conceivable that this failure is due to the fact that these drives are
aroused so slowly that one is always dealing with delayed, or even backward, condi- 
tioning. That is one of the reasons why I have been interested in exploring possi- 
bilities of eliciting drives rapidly by techniques such as electrical or chemical 
stimulation of the brain [73]. On the other hand, it may be that such failures are 
due to the absence of any mechanism for a response-produced drive like the one 
involved in fear. 

It should be noted, however, that hunger probably can be channeled
only to certain categories of objects, namely, foods. I venture that it 
would be much more difficult to develop and maintain an appetite for 
sawdust. Are there other drives that permit more latitude in channeling?
What are the defining characteristics of their potential goal objects? 

The desire for a specific kind of food may be called learned channel- 
ing of drive. It may be contrasted with the fear of a previously neutral 
cue which we have just described as learned elicitation of drive. 

The foregoing two examples—channeling of hunger and elicitation 
of fear—may not be completely different, but represent the extremes of 
a continuum. An intermediate case could occur if there is a latent fear 
of some type of situation which can be further strengthened by learning. 
Or there might be an innate tendency to fear some general class of 
situations which can be made more specific by learning. Furthermore, we 
might find that the reaction to a conditioned fear stimulus would vary 
with the general level of anxiety of the subject and be influenced by
physiological factors such as endocrine changes. 

Can rewards for one drive channel a different one? Another pos- 
sibility is that reinforcement by the goal objects of a given drive may help 
to determine the direction in which a second one is channeled.* If such
channeling occurs, it might easily be mistaken for the establishment of 
learned elicitation, and/or secondary reinforcement that became in- 
dependent of the first drive. 

A possible illustration of such a channeled drive is the Freudian 
hypothesis that sex begins as a dependent drive (Anlehnungstriebe), so 
that a young man’s eventual object choice may be determined by earlier 
rewards in the nursing situation under the primary drive of hunger. If 
this were the case, it is clear that the young man’s attraction to his bride 
might continue to be reinforced by adult sexual (and other) gratifica- 
tions even though he discovered that she could not cook. 

To take a simpler example, Scott [98] observed that two bottle- 
fed sheep showed no tendency whatsoever to follow the rest of the herd 
whereas several thousand normally nursed controls all showed strong 
gregarious tendencies. This observation suggests that gregariousness in 
sheep may be influenced by learning during the nursing situation. But 
some special innate susceptibility or additional source of motivation must 
be involved since not all mammals are as gregarious as sheep. 

*I use the more general phrase “reinforcement by goal objects” to emphasize
the fact that this hypothesis is not necessarily tied to the drive-stimulus-reduction
hypothesis of reinforcement.

Similarly, a variety of obvious rewards, such as feeding, determine
a dog’s attachment to a particular master or family. This attachment
may be strong enough to motivate the dog to learn to respond to various
cues by being ready to leap into the car so that he will not be left
behind. But equally well-treated cats usually do not acquire a similar
motivation. Such observations raise the possibility that, as part of the 
pattern of hunting in packs, evolution has produced in dogs some special 
motivational mechanism that can be channeled either to the pack or to 
the adoptive human family. 

The “imprinting” of ducks to follow people and other objects may 
be an extreme example of channeling. We have already suggested that 
this “motivation to follow” probably could be used as a drive for the 
trial-and-error learning of whatever response produced the release per- 
mitting the baby duck to follow its leader [64]. But the original behavior 
of the newly hatched duck in the absence of any suitable moving objects 
to follow suggests some strong original motivation, perhaps fear. Could 
this motivation also be studied? Would its reduction by a suitable object 
prove to be the source of reinforcement for the original imprinting? What 
role is played by feeding and other reinforcements which often are con- 
founded with “imprinting”? 

It is possible that the type of behavior which we have described 
as channeling is nothing more than an array of specific habits. Certainly 
the line between (or blending of) habit and motivation needs clarifica- 
tion as Brown [13] has ably argued. 

What I would like to point out is that such habits seem to have 
many of the functional properties of drives. Oysters for people, or novel 
foods for rats, do not function as effective rewards until the subject has 
learned to eat them. How much of this is a function of mere delay in 
seizing and swallowing? I venture that at least some of the effect may 
be due to the elimination of aversions by counterconditioning and the 
acquisition of secondary reinforcement. 

Furthermore, the aroma, sight, or even description of a particular 
delicacy seems to produce a considerable increase in the motivation of a 
moderately hungry person. I venture that, under appropriate circum- 
stances, it would also facilitate the trial-and-error learning and perform- 
ance of any response that circumvented a block to the direct approach 
to the incentive. Furthermore, Tinklepaugh’s [104] observations on 
monkeys and my own incidental observations of people suggest that, 
once a learned strong anticipatory goal response for a particular food is 
aroused, other foods may be less effective as rewards. Again the factor
of delay in consumption needs to be controlled.

Finally, we need to know the laws governing the ways in which 
motivation can be channeled and/or elicited by new cues. For example, 
what conditions can inhibit the development of intellectual curiosity in 
children and what conditions can enhance it? For that matter, what
conditions can arrest, strengthen, or channel the curiosity of a cat? 

Throughout the preceding discussion we have raised a number of
theoretical possibilities. Although the animal examples cited should readily 
lend themselves to precise experimental analysis, such work has not 
been done, so that our discussion has had to be highly speculative; many 
different theoretical possibilities are open. 

What drives may be elicited or channeled by learning? Since the 
conditions of social and individual learning are complex and variable, 
we will expect learnable drives to exist in a baffling variety of com- 
binations in response to a baffling variety of cues. Thus, any standard 
list of socially learned motives will only be as simple and as stable as the 
conditions of learning involved. This may explain the baffling variety 
of such lists. 

One might go on to ask, however, what kind of different physiological 
mechanisms may underlie learnable drives? It seems plausible that fear 
might be the underlying mechanism in a number of instances that seem 
different because different conditions of training have caused different 
categories of cues to arouse the fear, and other specific categories of cues 
to serve as the goals because they, respectively, reduce these specific fears. 
Thus, it is interesting to note that we speak of the fear of failure, the fear 
of disapproval, the fear of losing money, status, or love. 

Similarly, Freudian theory presupposes that sex underlies a large 
number of superficially different motives. 

Do certain social motives, such as curiosity, gregariousness, and 
jealousy, all have a common mechanism, or does each have distinctive 
ones? Can such motives be elicited, channeled, or both? If they are 
channeled, what are the characteristics defining potential goals? We have 
scarcely made a beginning toward answering some of these questions. If 
we knew more about some of these other sources of motivation, we 
might not be so strongly tempted to place such a heavy theoretical 
burden on fear. We must solve such problems before we can begin to 
understand the wonderfully complex web of human social motivation. 

Need to study new types of drive; one way to do it. Most of the 
basic research on motivation to date has been confined to an extremely 
limited number of so-called primary drives such as hunger, thirst, sex, 
and pain. There is a great need to study new types of motivation in the 
laboratory. Our present list of experimentally studied motives is far too 
short. A promising break away from our old limitations has been made 
with studies of fear and more recent studies of “curiosity” and “activity.” 
We need to extrapolate this new trend. 

A new view of fear. The old view of fear as an emotion was inferior 
in that it stressed innate and disrupting effects of fear and did not even 
give a clear picture of how these were achieved. The newer and much 
more powerful understanding is (a) that fear is a drive, like hunger or 
thirst, which may motivate either adaptive or maladaptive behavior, and 
(b) that a sudden reduction in the strength of fear serves as a reward 
to reinforce immediately preceding responses. 

The point of departure for this new understanding of fear was 
Mowrer’s [86] trenchant stimulus-response analysis of Freud’s [26] in- 
sightful paper on the problem of anxiety. Mowrer’s pioneering work led 
me to set up an experimental demonstration that fear has a crucial 
functional property of drive in that it can motivate the trial-and-error 
learning and performance of whatever response is followed by a sudden 
reduction in the strength of fear [58, 60]. I believe that this same experi- 
mental paradigm that I applied to fear can be used to study many other 
sources of motivation. 

In my experiment on fear, albino rats were placed in a simple 
apparatus consisting of two compartments separated by a door. One was 
white with a grid as a floor; the other was black without a grid. Before 
training, the animals showed no marked preference for either compart- 
ment. Then, they were given a number of trials during which they re- 
ceived electric shock in the white compartment and escaped into the 
black compartment through the open door. After this training, the 
animals would run out of the white compartment, even if no shock was 
on the grid. 

Then, during additional trials without further shocks came the 
critical test to demonstrate that the rats’ running was not the mere 
persistence of a habit, but also had the crucial property of a drive, 
namely that it could motivate, and its reduction reinforce, the learning 
of a new habit. The door, previously always open, was closed. The only 
way that it could be opened was by rotating, by a fraction of a turn, a 
little wheel which was above the door. Under these conditions, the rats 
exhibited trial-and-error behavior and gradually learned to escape from 
the white compartment by promptly rotating the wheel. 

When the conditions were changed, so that only pressing a bar would 
open the door, wheel turning extinguished and a second, new habit (bar 
pressing) was learned. Thus, the fear, presumably established by the 
shocks in the white compartment, was shown to have the same functional 
properties as a drive such as pain in that it could motivate the trial-and- 
error performance of response and its reduction could reinforce the im- 
mediately preceding response. 

This new view that fear is a drive and that fear-reduction is a re- 
inforcement has many consequences. These have been developed in the 
course of studies in which I have been involved—investigations of be- 
havior in combat [24, 110, 65], and an analysis of neurosis and psycho- 
therapy [25]. In going beyond the experimental studies of animals in 
vital fear situations, I have chosen to look for “naturally” occurring 
situations where people were confronted with vital dangers, rather than 
bringing subjects into the relatively safe laboratory and in effect merely 
saying “boo!” to them.² I shall have space to hint at only a few of the 
main points gleaned from such studies. Many of the ideas emerged from 
collaboration with John Dollard. 

As soon as fear is thought of as a drive, one notices that it can 
motivate desirable behavior, such as being alert and resourceful, driving 
slowly, looking for the source of danger and planning ways to minimize it 
—for example, buying insurance. Of course, fear can also motivate 
undesirable behavior such as cowardly running and hiding, cheating, and 
lying. The most important thing, then, is not how afraid a man is but 
what fear motivates him to do. 

Since a sudden reduction in the intensity of fear serves as a reward 
to strengthen responses, a frightened person will learn those responses he 
is making when his fear is reduced. If he eventually escapes from the fear 
by brave, adaptive behavior, he should learn to become more courageous 
and resourceful; if he temporarily reduces fear by cowardly and 
maladaptive behavior, he should learn to become more craven. If 
neurotic symptoms reduce fear, they will be learned. Other symptoms, 
such as ulcers, may be part of the direct physiological reaction to fear. 

One way of reducing fear is to turn away from, suppress, or repress 
the fear-arousing stimuli. When mild fears are reduced in this way, the 
subject learns to avoid unpleasant topics; when intense fears are involved, 
complete repression may be reinforced. It is obvious that avoiding look- 
ing at and thinking about possible sources of danger may actually in- 
crease the probability of disaster or decrease the possibility of making the 
discrimination that no significant danger exists. Furthermore, anyone 
who wants to use fear as a motivation must effectively emphasize the 
proper escapes from fear or run the danger of having his subjects find 
their own escape by avoiding him and forgetting his message. In using 
fear as a drive it is important to remember that the escape from fear is 
what reinforces learned behavior. 

Many additional practical points come out of the naturalistic study 
of fear, for example, the importance of knowing exactly what to expect 
or planning and knowing what to do to minimize the danger, of con- 
centrating on the task at hand, of breaking seemingly impossible tasks 
into manageable steps and concentrating on the successful performance 
of each step, the value of reassurance from the group or self-administered 
reassurance, the effectiveness of strong positive motivation and of rewards 
(counterconditioning) for nonfearful behavior. Additional discussion 
more oriented toward theory and experiment will be found in 
Miller [64]. 


2 My work in the Air Force during World War II convinced me of the difficulties 
of devising situations that would really frighten eager young aviation cadets 
without terrifying generals and congressmen. 

Curiosity. In the course of a fruitless search for a learned drive based 
on hunger, Myers and Miller [91] applied to curiosity the same apparatus 
and theoretical paradigm that had been used in the study of fear. 
Curiosity had previously been brought into the laboratory from somewhat 
different points of view by the ingenious work of Berlyne [8], 
Harlow [32], and Montgomery [85]. In addition to applying the same 
operations used in the study of fear, Myers and I suggested the im- 
portance of determining whether a new drive, such as curiosity, shows 
the same pattern of deprivation-reinforcement-satiation as previously 
studied drives such as hunger. In other words, is there a correlation be- 
tween the operations of reinforcement and satiation, so that an event 
which increases the subsequent performance of a response when ad- 
ministered immediately after that response is also found to tend to 
decrease the performance of responses motivated by that drive when it is 
administered in sufficient amount immediately before testing those re- 
sponses?** 

Finally, Kagan and Berkun [41] have used the same general para- 
digm for studying “activity drive” and Zimbardo and Miller [112] have 
used it as the point of departure for studying the effect of hunger on 
curiosity. 

Drive elicited by perceptual mechanisms? The general approach can 
be further illustrated by another experiment. In an attempt to bridge 
the gap between gestalt theory, perceptual learning, and S-R theory, I 
wondered whether the vector toward a good gestalt had the properties of 
a drive. If it did, one might expect animals exposed to a bad gestalt to 
learn by trial and error a simple response that would cause it to change 
to a better one. The “bad gestalt” of a series of broken lines was projected 
in front of pigeons. Pecking on a response key caused this to change to 
the “good gestalt” of unbroken lines. For a control group, the same response 
caused a pattern of straight lines to change to one of broken 
lines. 

To circumvent certain practical difficulties, we started with trained 
birds, tried to measure any reinforcing effect by a difference in resistance 
to extinction, and introduced a number of controls for possible stimulus 
generalization between the situations of training and of extinction. 


** It should be noted that one can imagine special conditions under which the 
drive-stimulus-reduction hypothesis would not demand quite the same pattern as 
hunger seems to exhibit. Suppose pressing a bar turned off a shock which was 
almost immediately turned back on. There would be a correlation between the 
operations of reinforcement and satiation, but the period of “satiation” might be 
so brief that it easily could be missed. Furthermore, longer periods of “depriva- 
tion” might not produce greater amounts of recovery from “satiation.” 
The difference, although in the right direction, was relatively small 
and unreliable. Perhaps a tendency toward a good gestalt cannot function 
as a drive, or perhaps we used a technique that elicited it only weakly. 
But the experiment illustrates how the method can be applied. 

With sufficiently ingenious apparatus, the same general method could 
be used to determine whether or not motivational effects are produced 
by sensory blurring, perceptual distortions, or the disruption of “phase 
sequences” that have been established by either perceptual experience or 
experimental training. 

This paper has already suggested a way of applying the same general 
paradigm to the experimental study of motivation possibly produced by 
conflict. 

Motivation for achievement. Observations of human and animal be- 
havior suggest many additional possible sources of motivation for experi- 
mental study. For example, a child that has just learned to turn over from 
its stomach to its back may cry until it is placed on its stomach so that it 
can practice this fascinating act again. Can trial-and-error learning be 
motivated by this “drive”? 

After the child has become skillful in turning over in both directions, 
he seems to become bored with this activity but strives persistently against 
many difficulties for other skills—to pull himself up, to stand, to walk, 
and eventually to ski. What is the drive and reward for each of these 
skills? Is there a separate drive for every maturing potential skill or is 
there some mechanism to produce in infants what might be loosely labeled 
as a general drive toward achievement? If so, what kind of mechanism 
could possibly produce this result? How much of the adult motive is 
attributable to rewards that channel such a drive and how much to some 
drive that is elicited by social punishments for failure to improve and 
rewards for improving? 

Conceivably, the blocking of almost any response tends to induce 
motivation and the partial or complete occurrence of the blocked re- 
sponse tends to relieve the motivation and function as a reward. This 
hypothesis might be checked by eliciting movements, such as circling, by 
central stimulation and determining whether such stimulation can 
motivate learning without any additional rewards, if, and only if, the 
movements are first blocked and then released. It is just barely con- 
ceivable that such a mechanism might be one step toward the develop- 
ment of a motivation that would have some of the functions of a drive 
to achieve. 

Certain ideas as motivation. Similarly, observation of infants strongly 
suggests that they have something like definite ideas and that these can 
serve as transient but strong motivations. I have observed a child under 
the age of nine months seem to get the idea that he wanted to get a 
certain tray off a low shelf, struggle vigorously this way and that until he 
finally succeeded, and then show a look of joy and triumph. 

In the adult it is obvious that, as the definite result of social training, 
certain thoughts, such as “I may have cancer,” can elicit strong motiva- 
tion. I have described such motives as mediated learned drives [64]. 
Could this mechanism have been involved in the infant, and if so, how, 
or was some more basic mechanism involved, and if so, what? 

Perhaps, as I have suggested in a discussion of goal-directed drives 
[64], the discrepancy between an anticipatory goal response (or certain 
other ideas) and the current state of affairs can be the basis for motiva- 
tion. Again, we need to know far more empirical facts aimed at a 
penetrating analysis of the details of such behavior and its possible 
mechanisms. 

Possible response characteristics in infant hunger. Let me conclude 
with one more concrete example. I have observed an infant during the 
first two weeks of life change within a few minutes from a state of 
relative quiescence to one of crying, reddening, and extreme activation 
so that my first reaction was to search for a jabbing pin. No such obvious 
source of activation was found, but when given a bottle, he drank 
vigorously and quieted down. If the milk in the bottle ran out, he be- 
came extremely activated, but by the time a new bottle could be heated, 
he sometimes had quieted down, was out of the mood, and refused it. 

The suddenness of these shifts contrasts with what might be expected 
from the gradual accumulation (or restoration) of a physiological deficit; 
it suggests that hunger may have some of the properties of a response, 
being inhibited until it finally breaks through and then strengthens 
and maintains itself by positive feedback. On the other hand, perhaps it 
is not the hunger itself, but only the anticipatory goal responses or overt 
responses to the hunger, that have this characteristic. 

In any event, it should be reasonably straightforward and profitable 
to devise objective techniques for recording the reactions of human or 
animal infants to hunger. Then one could systematically study the effects 
of a few simple controlled experimental operations of the type that fre- 
quently occur by chance in the natural environment. 

Additional problems. In conclusion, let me emphasize that the fore- 
going examples are but a small sample of the types of naturalistic ob- 
servations which point up the shortcomings of our current understanding 
of motivation. Each such observation could serve as the point of departure 
for a new line of research. I have made a number of suggestions, 
but these are meant more to point up the problems than to serve as 
definitive solutions to them. Many other problems remain to be raised. 
For example, some motivations in some people seem to be relatively 
central in that they cannot be changed without a profound effect on the 
entire personality; others are peripheral and can be changed with little 
effect. What are the principles and conditions that cause different motiva- 
tions to be central with different individuals? Why are some learned 
drives amazingly resistant to change, whereas others change easily? How 
are hierarchies of learned drives and rewards built up, and how do they 
interact? 


SOCIAL LEARNING, PERSONALITY, AND PSYCHOTHERAPY 


I have already pointed out that my collaboration with John Dollard 
in applying learning principles to social situations outside of the labora- 
tory has posed problems which helped to liberalize and extend the prin- 
ciples. One of the reasons for this is that, in dealing with social behavior, 
one cannot select simple cues and responses which allow one to ignore 
vexing problems. Another reason is that the concrete examples of real 
social behavior often involve new phenomena which would not be called 
to one’s attention by work restricted to the laboratory. The attempt to 
analyze these new phenomena in terms of the principles derived from the 
laboratory often suggests new kinds of experiments to perform. 

Adequate treatments of the attempts to apply learning and behavior 
theory to more complex social phenomena would be another major task 
requiring a separate paper; only a few of the main points can be indi- 
cated here. Fortunately, events have moved swiftly since the publication 
of our books on these topics so that many of their contributions are be- 
coming part of the Zeitgeist. 

The book on Social Learning and Imitation [76] clarified a relation- 
ship between psychology and various social sciences. In order to be able 
to predict any specific item of learned behavior, one must know not only 
the principles, but also the conditions of learning. To take a simple example, 
one could know everything about principles, such as the gradient 
of reward, but be completely unable to predict which way a rat would 
turn in a T maze unless one knew also the relevant conditions, namely, 
which arm of the maze contained the reward and which arm was wired 
for electric shock. With human social behavior, psychology provides the 
principles of learning and of innate drives, cues, rewards, and responses. 
The other social sciences, such as sociology and social anthropology, de- 
scribe the conditions of learning—or in other words, the location of the 
rewards, punishments, and other conditions of the social maze. One must 
know both the psychological principles and the social conditions in order 
to predict human behavior. 

Furthermore, the present conditions of learning are the product of 
prior conditions plus the principles of learning, so that a knowledge of 
psychological principles helps one to understand current conditions. 
This book also illustrates a process of analyzing a concrete example 
of social behavior, the imitative behavior of children in the family setting, 
and of designing experiments on animals and on children to test the 
theoretical analysis. Then, the analysis is applied to more complex forms 
of behavior, such as the prestige of models. Additional experiments are 
performed to test this further application of the principles. Finally, ap- 
plications are made to yet more complex phenomena such as crowd be- 
havior and copying in the diffusion of cultures. 

In the course of this work, it is necessary to emphasize learnable 
drives and rewards and the higher mental processes. 

We also encounter problems and data which suggest new hypotheses. 
I shall mention only two, related to the learning of language. One of 
these is that the early vocal behavior of the infant is in part generalized 
from his crying behavior, which is instrumental in bringing rewarding 
help from the parents. The other supplementary (not alternative) hy- 
pothesis is that, through manifold association with rewards of loving care, 
the phonemes of the parents’ language acquire secondary reinforcement 
value for the infant, and that this learned reward value generalizes to the 
sounds which the child makes when he is babbling. This generalized sec- 
ondary reinforcement helps to reinforce his babbling behavior, to shape 
it into patterns that are more likely to approximate words in his lan- 
guage, and to reward the imitation of sounds. We suggest that this hy- 
pothesis could be tested by studying the babbling behavior of children 
after an attempt has been made to give certain phonemes special learned 
reward value by associating them with rewarding care of the child, 
while different phonemes have been presented equally frequently, but 
in a neutral or aversive situation. Other hypotheses about the learning 
and functions of language are also presented [76, pp. 80-84]. 

In separate papers the same type of learning-theory analysis in terms 
of drive, cue, response, and reward has been applied to advertising [62] 
and to the application of newer training methods, such as graphic com- 
munication, to the problems of education [74]. The latter publication 
contains a special section on Scientific Principles for Maximum Learning 
from Motion Pictures, which deals with a considerable variety of the- 
oretical and experimental problems such as hypotheses about changing 
motivations and attitudes, how to secure maximum abstraction and gen- 
eralization to new situations, how to teach students to discriminate and 
perceive better, and the superiority of logical over rote learning. 

The second book, Personality and Psychotherapy [25], presents a closer 
integration of three great scientific traditions. One of these is psycho- 
analysis, initiated by the genius of Freud and carried on by his many able 
students in the art of psychotherapy. Psychotherapy has much to contrib- 
ute because it represents a unique opportunity for the naturalistic obser- 
vation of people responding emotionally and using their higher mental 
processes while struggling with problems which are vitally serious to 
them. 

Another tradition stems from the work of Pavlov, Thorndike, Hull, 
and a host of other experimentalists who have applied the exactness of 
natural science method to the study of the principles of learning. Finally, 
modern social science is crucial because it describes the social conditions 
under which a human being learns. The ultimate goal toward which we 
aimed was to combine the vitality of psychoanalysis, the rigor of the 
natural science laboratory, and the facts of culture. A psychology of this 
kind eventually should occupy a fundamental position in the social 
sciences and humanities, making it unnecessary for each of them to in- 
vent its own special assumptions about human nature. 

One of our points of departure was the assumption that, if functional 
symptoms of neurosis are acquired, they must be acquired either by 
known laws of learning or by other laws that have not yet been studied 
in the laboratory, but should be investigated there. Similarly, we viewed 
psychotherapy as a process of emotional reeducation involving the un- 
learning of maladaptive responses and the learning of more adaptive 
ones. 

Starting from these premises, we attempted to show in detail how 
various symptoms are learned because they are reinforced by reduction 
in fear (guilt and other drives), just as the turning of a wheel by a rat 
is automatically reinforced by escape from the fear-provoking box in 
which he has received electric shocks. Our analysis was most clear-cut 
and convincing in the combat neuroses because the conditions of learn- 
ing are more recent and better known in such cases. This analysis showed 
why the acquisition of the symptom should be accompanied by belle 
indifférence, and its interruption should produce misery which frequently 
results in the learning of a substitute symptom, just as the rat learns to 
press the bar when the apparatus is changed so that turning the wheel 
can no longer be used to reduce his fear. 

Other symptoms may not be the direct products of learning but 
rather the physiological effects of chronic high drive produced by 
conflict. The present paper has already mentioned a few of the ap- 
plications of conflict theory to neurosis and psychotherapy. 

We also made a detailed analysis of repression which hinged on three 
main points: (1) that the higher mental processes involve cue-producing 
responses—many of which may be central; (2) that strong aversive 
drives, such as fear, can be attached to these response-produced cues; 
and (3) that the sudden reduction in drive when the person stops 
thinking these painful thoughts can reinforce the response responsible for 
stopping the thought. From this, it follows that there should be a 
tendency to suppress painful thoughts—for example, shy away from the 
topic of the H-bomb—and if the aversive motivation is strong enough, 
this automatic tendency should become overpowering and lead to com- 
plete repression. But since the repressed cue-producing responses are 
essential components of thought, reasoning, foresight, planning, and 
social control, it follows that those aspects of behavior involved in 
repression should become more childlike, primitive, and stupid. Similarly, 
the recovery of repression should meet resistance, but once the cue- 
producing responses are fully restored, behavior should become more 
adult, intelligent, and discriminating. This is the barest outline of the 
pattern of S-R reinforcement analysis which helps to explain psycho- 
analytic observations. 

Although the overcoming of repression is both a sign that significant 
reductions in anxiety have occurred and a means to more intelligent be- 
havior, we point out that it is a mistake to believe that the main goal 
of psychotherapy is to throw off all restraints. We attempt to show in 
detail exactly how intelligent, socially necessary suppressions and re- 
straints are substituted for blind, unrealistic, and frequently ineffective 
repressions and inhibitions. 

Further theoretical analysis shows the importance of experimental ex- 
tinction and counterconditioning during psychotherapy, and also the 
importance of a process whereby the patient learns to discriminate be- 
tween the conditions that had established unrealistic fears in his past and 
the altered conditions of his present circumstances. Thus we see why 
mere catharsis—as in an explosion of drunken rage—is not necessarily 
therapeutic. A chapter on drugs shows how we should expect them to 
help in therapy and what limitations we may expect them to have. 

By demonstrating in detail how the acquisition of a functional 
neurosis is a process of learning and how psychotherapy is a process of 
relearning, we show that these phenomena fall within the natural realm 
of the psychologist both as a research scientist and as an educator, or in 
this case, psychotherapist. 

Although we emphasize the role of learning, our formulation does 
not exclude the role of innate psychological mechanisms involving factors 
such as the innate strength of various drives, the ability to withstand 
various types of pain and stress, the sensitivity to different stimuli, the 
repertoire of innate responses which is available, the relative dominance 
of different responses, and native intelligence. The interactions among in- 
nate, learned, and social factors are further discussed in a separate 
paper [69]. 

We show that childhood experiences are important because of the 
helplessness of the child, but that neurosis can also be created in adult- 
hood under similar conditions of helplessness in combat. We attempt 
to analyze the conditions of child-rearing which are especially likely to 
produce neurotic conflict. In this realm, however, we would like to em- 
phasize that we do not yet have a “science” of child-rearing. We believe 
that, in the long run, the development of better scientific understanding 
of child development would be enormously more valuable than efforts at 
repairing the damage. 

Our interest is primarily not in abnormal behavior for its own sake, 
but rather as a type of behavior which, by exaggeration, illustrates proc- 
esses that are significant in normal behavior and the normal personality. 
We also believe that it would be highly profitable to try to use similar 
methods to study intensively the behavior of especially well-adjusted 
people. 

Finally, we have been impressed with the fact that the normal use of 
the mind in solving material problems is enormously aided by the cultural 
and scientific heritage of techniques, such as mathematics and physical 
sciences, which are taught in the schools. Thus, most school boys today 
can readily solve “material” problems which baffled the best minds of the 
ancient world. By contrast, we have learned much less about special tech- 
niques for solving social and emotional problems. We look forward to 
the day when knowledge in this area has advanced to the point where 
students may be taught, in the home and in the school, techniques for 
solving social and emotional problems which are as effective as those man 
has discovered for solving the problems of his physical environment. 


SOME BASIC METHODOLOGICAL QUESTIONS 


Experimental design required but seldom used to justify intervening 
variables. Figure 11 shows that it is simpler to represent the relationship 
between the effect of a single experimental manipulation on a single 
measure as a direct relationship between the independent and dependent 
variable. An intervening variable only complicates matters by forcing one 
to use two functional relationships (represented by the arrows) instead of 
one. 

[Figure 11 diagram: Hours of deprivation → A. Rate of bar pressing; Hours of deprivation → Thirst → A. Rate of bar pressing.] 

Fig. 11. When only two variables are involved, it is 
simpler to represent the direct relationship between 
them than to insert an intervening variable, such as 
thirst. 

If we multiply the number of experimental manipulations but use 
only one measure (or multiply the number of measures, but only use 
one experimental manipulation), the same comparison holds: inserting 
an intervening variable only complicates the interpretation. In this connection, 
it is interesting to note that one of the leading opponents of the 
use of intervening variables [101] has limited himself almost exclusively 
to a single measure, the rate of bar pressing. 

In Fig. 12 we can see that the break-even point is reached when we 
compare the effects of two different experimental operations on two dif- 
ferent empirical measures. In that case, either type of formulation in- 
volves the assumption of four functional relationships. 

Although it is obvious that an intervening variable can always be inserted 
into the middle of a single relationship, we shall see that the design 
illustrated in Fig. 12 gives us our first chance of proving that a single 
intervening variable cannot account for the data. It is this possibility of 
disproof that makes the formulation meaningful. Although many behavior 
theorists have used intervening variables, there are relatively few 
experiments which use the design required to test and justify such 
variables. 

[Figure 12 diagram; legible labels: Hours of deprivation; A. Rate of bar pressing; B. Volume of water drunk.] 

Fig. 12. In expressing all the relationships among two 
independent variables and two dependent variables, the 
use of an intervening variable, such as thirst, does not 
increase the number of functions that must be assumed 
and may have certain advantages. 

Figure 13 shows that using an intervening variable produces a real 
gain in efficiency when one is dealing with the effects of more than two 
experimental manipulations on more than two empirical measures. With
three of each, nine direct relationships are involved, but only six via the
intervening variable.
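
The counting argument behind Figs. 11 to 13 can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not part of the original analysis; the function names are hypothetical. It assumes that a direct formulation needs one function per manipulation-measure pair, while a formulation with a single intervening variable needs one function per manipulation plus one per measure.

```python
def direct_relationships(n_manipulations: int, n_measures: int) -> int:
    """Functions needed when every manipulation is related directly
    to every measure (one function per pair)."""
    return n_manipulations * n_measures


def relationships_via_intervening(n_manipulations: int, n_measures: int) -> int:
    """Functions needed when every manipulation feeds a single intervening
    variable, which in turn feeds every measure."""
    return n_manipulations + n_measures


if __name__ == "__main__":
    # (manipulations, measures): one of each, several manipulations with one
    # measure, the break-even case, and the three-by-three case.
    for m, n in [(1, 1), (3, 1), (2, 2), (3, 3)]:
        print(m, n, direct_relationships(m, n), relationships_via_intervening(m, n))
```

With one measure or one manipulation the intervening variable always costs one extra function; the two formulations tie at two of each; and the saving grows as both numbers increase beyond two.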

Looking at the intervening variable in Fig. 13, it is obvious that 
if a given number of hours of water deprivation is more effective than 
eating a given amount of dry food in producing thirst as measured by the 
rate of bar pressing, it should also be more effective in inducing thirst 
as measured by the volume of water drunk. This theoretically demanded 
relationship does not, however, necessarily have to follow empirically.
For example, it is conceivable that the deprivation would elicit less 
drinking than the dry food. If we got such discrepant results, we could 
not interpret them in terms of a single intervening variable. In other 
words, this type of design yields the possibility of disproving the 
hypothesis. 
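
The logic of this test can be written out as a small check on ordinal agreement. The sketch below is illustrative only: the function name and the numbers are hypothetical, and it assumes that every measure is an increasing function of the intervening variable. Under that assumption, any two treatments must be ranked in the same order by every measure; a single reversal is enough to reject the single-variable account.

```python
from itertools import combinations


def consistent_with_single_variable(scores):
    """Return True if every pair of treatments is ordered the same way
    by all measures.

    scores: {measure_name: {treatment_name: value}}
    Assumes each measure increases with the intervening variable.
    Ties on a measure are treated as uninformative.
    """
    measures = list(scores)
    treatments = list(scores[measures[0]])
    for a, b in combinations(treatments, 2):
        directions = set()
        for m in measures:
            diff = scores[m][a] - scores[m][b]
            if diff != 0:
                directions.add(diff > 0)
        if len(directions) > 1:  # measures disagree about which is larger
            return False
    return True


# Hypothetical data: deprivation exceeds dry food on both measures.
agreeing = {
    "bar_pressing": {"deprivation": 30, "dry_food": 20},
    "volume_drunk": {"deprivation": 12, "dry_food": 9},
}
# Hypothetical data: deprivation elicits less drinking than dry food.
discrepant = {
    "bar_pressing": {"deprivation": 30, "dry_food": 20},
    "volume_drunk": {"deprivation": 7, "dry_food": 9},
}

print(consistent_with_single_variable(agreeing))
print(consistent_with_single_variable(discrepant))
```

Because only the direction of each difference is used, the check does not depend on the measures having equal-interval scales.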

I have been interested in applying the appropriate type of experi- 
mental design to test whether some of the simplest of the situations in 
which we commonly assume intervening variables can actually be ac- 
counted for in terms of a single such variable. In many cases the different 


[Fig. 13 diagram, two panels. Independent variables: Hours of deprivation; Feeding dry food; Saline injection. Intervening variable (second panel): Thirst. Dependent variables: A. Rate of bar pressing; B. Volume of water drunk; C. Quinine required to stop drinking.]


Fig. 13. In summarizing all the relationships among
three independent and three dependent variables, the 
use of an intervening variable can, under appropriate 
circumstances, be considerably more efficient than 
stating each of the direct relationships. 


measures show the type of agreement that would be expected if they were 
all pure measures of the same intervening variable [67]. But in other 
cases there is disagreement [68, 73]. 

For example, Fig. 14 shows one type of tentative result which one 
of my students, Martin Choy [20], has secured. The experimental treat- 
ment was allowing hungry animals, which had water available but had 
been deprived of food for 23 hrs, to eat dry food without water for 15 
min and then giving them 5 cc of 13.2 per cent saline by stomach tube. 
It can be seen that this treatment reduced the rate of bar pressing rein- 
forced by water on a variable-interval schedule, but increased the amount 
of water drunk and the amount of quinine required to stop the rats from 
drinking. 

In this case, we might assume (1) that the combination of a stomach
full of dry food plus strong salt solution produces pain, (2) that bar 
pressing is an impure measure which can be affected by pain as well as 
by thirst, and (3) that the bar pressing is affected more by the distraction 
of pain than are the other two measures of thirst. As long as these
remain untested assumptions introduced merely to explain the anomalous
results, I shall remain dissatisfied. But it is possible to test these
assumptions by using some other agent, such as an electric shock, which
presumably produces pain. The assumptions will be confirmed if the
other agent produces a larger reduction in the rate of bar pressing than
in the other two measures of thirst. I plan to perform this experiment.
The theoretical assumptions are summarized diagrammatically in Fig. 15.

C = Mock injection
E = 15 min. dry food + 5 cc 13.2% saline

Fig. 14. How three measures of “thirst” are affected
by the experimental procedure of feeding dry food
without water to hungry rats and then administering
5 cc of 13.2 per cent saline via stomach tube. The
effect on bar pressing is opposite to that on the other
two measures. From Choy [20].

Fig. 15. Diagrammatic representation of how the discrepant
results in Fig. 14 might be explained in terms
of two intervening variables.

Impure measures vs. specific factors. The foregoing explanation
assumes that thirst is a unitary intervening variable but the measures 
are impure and therefore can be influenced by factors other than thirst. 
But it is also possible that thirst is not a single unitary variable. There 
could be a number of different centers for different subfunctions. For 
example, the center, or centers, primarily regulating volume drunk might 
be especially responsive to the mouth and throat factors that are 
eliminated when water is injected directly into the stomach. It is quite 
possible that the general trend toward encephalization as higher forms 
evolve could result in the regulation of “the same drive” by a number 
of centers which are at different levels in the brainstem, emphasize some- 
what different functions, and may not be completely integrated into a 
system that always functions as a unit. 

It can be seen that if behavior theorists start performing the types
of experiments that will really test their intervening variables, we are 
likely to find ourselves following along in the footsteps of the aptitude 
testers who long since have run into the problem of general vs. specific 
factors. Perhaps we shall eventually be using a form of matrix algebra, 
somewhat analogous to multiple-factor analysis, but adapted to the some- 
what different conditions of our experimental designs. In the meantime, 
my first inclination will be to try to purify the measures and experi- 
mentally isolate the variables. 

Implications for psychopharmacology and for central stimulation. 
One of the practical implications of this line of work is that in studying 
the effects of drugs on motivation (or other intervening variables) it will 
be desirable to use a variety of measures in order to avoid being misled 
by side-effects that are specific to the particular indicators used [68, 72]. 
The same papers that make this point with additional examples also dis- 
cuss the advantages and disadvantages of a number of techniques for 
measuring motivation and suggest some possible new techniques. 

Exactly the same argument holds in the study of unusual techniques, 
such as direct electrical or chemical stimulation of the brain, for 
manipulating motivations. It is necessary to use rigorous behavioral tests 
to determine, point by point, whether the reaction elicited by such 
techniques has all the functional properties of normal drive induction 
or reduction [70, 73]. 

Some problems of quantification. Other things equal, there are 
many advantages to a theory that is at a high level of quantification. One 
of the greatest difficulties of a qualitative theory is that it is often unable 
to handle situations in which two or more factors are working in opposite 
directions. In such cases, the outcome depends on the relative strength 
of the two factors and a qualitative theory often has no way of predicting 
which of the two factors will be stronger. By contrast, a quantitative 
theory often can deal with the absolute magnitudes with enough precision
to make a definite prediction. 

Furthermore the qualitative theory is limited to predictions that a 
given operation will produce an increase or a decrease. Since there is 
a 50 per cent chance of getting one of these effects for some irrelevant 
reason, a single confirmation is not impressive. By contrast, when a more 
quantitative theory can predict that light should be bent by exactly 3°, 
there are many more chances of disproof. Since exact agreement is un- 
likely to be produced by chance, it is extremely convincing. Quantitative 
discrepancies can quickly show up defects in the theory and pave the 
way toward correction. For these and other reasons, science has always 
striven for the high levels of quantification represented by cardinal scales. 

Strategy of using qualitative tests first. In spite of being painfully 
aware of the disadvantages of a qualitative theory limited to ordinal 
scales and predictions of “greater than,” I believe that there is some 
virtue in the strategy of putting one’s theoretical notions through qualita- 
tive tests first before plunging into laborious attempts to quantify them. 
There certainly is no virtue in the misleading trappings of pseudo- 
quantification. I do not maintain that my strategy is necessarily the best; 
perhaps a more direct approach to quantification is desirable. We will 
never know unless someone tries it, as indeed a number of excellent 
people are. 

My main emphasis has been on trying to be rigorous though qualita- 
tive. Remember the discussion of the relativity of the distinction between 
empirical independent variables, systematic independent variables, and 
intervening variables. There is no way of determining a priori the 
scientific usefulness of such variables. Therefore, I believe it is best to use 
the qualitative measures that are available to test the utility of one’s main 
variables—to determine the approximate functional laws, the generality 
of these laws, and the unitary nature of the variables—before investing 
too much time in trying to quantify them. We have already discussed 
the design required to test the unity of intervening variables and com- 
mented on the fact that such tests are seldom made. 

Effects of inequalities in scale units. Our present behavioral scales are 
ordinal ones; we have no assurance that the intervals in various parts 
of the scale are equal. A keen awareness of the effects that can be
produced when the units at one end of the scale are systematically
compressed generates a healthy caution.

A good example comes from Judson Brown’s [11] experiment 
which we have already described. He studied the effect of strength of 
hunger on the gradient of stimulus generalization. Hungry rats were 
trained to run down a short path to get food in front of an illuminated 
panel. Then they were tested either under the same illumination used in 
training, or a different amount of illumination. Such tests were given
either with a 46-hr or a 1-hr drive. Other factors were balanced. Two 
different measures were used: (a) the time taken to run from the start 
of the path to the middle, and (b) the force in grams with which the 
animals pulled against a harness which restrained them for 5 sec when 
they were halfway to food. 

The results are presented in Fig. 16. Brown presented his results in
terms of starting time (A); we are also presenting the curves for the 

Fig. 16. Illustration of the way in which different scales of measurement may affect
data. The gradients of stimulus generalization plotted in time scores appear to
diverge widely. When the same data are replotted as speed scores, the gradients
are approximately parallel, as they also are when measured by the strength-of-pull
technique. Data from Brown [11].


same data transformed to speed scores (B). The fact that the curves seem 
to slope in opposite directions is not significant since increased time or
decreased speed both represent weaker responses. But in another respect, 
the same data plotted in these two different scales seem to show quite 
different results. With the time scores, the two curves diverge markedly; 
the change in drive seems to have a much greater effect on tests with 
the different stimulus than on those with the original one. By contrast, 
with the speed scores the two curves are approximately parallel; the 
change in drive seems to have equal effects on the different and original 
stimuli. This striking difference is produced entirely by the fact that
there is a curvilinear relationship between the units in these two scales. 
Turning now to the strength of pull (C), it is obvious that these
scores show a pattern which is quite similar to that of the speed scores.
Because of this agreement, the fact that speed probably is more closely 
related to energy requirements than is time, and the fact that the speed 
scores are more normally distributed, I have long since changed from the 
use of time to speed scores. But it is quite possible that speed scores are 
far from ideal; they may distort relationships at both ends of the scale. 
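
The effect of the reciprocal relation between time and speed can be illustrated with a short sketch. The numbers below are hypothetical and are not Brown's data; the path length and the variable names are likewise assumptions made only for the illustration. The speed scores are chosen so that the change in drive has exactly the same effect on both stimuli, and the same scores are then re-expressed as times.

```python
DISTANCE = 100.0  # hypothetical path length, arbitrary units

# Hypothetical speed scores (distance units per second), chosen to be parallel.
speed_scores = {
    "high_drive": {"original": 10.0, "different": 5.0},
    "low_drive": {"original": 6.0, "different": 1.0},
}


def to_time(speed, distance=DISTANCE):
    """Convert a speed score to the equivalent time score."""
    return distance / speed


def drive_effect(scores, stimulus):
    """Difference between high-drive and low-drive scores for one stimulus."""
    return scores["high_drive"][stimulus] - scores["low_drive"][stimulus]


# The same observations expressed as time scores.
time_scores = {
    drive: {stimulus: to_time(v) for stimulus, v in row.items()}
    for drive, row in speed_scores.items()
}

for stimulus in ("original", "different"):
    print(stimulus,
          "speed effect:", drive_effect(speed_scores, stimulus),
          "time effect:", round(drive_effect(time_scores, stimulus), 2))
```

In speed units the drive effect is identical for the two stimuli, whereas in time units it is more than ten times as large for the different stimulus, although nothing about the observations themselves has changed.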

It is instructive to think of the upper and lower limits of various 
types of measures of performance. Frequency scores cannot go beyond 
zero or 100 per cent, which places a certain limit on probability models. 
There probably is an upper physiological limit for speed. Thus, a man 
chased by one vicious dog might be more frightened if ten more suddenly 
appeared, but not run any faster. At the low end, a scale of speed of 
running may also be extremely insensitive, but for another reason. The 
speed scores are so near to zero that multiplying the time by a factor of 
even 100 can only reduce them by a fraction of a unit. Similarly, 
amplitude scales reach the limit of movement of the member measured, and
force scales reach the limit of strength. As these latter scales approach
their limits, they probably tend to become progressively more compressed.
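
The insensitivity of speed scores near zero is a matter of simple arithmetic, as the following sketch suggests. The distance and the running times are hypothetical values chosen only for illustration.

```python
def speed_score(time_sec, distance=100.0):
    """Speed as distance divided by time; the distance is a hypothetical value."""
    return distance / time_sec


# Slow end of the scale: the time is multiplied by 100.
slow_before = speed_score(200.0)
slow_after = speed_score(200.0 * 100)

# Fast end of the scale: the time is merely doubled.
fast_before = speed_score(10.0)
fast_after = speed_score(20.0)

print("slow end, time x100:", slow_before - slow_after)
print("fast end, time x2:  ", fast_before - fast_after)
```

Under these assumed values a hundredfold increase in time at the slow end lowers the speed score by less than half a unit, while a mere doubling of time at the fast end lowers it by five units.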

No type of operation will assure equal-interval scale. One might think 
that certain types of scale-making operations will insure the equality of 
the units. But I see no way of deciding a priori that any given operation
is the correct one. For example, we might use a beam balance to deter- 
mine equal weights, reversing the position of the weights on the pan to 
check the balance. But if the equal weights were hydrogen atoms and 
we packed four of them closely enough together to form a helium atom, 
it would turn out to be simplest to assume that the atoms had lost 
something when they were packed this closely together so that four of 
them did not weigh four times as much as a single one. 

To use another example, if one did not control temperature, a steel 
ruler would become somewhat shorter as one moved from the equator 
toward the poles. It is only the fact that rulers do not expand and con- 
tract in much more capricious ways that makes them useful in estab- 
lishing equal-interval scales. If one were limited to an a priori logical 
analysis of the operations, one could never be sure of the relative 
constancy of the measuring rod which is essential to equal-interval scales 
of distance. 

Similarly, one could not predict that water would make a poor 
thermometer because it would change from progressive contraction to ex- 
pansion as it was cooled to near its freezing point. But how can one tell 
whether the reversal is in the behavior of the water or in the effects of 
our operations for changing temperature? As long as we are dealing with 
a single relationship, there is little to restrict the theoretical possibilities. 
It is only when one compares the expansion and contraction of a number 
of different substances (or cross-checks with yet other phenomena varying
with temperature) that one finds that it is simpler to use hydrogen
as the thermometer and assume that there are anomalies in the contraction
of water. In other words, the ultimate choice of a given set of
units in a scale must be based on whether or not it yields the simplest 
matrix of relationships, or in short, the simplest laws. 

Within the behavioral field, Spence’s [103] characteristically excellent 
program of quantification looks promising because it seems to be yield- 
ing a series of reasonably simple laws and predictions which he is be- 
ginning to cross-check with a variety of different tests. 

Effect of carrying variables through entire range. It often is con- 
ceptually useful to imagine the effects of extrapolating a variable to each 
of its extreme limits. Similarly, it may be useful to try to determine 
empirically the effects at each limit. For example, one of the tentative 
results of Spence’s [103] program, which we have already quoted with 
strong approval, is that in simple instrumental learning situations, drive 
D, incentive K, and immediacy of reward J seem to combine additively 
with each other but multiplicatively with the number of training trials, 
or in other words, habit strength H. The equation for performance E is 
E = H(D + K + J).

Now let us see what this means if we take each of the variables down 
to the lower extreme limit of zero. Performance will be zero if either 
habit strength H or the sum of drive, plus incentive, plus immediacy of 
reward (D + K + J) is zero. But with high values of the other variables, 
we might expect considerable performance with either zero drive, zero 
incentive, or zero immediacy of reward (infinite delay). It seems rather 
unlikely that such an outcome would be experimentally confirmed in a 
test situation that was carefully designed to minimize uncontrolled 
sources of drive and reward produced by motives such as curiosity. But 
on the other hand, the very fact that the prediction seems to go against 
common-sense expectations would make its possible confirmation a 
brilliant coup for the theory. 
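
The behavior of the equation E = H(D + K + J) at the lower limits can be checked numerically. The sketch below is only an illustration; the parameter values are arbitrary and are not taken from Spence's data.

```python
def performance(habit, drive, incentive, immediacy):
    """Performance E = H(D + K + J), as stated in the text."""
    return habit * (drive + incentive + immediacy)


# Zero habit strength: performance is zero whatever the other values.
print(performance(0.0, drive=1.0, incentive=1.0, immediacy=1.0))

# Zero drive with the other variables high: performance is still substantial.
print(performance(2.0, drive=0.0, incentive=1.0, immediacy=1.0))

# Only when D, K and J are all zero does their sum force performance to zero.
print(performance(2.0, drive=0.0, incentive=0.0, immediacy=0.0))
```

The second case is the counterintuitive one: because D, K and J combine additively, setting any one of them to zero leaves performance well above zero as long as the others are high.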

Turning our attention to Spence’s [103] speed data, we see that the 
evidence for an additive combination is parallel curves, or in other words, 
zero interaction; the evidence for a multiplicative type of combination is 
diverging curves, or in other words, interaction among variables. In these 
experiments, it is interesting to note that none of the variables which fail 
to show interaction (drive, amount of incentive, immediacy of reward) 
is carried down to a level approaching zero. Thus, there is less op- 
portunity for them to show diverging curves, or in other words, inter- 
action. The one variable which shows the interaction, interpreted as a 
multiplicative relationship, is habit strength (i.e., number of training 
trials). This is the variable that is carried down to near zero, thus having 
the most opportunity to show interaction, especially if the speed units
should be compressed at the extreme lower end of the scale.
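
The link between the form of combination and the shape of the curves can be made concrete with a two-by-two interaction contrast. The sketch is illustrative only: the function names and the levels are hypothetical, and it assumes an equal-interval response scale, which is precisely the assumption questioned above.

```python
def interaction_contrast(f, a_levels, b_levels):
    """Difference between the effect of b at the high level of a and the
    effect of b at the low level of a. Zero means parallel curves."""
    a1, a2 = a_levels
    b1, b2 = b_levels
    return (f(a2, b2) - f(a2, b1)) - (f(a1, b2) - f(a1, b1))


def additive(d, k):
    """Two variables that simply add, such as drive and incentive."""
    return d + k


def multiplicative(h, d):
    """Two variables that multiply, such as habit strength and drive."""
    return h * d


print(interaction_contrast(additive, (1, 3), (2, 5)))
print(interaction_contrast(multiplicative, (1, 3), (2, 5)))
```

The additive rule gives a contrast of zero (parallel curves), and the multiplicative rule gives a nonzero contrast (diverging curves). If the response units are compressed at one end of the scale, however, this contrast alone cannot distinguish a true multiplicative combination from a distorted additive one.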

The next step would seem to be to determine how the variables com- 
bine at the extremes, unless one wants to make the assumption that speed 
yields an equal-interval scale only in the middle ranges. But if one makes 
that assumption and therefore excludes the extreme end of the scale, 
one should avoid extremely low levels of performance produced by near- 
zero habit strength as well as those produced by near-zero drive, no in- 
centive, or long delay. 

Gradually approximating the exact form of functions, although 
limited to qualitative tests. Acknowledging the fact that our present scales 
are merely ordinal does not necessarily force one into a defeatist position. 
It can motivate one to try to devise the types of tests that will yield 
definitive results within the limitations of qualitative scales.

To use an illustration which has already been discussed, suppose one 
is interested in determining whether one gradient of stimulus generaliza- 
tion is steeper than another. With an equal-interval scale, it would not 
make much difference if the two gradients were at different heights. But 
if the scale were compressed at the upper limit, the higher gradient might 
appear flatter merely because of this distortion in the units. That is why 
we chose our experimental conditions so that the two gradients actually 
crossed—a relationship which proves a difference in slope even if the 
units on the scale should be unequal. 
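
Why a crossing survives unequal units, while a mere difference in apparent steepness does not, can be shown with a small sketch. The gradients and the transformations below are hypothetical; each gradient is represented by just two points (the score at the original stimulus and at the different stimulus), and every transformation used is strictly increasing for positive scores.

```python
import math


def identity(x):
    return x


def compress_upper(x):
    """A strictly increasing transformation (for x > 0) that squeezes
    the units at the upper end of the scale."""
    return -1.0 / x


def crosses(grad_a, grad_b, transform=identity):
    """True if gradient a lies above b at one test point and below it at the other."""
    a0, a1 = (transform(v) for v in grad_a)
    b0, b1 = (transform(v) for v in grad_b)
    return (a0 - b0) * (a1 - b1) < 0


def drop(grad, transform=identity):
    """Apparent steepness: score at the original minus score at the different stimulus."""
    return transform(grad[0]) - transform(grad[1])


# Crossing gradients: the conclusion holds under every monotonic rescaling.
steep, flat = (10.0, 4.0), (8.0, 6.0)
for t in (identity, math.log, compress_upper):
    print(t.__name__, crosses(steep, flat, t))

# Non-crossing gradients: which one looks steeper depends on the units.
high, low = (10.0, 6.0), (5.0, 3.0)
print(drop(high) > drop(low))
print(drop(high, compress_upper) > drop(low, compress_upper))
```

In this example the higher gradient looks steeper in the raw units and flatter after the upper end is compressed, whereas the crossing is detected under all three scalings, since an increasing transformation cannot change which of two scores is the larger.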

The algebra of inequalities is another technique for making rigorous 
deductions that do not depend upon the equal-interval scale. One of 
my earlier studies [81] used this technique to make first approximations 
of the general forms of a function that would have to be assumed in 
order to allow different competing sets of assumptions to account for all 
of the data in some experiments.* Other applications might be made to
other problems. 

A somewhat different example comes from Hull [39, Chap. 10]. On 
the basis of Yoshioka’s [111] work on the ease of setting up preference 
for the shorter of two paths in a vareity of mazes, Hull concluded that 
a logarithmic function for the gradient of reinforcement would not work 
and therefore shifted to a growth function. Although each of Yoshioka’s 
tests involved only the determination of which group was better (i.e., 
a qualitative relationship not demanding an equal-interval scale for 
excitatory tendency), Hull was able to use these tests to decide between 
two types of functional relationships. Actually, Hull leaped to a more 
specific function than the data justified. Yoshioka’s tests did eliminate the


* Since I was out of the country and did not get to see the proof, two errors
crept into p. 195 of the article. In its corrected form, Eq. (3) should read 
E+14 X= D, and footnote 12 should read D > Ds > Daz and D > Da > Das. 
logarithmic function; his tests and other data limited the type of relationship
to functions with certain specifiable characteristics which are
met by growth curves, but which could also be satisfied by certain devia- 
tions from exact growth curves. Nevertheless, this approximation of a 
general type of functional relationship, before having an equal-interval 
scale, is an interesting model. 

Similarly, in my article on Displacement [61, p. 171] I show in some 
detail that some deductions will follow from any curves which have a 
negative slope throughout, other deductions demand somewhat more re- 
stricted classes of curves, and still other deductions demand curves with 
quite specific characteristics, namely curves that are concave and that are 
raised by increases in drive in such a way that the absolute increase at 
any point is some positive function of the original height of the curve at 
that point. All of these deductions can be tested by qualitative experi- 
ments. They seem to be confirmed by clinical data; their more rigorous 
verification by experimental data would be another example of specifying 
a considerable number of the quantitative characteristics of a functional 
relationship before developing an equal-interval scale. With sufficient 
ingenuity, it may be possible to devise theoretical analyses and ex- 
perimental tests that will enable us to approximate certain functional 
relationships so closely that they may become a basis for equal-interval 
scales. 

I believe that it is worthwhile to devote further attention to a 
rigorous though qualitative approach which first determines the general
significance of the variables, and then goes on to narrow down the choice 
toward a more exact quantitative function. 


INTRODUCTION


Clark L. Hull began publishing descriptions of his theory of learned 
behavior around 1930, and from then until his death in 1952, devoted 
his energies to the testing, modification, and extension of his theoretical 
ideas. During the first decade of this period, Hull worked closely in col- 
laboration with Kenneth W. Spence, Hull concentrating on the basic 
principles involved in classical and instrumental conditioning and Spence 
concerning himself primarily with his separate but related theory of dis- 
crimination learning. Their interests expanded following publication of 
Hull’s Principles of Behavior in 1943. That is, Spence turned to a more 
detailed and systematic study of the data underlying the basic assump- 
tions of a theory of simple learning phenomena, the results of which have 
only recently been collected in Behavior Theory and Conditioning, while 
Hull began to attempt the application of the initial basic principles to 
more complex individual behavior as described in his posthumous work, 
A Behavior System. 

A number of others, notably Neal E. Miller, have participated in the 
development of what has been characterized as the S-R reinforcement 
interpretation of learning. Miller’s many contributions to the basic theory, 
and his important extensions of it to conflict, social behavior, and psy- 
chotherapy are largely omitted here because they are represented else- 
where in the project. Indeed, properly to identify and credit all the con- 
tributors would be an exceedingly difficult task and has simply not been 
attempted. Since, however, the following analyses are either descriptions 
of or direct outgrowths from this general approach, the writer must dis- 
claim credit for many of the ideas while, at the same time, accepting 
responsibility for their presentation. 

In a larger sense, most of the orienting attitudes and much of the 
basic methodology are not unique to the Hullian approach and are 
equally applicable not only to other systems in psychology but also to 
other sciences. The complete identification of these interrelationships is a 
problem for historians and philosophers of science, and the present ac- 
count will offer only a minimal attempt to trace the historical antecedents 
of the ideas or to contrast them with alternatives. The purpose, instead, 
is to characterize the Hullian approach as seen by the present writer, and 
readers interested in a more comparative account should consult Spence 
[175, 176] and Hilgard [64, 65]. 

The approach falls within the general rubric of behaviorism as
initially set forth by Watson. That is, the subject matter is the behavior 
of living organisms, and as Spence has pointed out, this does not differ 
in kind from the subject matter of other sciences. The most basic 
assumption is that behavior exhibits sufficient order and regularity that 
lawful description of it can be made. This is, of course, determinism. 
At least it is assumed that any “free” effects are so small or so infrequent 
as to make determinism a fruitful working hypothesis. 

The general attitudes, techniques, and methods of the approach have 
been discussed by Spence [173] and are largely imitations of some of 
those developed by the successful physical sciences. Thus, interesting 
phenomena are abstracted from the natural environment and subjected 
to systematic and controlled study. The data from such studies are re- 
ported in objective, i.e., interpersonally reliable, terms and form the basis 
of behavioral laws. 

The singular goal of the approach is the derivation of these behavioral 
laws, or particular instances of them, from an explicit set of postulates. 
The postulates, plus the relevant boundary conditions of any study, serve 
as “givens” from which the theorems are derived by the rules of logic 
and mathematics. The usefulness of the postulates is judged solely by 
the correspondence between these theorems and the appropriate empirical 
events.

The ability to derive theorems accurately corresponding to empirical 
phenomena is the sense in which the theory provides “explanation” of 
the phenomena. Of course, at a more molecular level, one might seek to
explain (describe, deduce) the postulates themselves, but this is not 
necessarily a task of the theorist. Similarly, applied psychologists may 
utilize the predictive power of the theory for practical “control” problems, 
but the theorist’s activities fall within the realm of pure science. 

In implementing the approach, it is assumed that some of the basic 
principles governing learned behavior can be determined from a study of 
the relatively simple learning phenomena: classical and instrumental con- 
ditioning. These experimental paradigms are not thought of as trivial 
laboratory settings; a great deal of “everyday” behavior can be seen to
be uncontrolled instances of them. However, it is not argued that the more 
complex phenomena can be understood solely in terms of the principles 
derived from the simpler situations. New variables and different specifi- 
cations of old variables are most probably involved in the more complex 
areas, but it is assumed that the basic principles will, by and large, still 
apply. Should this be true, then the reasonable approach is first to isolate 
these principles in the simpler situations and only then turn to the more 
complex ones. 

This hypothesis cannot be evaluated a priori; its value is entirely a 
matter of fact. Although one may doubt the likelihood that the hypothesis
will be true [see, e.g., 61, 11], it is yet too early to make any
definitive statements in this regard. There have, however, been a number 
of encouraging attempts to apply some of the principles to more complex 
areas. These have included discrimination learning [164], verbal learning 
[70, 47], motor learning [2, 42, 86, 194], problem solving [8], social
learning [123], legal development [147], culture [124], kinship termi- 
nology [132], and most extensively, psychotherapy [37]. 

As mentioned earlier, Hull was willing to explore the way in which 
the theoretical approach would deal with more complex phenomena 
without fully resolving many of the specific problems in his 1943 theory. 
This did not reflect a failure to recognize such problems nor an under- 
estimate of their importance. Indeed, his subsequent research was di- 
rected at the formalization and quantification of the postulates. However, 
no one was more aware of the tentative and illustrative nature of his 
work than Hull; he seldom referred in conversation to what he had done, 
talking instead about what could be done by those who would rebuild 
the theory. 

In contrast to Hull’s willingness to risk almost certainly being wrong 
about many details in order to illustrate the approach more extensively, 
Spence has followed a more intensive and rigorous program. He has held 
his theorizing close to the available data and has concentrated on a 
systematic empirical study of some of the more quantitative assumptions. 
On his part, this view has not resulted from a denial of the ultimate 
possibility of a comprehensive behavior theory, but rather from the belief 
that it must be attained gradually. 

There is much to be said for both approaches. Certainly an adequate 
comprehensive theory should be built upon the solid foundation provided 
by systematic and detailed testing of the theory in the simpler situations. 
Yet, in Hull’s day, there was a need to gain perspective on the larger goal 
and to show that the approach was at least potentially useful as a means 
toward a general theory of learned behavior. Even now, a great deal of 
important information can be fed back by those who encounter difficulties 
in attempting to apply the theory to more complex phenomena. 

However, if there is any one thing that has impeded the development 
of better theories, it is the paucity of basic, systematic data and of re- 
peatedly verified laws. Although these are the kinds of empirical knowl- 
edge that are most likely to stimulate useful theoretical ideas, few 
psychologists have been willing to undertake the painstaking labor neces- 
sary. The psychological literature contains many tentatively established 
findings which invite replication and systematic elaboration, and the 
theorist frequently finds himself not only guessing with his postulates, but 
also speculating about the phenomena his postulates are intended to
describe. 

This state of affairs is probably justified in a relatively young science 
where the initial identification of significant variables has put a premium 
on creativity and originality. Certainly the Hullian approach has con- 
tributed greatly to the advancement of empirical knowledge during the 
past twenty-five years of this exploratory stage. This has resulted largely 
from an emphasis on the value of systematic and integrated research 
programs, rather than the haphazard study of any variable which 
chanced to catch the fancy of an experimenter. Perhaps now, at mid- 
century, one of the major contributions of the Hullian approach will be 
so to affect the attitudes of psychologists that greater value will be at- 
tached to the rigorous and systematic determination, and further verifica- 
tion, of psychological laws. 

The present description of the Hullian approach will be made in 
three major parts. In the first are treated several matters which are 
general characteristics of the approach and are not peculiar to any 
specific set of postulates. We shall then turn to a description of the in- 
dependent, intervening, and dependent variables with which the theory 
has been concerned, contrasting some of the current alternative
formulations. Finally, we shall discuss a variety of research areas in an attempt to
illustrate the interplay between the theory and research. It should be 
recognized at the outset that the treatment given will not provide a 
completely comprehensive account of the approach since many important 
topics are omitted or dealt with only briefly. However, most of the prob- 
lems which seem important at this juncture are raised, and the 
descriptions given, supplemented by extant writings on specific issues, 
should permit a reasonably broad characterization of the approach. 

We shall consider concomitantly several formulations by Hull and 
by Spence. Emphasis is placed on Hull’s 1943 and 1949 postulates, and 
on Spence’s analysis of discrimination learning and the theory given in 
his Silliman Lectures at Yale. In addition to these, occasional reference is 
also made to the micromolar theory being developed by the present 
writer, and because this approach has not yet been widely discussed, a 
brief description of it here seems necessary. 

The classical theories of Hull and Spence define the response in 
molar terms, like pressing a bar, running down an alley, etc. All the 
quantitative characteristics of a response are predicted from a single 
terminal construct, excitatory potential. That is, the independent variables, 
such as practice, drive, reward, etc., converge upon excitatory potential 
and this single number is used to determine whether the response will 
occur, how quickly it will occur, and what its amplitude will be. This 
kind of theory has several general implications. Most critically, one 
would expect that the different response measures would increase as 
training progresses, and would be highly correlated with each other. 

A micromolar theory, on the other hand, identifies quantitatively dif- 
ferent behaviors as different responses with separate excitatory potentials. 
Thus, running fast is a different response from running slowly, rather 
than being simply a stronger strength of approach. Such a theory does 
not imply that response measures will necessarily increase with training; 
they will increase only if the subject has something to gain from such 
increases. Response speed is a case in point, because the subject can 
usually get the reward sooner by responding faster. Accordingly, one 
would expect speed measures to increase with training, as they typically 
do. However, a measure such as the force a rat exerts on a bar does not 
continue to increase with training because it does not pay to press the bar 
harder than necessary to activate the feeding mechanism. 

Similarly, a micromolar theory does not require that the several 
response measures be correlated with each other. Instead, the organism 
is viewed as learning whatever response pattern maximizes reward while 
minimizing effort, be it a slow, vigorous response, a fast, gentle one, or 
some other combination. Conditions in which maximum reward is given 
for some particular speed or amplitude are called correlated reinforce- 
ment, and data from such conditions are especially relevant to the 
question of whether quantitatively different behaviors should be treated 
as different responses. 

Notwithstanding the promise which a micromolar approach seems 
to have, it has not yet been sufficiently developed to present in more 
than a general way. However, it serves as a good illustration of one of 
the most important features of the Hullian approach, namely, that every 
aspect of the theory is modifiable in the face of advancing knowledge. 
Small details are changed quite readily. And even more general aspects, 
such as the definition of the response and the assumption that reward is 
necessary for learning, are not irrevocably cast. Accordingly, the Hullian 
approach involves much more a commitment to a method of theorizing 
than to a particular set of theoretical ideas. 


THE HULLIAN APPROACH TO THEORY IN PSYCHOLOGY 


In this part are discussed a variety of methodological questions about 
the Hullian approach to theory in psychology. Specifically, we shall
consider how the postulates are constructed and quantified, how theorems
are derived from them, and how the theory is related to other disciplines. 
In doing so, it will be useful to begin with a brief description of the 
nature of Hullian-type theory. While this description is sufficient for our 
purposes here, the interested reader should consult the works of Spence 
[170, 173], Bergmann and Spence [7], and Bergmann [5, 6] for a more 
complete and analytical account of those matters within the realm of the 
philosophy of science. 

The nature of a Hullian-type theory. In order to characterize a 
Hullian-type theory, it will be helpful first to describe briefly a con- 
ceptual hierarchy of behavioral laws. Such laws, or empirical generaliza- 
tions—see [170] for a description of the several kinds of psychological 
laws—organize the facts from behavior research into statements typically 
of the form “If S then R” from which predictions can be made about 
future instances of that S. 

Even within a single field, such as psychology, laws may vary in 
their level of generality, i.e., in the range of phenomena to which they 
are relevant. For example, one might generalize from Humphreys’ early 
experiment [75] that, “If, during the classical conditioning of the eye- 
blink response, the UCS follows the CS on only 50 per cent of the trials, 
resistance to extinction will be greater than if the stimuli are paired on 
every trial.” If this statement is contrasted with, “If a response receives 
partial reinforcement during acquisition, resistance to extinction will be 
greater than following continuous reinforcement,” one can see that the 
latter applies to instrumental as well as to classical conditioning, to re- 
sponses other than the eyeblink, and to ratios other than 50 per cent. The 
second law is, therefore, at a higher level of generality because the former 
is deducible from it. 

More general laws are desired, of course, because they subsume a 
wider range of phenomena under a smaller number of statements. Further 
generality may be gained by bringing several higher-level laws together 
into a system integrated by their relevance to the same behavior variables. 
To illustrate this fact, consider another relatively high-level behavioral 
law, “A response conditioned to one stimulus will tend also to occur to 
similar stimuli.” Now the laws of stimulus generalization and partial
reinforcement, together, imply more than the sum of their separate
implications. One can predict, for example, that subjects trained under
partial reinforcement to one stimulus will resist extinction to a generalized 
stimulus more than other subjects trained with continuous reinforcement. 
This last statement is not less general than either of the other two laws 
separately; the added generality has come from the combination of laws. 

One might be content to seek a system of relatively high-level laws 
within one’s own field. Such a system would serve to order the available 
knowledge and would permit predictions concerning a fair range of 
phenomena. A Hullian-type theory, however, is more than this, as can 
be seen if the conceptual hierarchy of behavioral laws is extended to 
include the several behavior sciences. 

If the empirical laws of one field are, in principle, deducible from 
those of another, then the field whose laws are subsumable can be said 
to be more molar, and the subsuming field, more molecular. Level of 
molarity is, of course, a relative matter, psychology appearing to be molar 
relative to physiology but molecular relative to sociology. The laws of 
all behavior sciences may be conceptualized in a single hierarchy of 
generality, those from the more molecular fields occupying the higher 
levels because of their potentiality, in principle, of implying the laws of 
the more molar fields. 

Within this conceptual schema, the postulates of a Hullian-type 
theory are around the level of physiological laws. The elements of the 
postulates, i.e., intervening variables, are around the level of physiological 
processes. However, the postulates are not asserted as genuine physio- 
logical laws. They were constructed solely to handle behavioral data in 
psychology, and it is important to distinguish between the mathematical 
statements and other possible connotations of the postulates. Thus, the 
postulate that habit strength grows as a function of the number of rein- 
forced trials could imply, among other things, that there is some 
physiological process that could be labeled habit strength and which 
changes according to the specified function. Such physiological implica- 
tions may be disregarded because the theory is properly evaluated only 
by the correctness of its psychological implications.¹

The right of the theorist to disclaim unintended physiological inter- 
pretations is a part of his general right to delimit the range of relevant 
phenomena to any areas he chooses. Thus, Spence’s early theory of 
discrimination learning was closely restricted to nonarticulate organisms 
in situations insuring reception of the critical cues. Also, Hull tacitly 
avoided the phenomena of relearning with his postulates and rejected 
conditions of correlated reinforcement. Ideally, all such limitations should 
be stated explicitly by the theorist although, in practice, he may not have 
recognized some of these and may not be ready to specify others because 
he has not yet considered them sufficiently to know whether the theory 
is inadequate. One who argues that the theory is proved wrong by the 
falseness of any such unintended implications completely misses the point 
of the activity. The purpose of the theory is to mediate predictions over a 
specifiable range of phenomena, and the theory is useful to the extent 
that it does just that. 

Clearly, then, the postulates do not purport to describe the structure
of the “real” organism. Instead, they describe a “model” organism whose
construction may differ markedly from that of the real organism but
whose relevant psychological input-output properties are the same. It
is in this sense that one can say that the real organism behaves as if it
were built like the model.

¹ For a more philosophical way of approaching these matters, consult
MacCorquodale and Meehl [112], who attempt to distinguish between intervening
variables and hypothetical constructs, and related articles by Ginsberg [48], Marx
[118], and Maze [119].

It would seem reasonable to ask why one builds such a theory rather 
than simply a system of empirical laws within one’s own field. Actually, 
however, the question is fundamentally one of degree, i.e., of the level 
to which one attempts to generalize his knowledge. Accordingly, it is a 
matter of the scientist’s preference, and may be influenced by a number 
of factors. Primarily, the ability to predict concerning a wide range of 
phenomena from a small number of statements is taken to be a goal 
of science, and theory construction is an approach to this goal. It is 
particularly appealing because psychological laws seem so numerous, 
complex, and at times mutually inconsistent that it appears more difficult 
to formulate a lower-level empirical system than to construct a theory. 
Thus, Spence [180] has collected a variety of learning curves involving 
different response measures, different apparatuses, and different organ- 
isms, and has shown how all of these may be seen as instances of a few 
hypothetical equations rather than as a miscellaneous collection of diverse 
empirical laws. Such a result, multiplied several times over, provides the 
most immediate justification of the approach. 

Relation to other areas. More molecular fields. We have already noted 
that the postulates can be given physiological interpretations but that the 
value of the theory to psychology does not depend on whether physio- 
logically correct interpretations can be found. Of course, the theorist 
need not disclaim such interpretations, or may at least be interested in 
the degree of correspondence between the postulates and physiological 
laws. Indeed, he may actively employ the available physiological
knowledge when building a “model” organism.

From the point of view of an integrated behavior science, one would 
prefer a system of true physiological laws that would incorporate 
physiological as well as more molar phenomena. However, from the point 
of view of predicting only psychological phenomena, a model organism is 
just as useful as the real one. Clearly, if one can successfully predict the 
behavior of a model that acts like the subject being studied, then one can 
equally well predict the subject’s behavior. Furthermore, it is not im- 
possible that psychological phenomena could be predicted from a model 
that is structurally simpler than the real organism.

The writer is not competent to make a detailed comparison of the 
postulates with physiological facts. For that matter, the most relevant 
kinds of physiological work have only recently begun to appear [e.g.,
182], and most physiological psychologists would probably admit that
their field is not yet ready to support such an analysis. Premature
attempts run the risk that apparent specific discrepancies in one part of the
theory may balance unknown, but compensating, errors in other parts. 
In such a case, changing a postulate to make it more consistent with 
physiology could, at the same time, make the theory as a whole less 
useful for psychology. Accordingly, neither field seems ready for a con- 
certed effort to integrate psychological and physiological phenomena 
within the same system. 

Other areas of psychology. Although there have been a variety of 
attempts to apply the theory to relatively complex psychological phe- 
nomena, it is primarily concerned with classical and instrumental condi- 
tioning. We have previously noted the assumption that the basic prin- 
ciples derived from these simpler forms of learning will also apply to 
other areas of psychology, presumably with the addition of new variables. 
Of the new variables which will be required in order to deal satisfactorily 
with human behavior, language currently appears to be the most critical. 

The fact that humans can talk, to themselves as well as to others, 
may be expected to affect essentially every variable in the system. To 
mention a few of the ways language can enter into a behavior situation: 

1. Words, as stimuli, may modify the stimulus complex and bring in
instrumental responses previously conditioned to them.

2. The human may be able to gain additional habit strength by re- 
hearsing stimulus-response-reinforcement sequences verbally, the effect 
of which will depend upon the degree of correspondence between the 
implicitly practiced relationships and the objective situation. 

3. The occurrence of words that have been conditioned as secondary 
drives will affect the human’s motivation. 

4. Similarly, the incentive motivation can be affected by verbal cues, 
particularly that based on delayed reward. Thus, on the one hand, words 
can be used as secondary reinforcers to span a long delay of reinforcement,
as when one says, “I’ll get paid for this tomorrow.” On the other
hand, a past stimulus-response sequence can be reinstated verbally im- 
mediately preceding the reinforcement, as when one says, “I’m getting 
paid for what I did yesterday.” 

Although the points listed above do not exhaust the possible effects 
of language, they do point to the importance of its inclusion in the theory 
if one wishes to deal with complex human behavior. The “pure-stimulus- 
act” [68] or “cue-producing-response” [37] appears to be the most 
promising approach, but it still remains to be formally implemented. It 
is one thing to predict what would happen if a sentence occurred; it is 
quite another thing to predict that the sentence will occur. Since words 
are presumably responses, learned and performed according to the 
principles of the theory, the solution of the problem should be possible
[see, e.g., 50].

More molar areas. Psychology is to sociology, cultural anthropology,
etc., much as physiology is to psychology. Accordingly, the more molar 
social scientist may choose to operate without reference to psychological 
laws, just as the learning theorist may ignore physiology. However,
because a comprehensive theory of learned behavior should also apply
to social phenomena, it is interesting to explore the extent to which the 
current theory meets this objective. At the present time, one can only 
do this informally because it is necessary to assume specifications of the 
intervening variables that have not been stated formally as yet. For 
example, in analyzing a problem like the resistance of a culture to change,
e.g., the difficulty of introducing a new culture element, one might cast 
it in an approach-avoidance paradigm. Variables such as giving up 
money, pressures for social conformity, vicarious learning, etc., must 
somehow be related to existing, or possibly new, intervening variables, 
and the molar phenomena must be translated into familiar response terms 
like the probability of adopting the element. This kind of informal and 
incomplete analysis not only strengthens the hope for a useful compre- 
hensive theory but may also suggest important research problems to 
workers in the several fields involved. 

The integration of the behavior sciences seems, at mid-century, to 
be a natural development for the future. This does not mean, of course, 
that all social sciences are destined to disappear into physiology. The 
range of organismic behavior is so great that there probably will always 
be fields corresponding to circumscribed aspects of behavior. Indeed, 
the increasing wealth of empirical knowledge is leading to narrower 
specialization on subareas within each field. Integration is more likely to 
involve primarily a recognition of the interrelationships between the fields 
and the adoption of a common language system. This will permit inter- 
communication and cross-fertilization, and eventually a comprehensive 
behavior theory. 

Method of postulate construction. Since the postulates are statements 
which, structurally, could be laws, the method of their construction is 
analogous to induction. There are no rigid rules for this kind of “induc- 
tion” except that the result must imply the original statements deduc- 
tively. Of course, a number of solutions may meet this criterion, and the 
postulates, therefore, cannot correctly be viewed as being unique. How- 
ever, as the number of original statements is increased, the number of 
solutions presumably decreases, and it is then likely that various solutions 
will be isomorphic or differ mainly in level of generality. 

Typically, in addition to the restriction that the postulates imply 
the statements used in their construction, the theorist will abide by other 
self-imposed constraints. The most common of these is parsimony, i.e., 
the theorist prefers the simpler of the alternatives apparent to him.
Frequently, the theorist is also interested in plausibility, i.e., he will select
that alternative which fits most clearly with his informal ideas of how an 
organism might reasonably be built. Thus, Hull was concerned that his 
postulates seem plausible in terms of their adaptive significance and extra 
confidence was attached to properties appearing to have survival value. 

Disregarding these extra constraints for the moment, let us consider 
how the postulates are, in fact, related to the data known to the 
theorist. Consider a function relating some response measure to a single 
stimulus or antecedent condition. As a simple illustrative example, assume 
one had found a linear function relating performance R to the number 
of reinforced trials N, i.e., 


$$R = aN + b \tag{1}$$

Suppose further that one also already had the following postulates:

(a) Performance is a linear function of excitatory potential (${}_{S}E_{R}$), i.e.,

$$R = a'\,{}_{S}E_{R} + b' \tag{2}$$

(b) Excitatory potential, in turn, is the simple multiplication of drive
($D$) and habit (${}_{S}H_{R}$), i.e.,

$${}_{S}E_{R} = D\,({}_{S}H_{R}) \tag{3}$$

(c) Reinforced trials affect only habit strength, i.e.,

$${}_{S}H_{R} = f(N) \tag{4}$$

and

$$D \neq f(N) \tag{5}$$

From these, and the known empirical function, we can trace the
following derivation: From (2) and (3),

$$R = a'D\,{}_{S}H_{R} + b' \tag{6}$$

From (5), $D$ is a constant, so we may define

$$a'' = a'D$$

and therefore,

$$R = a''\,{}_{S}H_{R} + b' \tag{7}$$

Then, from (1) and (7),

$$aN + b = a''\,{}_{S}H_{R} + b' \tag{8}$$

which simplifies to

$${}_{S}H_{R} = \frac{a}{a''}\,N + \frac{b - b'}{a''} \tag{9}$$


One would, therefore, know that habit must be a linear increasing
function of the number of reinforced trials, which happens in this case
also to be the form of the illustrative empirical function. Observe closely,
however, that a conclusion could not have been made at all without the
kind of assumptions contained in Eqs. (2) to (5). Furthermore, a quite
different conclusion would have followed from the same procedure were
these assumptions different. Thus, if N also affected drive, D would not
disappear as a constant, and it would then be impossible from these data
alone to separate the effects of N on ${}_{S}H_{R}$ from its effects on D. Were
${}_{S}E_{R}$ some other function of D and ${}_{S}H_{R}$, or were R some nonlinear function
of ${}_{S}E_{R}$, the concluding function would be affected accordingly. Depending
jointly on these matters, the theoretical function may or may not
look very much like the empirical function. Despite appearance, however,
each would have the same logical relation to the data, namely, that,
within the constraints provided by the rest of the theory, each is the only
solution which will correctly imply the empirical function.
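
The dependence of the inferred habit function on the rest of the theory can be checked numerically. What follows is only a minimal sketch: the parameter values, the function names, and the alternative logarithmic performance function are illustrative assumptions, not part of the theory as stated. It solves Eq. (9) for habit under the linear postulate (2), confirms that the result reproduces the empirical function (1), and then repeats the inference under the assumed logarithmic postulate to show that the same data would then imply a quite different habit function.

```python
import math

# Illustrative (assumed) values; none of them come from the text.
a, b = 2.0, 1.0                  # empirical function, Eq. (1): R = a*N + b
a_prime, b_prime = 0.5, 0.2      # performance postulate, Eq. (2): R = a'*E + b'
D = 4.0                          # drive, constant with respect to N by Eq. (5)


def empirical_R(N):
    """Eq. (1): the observed linear relation between performance and trials."""
    return a * N + b


def habit_linear(N):
    """Eq. (9): habit implied when R is linear in E and E = D * H."""
    a_dprime = a_prime * D       # a'' = a' * D
    return (a / a_dprime) * N + (b - b_prime) / a_dprime


def predicted_R(N):
    """Eqs. (2) and (3): performance predicted from the inferred habit."""
    return a_prime * (D * habit_linear(N)) + b_prime


def habit_if_log(N):
    """Habit implied by the same data if instead R = a'*ln(E) + b' (assumed)."""
    return math.exp((empirical_R(N) - b_prime) / a_prime) / D


for N in (1, 2, 3):
    print(N, empirical_R(N), predicted_R(N), habit_linear(N), habit_if_log(N))
```

With the linear postulate the inferred habit function is itself linear in N; with the assumed logarithmic postulate it grows exponentially, although both imply the same empirical function.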

One may question the propriety of this procedure on the grounds 
that the results are no better than the data used, and typical empirical 
data include considerable error variance. What the theorist does is freely 
to select and modify the data in terms of what he thinks to be the 
true function, the deviation from the actual data depending upon his 
estimate of their reliability. Fortunately, this rational procedure can be 
tested only by more reliable data, and these latter data can then be 
put back through the mill, if necessary, to generate a better postulate. 


Table 1. Two Hypothetical Sets of Data Determined from a Factorial
Study in Which Both Hours of Deprivation and Magnitude
of Reward Were Varied

Amount of     Hours of deprivation     Amount of     Hours of deprivation
reward          High  |  Low           reward          High  |  Low
High              4       3            High             16       8
Low               3       2            Low               8       4


The inductive process follows essentially the same logic for more 
complex questions requiring factorial designs of experiments. Consider, 
for example, a question in which Spence has been interested, the nature 
of the interaction between drive and incentive. In Table 1 are shown two 
possible sets of empirical data from which this interaction is to be deter- 
mined. The matrix on the left shows no interaction in measured per- 
formance between amount of reward and hours of deprivation and, 
taken directly, would indicate an additive relation between the inter- 
vening variables anchored to them. The matrix on the right, however, 
would suggest a multiplicative interaction. Determining from these data
the interaction between intervening variables, however, again requires
other prior assumptions. Most critically, one must assume, as Spence
has, that the performance measure is linearly related to response tendency.
Thus, the matrix on the left is simply $\log_2$ of the matrix on the
right, and a multiplicative performance interaction would imply an
additive theoretical interaction were response tendency a logarithmic
function of the response measure used, and vice versa.²
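
The relation between the two matrices can be verified with a few lines of arithmetic. The sketch below is illustrative only: the function names are assumptions, and the left-hand matrix is computed from the right-hand one by taking logarithms to base 2, as the text states, rather than being entered separately.

```python
import math

# Right-hand matrix of Table 1 (rows and columns ordered high, low).
right = [[16, 8], [8, 4]]

# The text states that the left-hand matrix is log2 of the right-hand one.
left = [[math.log2(x) for x in row] for row in right]


def additive_interaction(m):
    """Difference of differences; zero means the factors combine additively."""
    return (m[0][0] - m[0][1]) - (m[1][0] - m[1][1])


def multiplicative_interaction(m):
    """Ratio of ratios; one means the factors combine multiplicatively."""
    return (m[0][0] / m[0][1]) / (m[1][0] / m[1][1])


print(left)
print(additive_interaction(left), additive_interaction(right))
print(multiplicative_interaction(right))
```

The same cell values thus show no interaction on the logarithmic scale and a purely multiplicative one on the raw scale, which is why the scale relating the response measure to response tendency must be assumed before the theoretical interaction can be read from the data.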

Since other parts of the theory are necessary to this “inductive” 
procedure of obtaining any one postulate, it is reasonable to ask where 
one first obtains these other parts. How, that is, does one begin? Un- 
fortunately there is no simple answer to this question. Certainly the 
notions of parsimony and plausibility play a role, but they alone are 
not enough. What is needed is the spark of theoretical insight, the utter- 
ing of a sentence. The theorist scans a collection of different sets of data, 
methodically trying out various structures and using each failure as an 
additional cue to evoke more hopeful ones.³ It would be hard to
overestimate the time this trial-and-error process may consume, or properly
to describe the importance a new sentence can have. Saying for the first 
time, “Suppose we separate the habit and performance factors,” or the 
like, stimulates a new structure that can then be developed by the pro- 
cedure just described. It is their frequency of output of just such useful 
sentences that has given stature to Hull, and to Spence, and to Miller, 
and will to others of subsequent generations. 

Manner of deriving theorems. Theorems are derived from the theory 
by the conventional rules of logic, including mathematics. That is, the 
postulates serve as premises which, together with the statements describing 
some particular conditions, imply the theorems deductively. It is im- 
portant to recognize that theorems are implied by the entire theory, not 
by any segment of it separately. Some parts of the theory may not be 
differentially involved in a particular situation, but proof of this fact is 
a genuine, if often ignored, part of the derivation. 

² The theoretical interaction can be obtained without this assumption in the
special case where the functions intersect at some point. Thus, learning curves
diverging from the same starting point imply a multiplicative interaction with
habit provided only that performance is a monotonic function of response tendency.

³ In this context, the advantage of the simpler learning situations is that they
involve fewer variables and therefore suggest a smaller number of alternative
structures.

Precise mathematical derivation is the ultimate goal of the theory,
but until the postulates are exactly quantified, derivations must be made
within the algebra of inequalities. Thus, if one wished to derive from the
1943 Hull system relative performance by two groups differing only in
hours of food deprivation, the derivation would be:

$$
\begin{array}{ccccccccccccccc}
R_{\text{Hi }D} & = & f({}_{S}E_{R}) & = & f[({}_{S}H_{R} & \times & D) & - & {}_{S}I_{R} & - & I_{R} & - & {}_{S}O_{R} & - & {}_{S}L_{R}] \\
\vee & & \vee & & \parallel & & \vee & & \parallel & & \parallel & & \parallel & & \parallel \\
R_{\text{Lo }D} & = & f({}_{S}E_{R}) & = & f[({}_{S}H_{R} & \times & D) & - & {}_{S}I_{R} & - & I_{R} & - & {}_{S}O_{R} & - & {}_{S}L_{R}]
\end{array}
$$


This procedure, of course, can be used equally well with more than two 
groups to include also second-order differences, although in such cases 
it is typically convenient to make the derivation graphically. 
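
The logic of a derivation within the algebra of inequalities can be sketched as the combination of ordinal comparisons. The code below is a hypothetical illustration, not a procedure given in the text: the function name and the coding of comparisons as +1, -1, and 0 are assumptions. It also assumes that performance is a monotonic increasing function of excitatory potential, so that the direction of a difference in potential carries through to performance.

```python
def combine(signs):
    """Combine ordinal comparisons of terms that each push the outcome one way.

    Each entry is +1 (favors group A), -1 (favors group B), or 0 (no difference).
    Returns +1, -1, or 0 when the direction is determinate, None otherwise.
    """
    nonzero = {s for s in signs if s != 0}
    if not nonzero:
        return 0
    if len(nonzero) == 1:
        return nonzero.pop()
    return None  # opposing effects: only exact quantification can decide


# Two groups differing only in drive: D favors the high-drive group, while
# habit, the two inhibitions, oscillation, and threshold are equal.
print(combine([+1, 0, 0, 0, 0, 0]))   # 1

# Two variables with opposite effects leave the comparison undetermined.
print(combine([+1, -1, 0]))           # None
```

The second call shows the limit of the method: as soon as two terms differ in opposite directions, no greater-than statement follows without numerical values for the parameters.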

When the problem is too complex to be handled conveniently in this 
manner, the derivational procedure is to assume illustrative numbers for 
the parameters involved, and to make a mathematical derivation using 
these numbers. Although this procedure yields conclusions involving 
exact numbers, they are only illustrative and the theorems are stated in a 
greater-than language. 

In some cases, the same greater-than result would follow regardless 
of the particular illustrative values used. When this is true, there can be 
no question about the logical correctness of the theorem. In other cases, 
however, one set of illustrative values gives one answer while the opposite 
answer can be obtained if other values are used. For example, in a 
choice situation, drive emphasizes any habit difference that exists. High 
drive, therefore, improves performance only if the correct response has 
the dominant habit. Therefore, whether the correct choice will be learned 
faster under high drive than under low drive depends upon what one 
assumes about the initial levels of the competing habits, and the relative 
rates at which habit and inhibition are built up. Under such circum- 
stances, the most conservative approach is to reject the problem as a 
possible test of the theory. Frequently, however, the result changes 
systematically with changes in the illustrative values, and the theory can 
then be tested by a detailed study of the relevant variable. 
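
The choice example can be made concrete with illustrative numbers. The sketch below is a simplified assumption-laden illustration: it takes excitatory potential to be the product of drive and habit for each response, ignores inhibition, oscillation, and threshold entirely, and uses arbitrary drive and habit values and a hypothetical function name.

```python
def potential_difference(drive, habit_correct, habit_incorrect):
    """Difference in excitatory potential, assuming E = D * H for each response."""
    return drive * habit_correct - drive * habit_incorrect


LOW_DRIVE, HIGH_DRIVE = 1.0, 3.0   # assumed illustrative values

# Case 1: the correct response starts with the stronger habit.
print(potential_difference(LOW_DRIVE, 0.6, 0.4),
      potential_difference(HIGH_DRIVE, 0.6, 0.4))

# Case 2: the incorrect response starts with the stronger habit.
print(potential_difference(LOW_DRIVE, 0.4, 0.6),
      potential_difference(HIGH_DRIVE, 0.4, 0.6))
```

In the first case high drive enlarges the advantage of the correct response; in the second it enlarges the advantage of the incorrect one. The direction of the predicted drive effect therefore turns on the assumed initial habits, which is the sense in which one set of illustrative values can give one answer and another set the opposite.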

One interesting illustration of this last point appears in A Behavior
System.⁴ Hull was attempting to derive that discrimination learning
would be more difficult if the difference between the positive and negative
stimuli was held constant and their absolute intensities increased,
but he obtained the reverse relationship by the use of illustrative numbers.
This result was peculiar to the illustrative dynamism and generalization
functions and to his choice of the weaker stimulus as the positive one and
the lower intensity pair at or near the threshold. If the same derivation
is made for other pairs of stimuli, the result is more in line with the
Weber-Fechner law. The theory actually predicts an inversion in the
function, with less intense pairs being progressively less difficult to
discriminate except at or near the threshold.

⁴ When provided with the evidence that his derivation was unique to the
conditions chosen, Hull declined to use it. Partly this was because it would have
required changes in other parts of the text, at a time when he was not physically
up to that task. Primarily, however, Hull wanted to show by example that a
theorist himself should be willing to publish the failures as well as the successes of
his theory.

Quantification of the theory. Quantification is a matter of degree, 
varying from indications of the direction of relationships, through state- 
ments of function forms, to exact specifications of all parameters in the 
equations. The ultimate goal of the Hullian approach is to attain the 
final level of exact quantification. 

Quantification is important enough in its own right to merit serious 
attention because only a fully quantified theory can yield exact pre- 
dictions and therefore permit more rigorous tests. An even more pressing 
motive arises from the fact that a theory at a lower level of quantification 
simply cannot deal with situations in which several variables which have 
opposite effects upon performance are varied at the same time. That is, 
if one variable favors one condition and another variable favors the 
other condition, then no unique solution can be gained from the algebra 
of inequalities. 
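
The indeterminacy can be illustrated with a minimal sketch. It assumes, purely for illustration, that two variables combine multiplicatively (habit times drive, the form commonly associated with the Hull-Spence tradition); the function names and all numbers are hypothetical. Both cases below satisfy the same inequalities, yet they order the two conditions differently, so the inequalities alone cannot decide the outcome.

```python
def excitatory_potential(habit: float, drive: float) -> float:
    """Illustrative combination rule (an assumption): E = habit * drive."""
    return habit * drive


# Hypothetical values as (habit, drive). In BOTH cases condition A has the
# greater habit and condition B has the greater drive.
case_1 = {"A": (0.9, 0.5), "B": (0.6, 0.6)}
case_2 = {"A": (0.7, 0.5), "B": (0.6, 0.9)}


def favored(case: dict) -> str:
    """Return the condition with the larger excitatory potential."""
    potentials = {name: excitatory_potential(h, d) for name, (h, d) in case.items()}
    return max(potentials, key=potentials.get)


print(favored(case_1), favored(case_2))
```

Only when the magnitudes are specified, as in a fully quantified theory, does the comparison have a unique answer.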

Of the difficulties which must be overcome in order to achieve full 
quantification, the most central concerns the unit of measurement. Data 
collected in psychological laboratories, even when detailed enough to 
permit the fitting of functions, usually seem to be unique to the particular 
apparatus and species of organism involved. The order of numbers varies 
markedly from situation to situation; achieving a general theory, stated 
in exact quantitative form, would, therefore, appear unlikely. 

It is not yet clear how this problem will be solved, but two possible 
lines of attack upon it may be mentioned. 

One is represented by Hull's quantification methodology. There is no 
need to describe this methodology here [see 74; see also 27, 28, 88] nor 
to repeat some of the difficulties with Hull’s use of it [see 104]. The work 
with Hull’s quantification methodology should be viewed as primarily 
illustrative, since Hull wished to demonstrate how it could be used, and 
how it would affect the postulates, if properly implemented. Adequate 
testing of the procedure remains to be done, but what is important in this 
context is to recognize that it is an attempt to scale performance in 
terms of a unit (based on the sigma of oscillation) which might not be 
peculiar to a particular situation. That is, it might be possible to trans- 
form raw data, the units of which vary from situation to situation, into a 
more nearly universally applicable unit. 

Another possibility of surmounting the problem posed by varying 
units of measurement may be termed “relative” quantification. Such an 
approach would succeed if the parameters in the various postulates bear 
the same relationships to each other even though their absolute sizes vary. 
Thus, if inhibition always grows at (say) twice the rate of habit, these 
postulates could be quantified relative to each other. To employ such a 
theory in a new situation would require a pilot study to obtain the abso- 
lute size of some parameter from which the remaining ones could then be 
estimated. 
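
A minimal sketch of the "relative" procedure follows. It assumes that the ratios among parameters are already known and that a pilot study supplies the absolute size of one of them; the function name, the parameter names, and the numbers are hypothetical.

```python
def absolute_parameters(relative: dict, pilot_name: str, pilot_value: float) -> dict:
    """Scale parameters known only relative to each other by one pilot estimate."""
    scale = pilot_value / relative[pilot_name]
    return {name: ratio * scale for name, ratio in relative.items()}


# Hypothetical ratios: inhibition grows at twice the rate of habit.
relative_rates = {"habit": 1.0, "inhibition": 2.0}

# Hypothetical pilot result: the habit growth rate in the new situation.
params = absolute_parameters(relative_rates, "habit", 0.03)
print(params)
```

The pilot estimate fixes the scale, and every remaining parameter follows from its assumed ratio to the estimated one.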

Implementation of either of these approaches to quantification is a 
project of considerable magnitude, and should reasonably be preceded by 
more detailed testing of the postulates at a greater-than level. The size of 
the task results mainly from the fact that the entire theory must be 
quantified at the same time.* Since theorems are implied by the system 
as a whole, numbers must be in all postulates in order to derive numerical 
theorems. 

Other problems, hardly less serious, face the quantitatively oriented 
theorist. For example, the parameters may differ markedly from subject 
to subject so that conventional techniques of averaging the performance 
measures may produce serious distortions of any individual function. 
Spence [180] has described the techniques, based upon the selection 
of homogeneous groups of subjects, that he has been developing to solve 
this problem. 

Both Hull and Spence have demonstrated their belief that the im- 
portance of quantification justifies the effort required. And despite the 
apparently immense difficulties, there is no way to find out to what extent 
a behavior theory can be quantified except by trying. 


THE THEORETICAL MODELS 


In this section, the independent, intervening, and dependent variables 
of the theory are identified and their interrelationships described. In 
doing this, the main focus will be on the theory as presented by Spence 
in his Silliman Lectures. However, he did not discuss all the kinds of 
simple learning data and so did not deal with all the variables that he 
has described in other sources and that are included in Hull’s theory. In 
order, therefore, to provide a more comprehensive picture, ideas ex- 
pressed elsewhere have been included when Spence did not state any- 
thing to the contrary. 

A detailed description of Hull’s theory in these regards has been 
made by Koch [88]. In his analysis, Koch distinguished between 
systematic and empirical variables, and this distinction will be main- 
tained here. Concepts to which the intervening variables are anchored 
are called “systematic” variables, and specific instances of these con- 
cepts, “empirical” variables. Such variables are not different in kind 
but differ in their relative level of abstractness. Thus, one might tie in- 
centive to the weight of a particular kind of food, or more abstractly, 
to quantity of food reward, or still more abstractly, to amount of rein- 
forcement, etc. Whether quantity of food reward, for example, is a 
systematic or an empirical variable depends on the level of abstractness 
at which the theorist attempts to work, being a systematic variable if 
incentive is anchored directly to it, and an empirical variable if incentive 
is anchored more abstractly. 

* At the early stages, some postulates may not need to be quantified at all 
levels, provided they are quantitatively specified at the level being used. Thus, for 
example, if the theorist assumes that no reactive inhibition is generated by widely 
distributed trials, then he may temporarily ignore that variable since it is 
quantitatively specified at zero. 

More abstract concepts provide greater generality but are difficult to 
formulate adequately. Frequently, therefore, the intervening variables are 
anchored informally at one level and more formally at another. If, at 
any particular time, the theorist is not able to provide a satisfactory 
anchoring for more than a limited portion of what he expects ultimately 
to achieve, he may give the more general formulation as an informal 
suggestion to permit trying the theoretical structure in areas outside its 
more formally relevant ones. For example, were amount of reinforcement 
adequately defined to include all sources of reward, it would be a rela- 
tively abstract concept; and although incentive is formally anchored to 
quantity of food reward, it may be informally employed when dealing 
with other sorts of situations. Similarly, the hunger drive is formally 
defined by hours of food deprivation, but is also conceived in terms of 
any operations producing hunger. 

This duality exists because available empirical evidence typically per- 
mits more complete and quantitative statements about some variables 
than about others and also because of the difficulty of providing more 
abstract definitions. Formulating such definitions is an important task, 
but the absence of them should not be too disturbing at the present time. 
One may concentrate on determining the interrelations among known 
instances of concepts even before adequate comprehensive definitions of 
them have been formulated. Thus, so long as the defined concept “food 
to a deprived organism” is accepted as a member of the larger, partially 
defined class “reinforcers,” one can study the effect of this known in- 
stance on behavior without being immediately concerned with the more 
abstract definition. Indeed, just such sorts of interrelations are used in 
formulating the definition, because the adequacy of a tentative definition 
is tested by comparing the interrelations of new instances that it suggests 
with those of known instances. 

For this reason, theorists interested in the detailed, quantitative as- 
pects of the system use terms defined by their studies, i.e., reinforcing 
events are ones previously shown to be reinforcing events, drives are 
operations previously shown to produce drive, etc. The circularity of this 
procedure, on the defining instance, is well recognized, but is avoided 
by generalizing the operations to new situations. One might prefer to 
have the defining characteristic be something other than the property under 
concern. For example, notions such as drive reduction, stimulus-intensity 
reduction, etc., may be viewed as attempts to define reinforcement by 
some attribute other than the performance effect which it is invoked to 
explain. This would be advantageous should it happen to be easier to 
determine whether an event has the independent defining property, but 
is not essential to the approach. 

Finally, before describing the variables of the theory, it should be 
noted that one of the potentially most fruitful ways to anchor input 
variables has not been used extensively in the formal postulates. This is 
the identification of some mechanism through which independent vari- 
ables affect intervening variables. This can be illustrated by the fractional 
anticipatory goal response rg, which is the best known of these 
mechanisms and which has been suggested by Spence as underlying in- 
centive motivation. Rather than anchoring incentive directly to the num- 
ber of trials and the amount and delay of reinforcement, as has been 
done formally, it could be tied to them indirectly through their effect 
upon the amplitude of rg. This would be advantageous because, by at- 
tributing to rg the properties of overt responses, one automatically covers 
a number of situations indirectly and does not have to provide separate 
postulates for them. Thus, how partial reinforcement affects incentive is 
unspecified in any of the explicit postulates, although it would follow 
deductively were the rg mechanism used. The same holds for a change in stim- 
ulus conditions, distribution of practice, and many others. However, not 
only is it difficult to formalize the rg mechanism but it would increase 
the labor involved in making deductions because one then also would 
have to solve the theory for rg. When this is done, other factors must be 
considered. For example, rg, as a classical conditioned response, should, 
during learning, have only a probability of occurrence. This offers the 
possibility of removing drive, oscillation, and threshold from instrumental 
response tendencies directly, because the quantitative properties of those 
intervening variables could then enter through the response tendency for 
rg. However, other characteristics, such as reactive inhibition, may have 
undesirable implications since rg might fatigue during a behavior chain. 
It must be emphasized that rg is an intervening variable and not a re- 
sponse of the real organism, and the theorist can therefore ascribe to it 
any properties he wishes, e.g., that it is not a fatiguable response. But 
since such restrictions would encroach upon the intended parsimony, it 
has seemed prudent to leave the mechanisms as informal suggestions 
until the details have been more thoroughly worked out. 

Summary of Spence’s theory. In describing the theory, emphasis is 
placed on those aspects which offer interesting contrasts and which ap- 
pear to be important at the present time. The description is incomplete, 
therefore, and some matters are treated in greater detail than others; 
hence, it may be useful first to give a rather general picture of the theory 
as formulated by Spence. 

As has already been said, the purpose of the theory is correctly to 
imply known and unknown S-R laws. This is done by the introduction 
of intervening variables which are anchored first on the stimulus, or 
input, side and then on the response, or output, side. The pivotal con- 
struct is excitatory potential since the independent variables feed into it 
and the dependent variables are determined from it. Because Spence 
described a two-factor theory, we must look at classical and instrumental 
conditioning separately. 

The excitatory potential of a classical conditioned response is in- 
creased by two prior intervening variables, habit and drive, each of 
which is in turn related to manipulatable stimulus and antecedent con- 
ditions. Habit is the major associative variable linking the response to the 
stimulus and is acquired gradually as a function of reinforced trials. Nor- 
mally, the pairing of the conditioned and unconditioned stimuli is suffi- 
cient. If, however, the UCS itself does not provide a reinforcing state of 
affairs (food, pain reduction, etc.), then habit will not increase unless 
some other source of reinforcement exists. Habit can also be developed by 
generalization from similar stimuli to which the response has been con- 
ditioned. Drive is the major motivation variable, aggregating the primary 
and secondary needs present at the time. The primary needs depend 
upon deprivation of food, water, and sex object, and upon the intensity 
of any noxious stimuli; the secondary needs depend upon the presence 
of any stimuli previously paired with noxious stimuli. Although drive is 
not typically referred to as an associative variable (linking the stimulus 
to the measured response), it may change during the course of classical 
defense conditioning because the cues in the situation are paired with 
the noxious unconditioned stimulus. In a similar vein, needs have char- 
acteristic stimuli which are a part of the total stimulus complex and so 
become conditioned to the response. 

The excitatory potential of a classical conditioned response is reduced 
by several inhibitory variables. The major inhibitory factor develops as a 
result of nonreinforcement. Reactive inhibition, or fatigue, was not dis- 
cussed by Spence but he would presumably employ such a variable when 
dealing with responses requiring different amounts of work. Two other 
notions have inhibitory properties: oscillation, which reduces excitatory 
potential by an amount that varies from moment to moment according 
to a normal probability function; and the threshold, which is the value 
of excitatory potential below which the response will not occur. 

From the excitatory potential determined in the above way, one can 
predict the probability of occurrence of a conditioned response by ascer- 
taining the likelihood that, on any trial, oscillation will not have re- 
duced excitatory potential below threshold. Response amplitude is pre- 
dicted as a simple linear function of excitatory potential. Spence did not 
derive the latency of a classical conditioned response from his model, 
although he presents data on this measure and suggests that it may be 
part of what gets learned (a micromolar assumption). Similarly, he does 
not derive resistance to extinction of a classical conditioned response 
both because there were no data available and because a number of fac- 
tors, such as a change in drive, enter into conventional extinction 
procedures. 
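
The computation of response probability just described can be sketched as follows. This is a minimal illustration, not Spence's own formulation: it assumes that habit and drive combine multiplicatively, that inhibition is subtracted, and that oscillation is a normally distributed amount subtracted on each trial. The function names and all parameter values are hypothetical.

```python
from statistics import NormalDist


def classical_excitatory_potential(habit, drive, inhibition=0.0):
    """Assumed combination rule: E = habit * drive - inhibition."""
    return habit * drive - inhibition


def response_probability(excitatory_potential, threshold, osc_mean, osc_sd):
    """Probability that oscillation O leaves E above threshold.

    The response occurs when E - O > threshold, i.e. when O < E - threshold,
    with O assumed to be distributed Normal(osc_mean, osc_sd).
    """
    margin = excitatory_potential - threshold
    return NormalDist(osc_mean, osc_sd).cdf(margin)


def response_amplitude(excitatory_potential, slope, intercept=0.0):
    """Amplitude as a simple linear function of excitatory potential."""
    return intercept + slope * excitatory_potential


# Hypothetical values for one subject at one stage of training.
e_now = classical_excitatory_potential(habit=0.8, drive=1.5, inhibition=0.2)
print(response_probability(e_now, threshold=0.5, osc_mean=0.5, osc_sd=0.2))
```

When the superthreshold margin equals the mean oscillation, the predicted probability is one half; larger margins give higher probabilities.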

Spence’s treatment of instrumental conditioning differs from that of 
classical conditioning in a number of ways, but all are related to the way 
reinforcement is handled. Instrumental habits develop simply by the 
contiguity of stimulus and response, but otherwise function identically. 
The drive variable is the same for both kinds of situations. With habit 
now independent of reinforcement, the properties of a reward (prin- 
cipally, the amount of it) affect an additional variable acting to increase 
excitatory potential. This is incentive, which is gradually acquired as a 
function of the number of times the reward has been experienced in the 
situation. 

The fact that instrumental habits develop by contiguity suggests that 
the inhibitory factor in classical conditioning which is based on non- 
reinforcement must have a different interpretation in instrumental condi- 
tioning. This variable now represents the quantitative effects of incom- 
patible responses which occur on nonreinforced trials. And a new in- 
hibitory construct is introduced to reflect the effect of any incompatible 
responses which might occur in a delay period imposed between the 
completion of an instrumental response and receipt of reward. Other- 
wise, the inhibitory variables are the same for instrumental as for classical 
conditioning. 

From the instrumental excitatory potential, one can again compute 
probability of response by the same method as for classical conditioning. 
Amplitude is not explicitly considered because a subject may learn par- 
ticular amplitudes under conditions of correlated reinforcement. But 
response speed and resistance to extinction are determined as linear func- 
tions of excitatory potential. 
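
The instrumental case can be sketched in the same manner. The text above says only that incentive acts to increase excitatory potential; the particular form used here, habit multiplied by the sum of drive and incentive, is the one usually attributed to Spence and is stated as an assumption. The function names, slopes, and values are hypothetical.

```python
def instrumental_excitatory_potential(habit, drive, incentive, inhibition=0.0):
    """Assumed combination rule: E = habit * (drive + incentive) - inhibition."""
    return habit * (drive + incentive) - inhibition


def linear_measure(excitatory_potential, slope, intercept=0.0):
    """Response speed or resistance to extinction as a linear function of E."""
    return intercept + slope * excitatory_potential


# Hypothetical comparison of a small and a large reward, all else equal.
low = instrumental_excitatory_potential(habit=0.5, drive=1.0, incentive=0.5)
high = instrumental_excitatory_potential(habit=0.5, drive=1.0, incentive=1.0)
print(linear_measure(low, slope=2.0), linear_measure(high, slope=2.0))
```

Because habit is here independent of reward, the two conditions differ only through the incentive term.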

Besides giving a brief survey of the theory, this sketch also helps point 
out what is meant by holding one’s theorizing close to the available data. 
If a variable involves complications which could equally well be handled 
in several ways, it is simply omitted until more definitive data are col- 
lected. If the definition of a concept cannot now be made at a relatively 
abstract level, one goes on working with instances already identified. If 
different types of situations seem to lend themselves more readily to dif- 
ferent theoretical descriptions, separate analyses are given. Of course, 
one would certainly prefer a general, abstract, and integrative theory, 
but it would be unrealistic seriously to try to construct one in view of 
the limited extent of present empirical knowledge. A less pretentious 
theory may, indeed, better serve the role of guiding research by bring- 
ing more clearly into focus those problems which are ready for experi- 
mental attack. That is to say, a theory that is close to known data is likely 
to stimulate research close to the theory. 

The input intervening variables. The input intervening variables and 
appropriate systematic and empirical independent variables are given in 
Table 2. Since neither Spence nor Hull has provided specific definitions 
for all the terms involved, the statements given reflect the present writer’s 
perception, and not necessarily Spence’s or Hull’s interpretation. They 
are, therefore, only suggestive and are also partial in the sense that other 
operations can later be added as more abstract definitions are formulated. 

There are a number of differences in the input variables as described 
by the several theories being considered. Many of these are primarily in 
wording or emphasis, others refer to underlying mechanisms not ex- 
plicitly postulated, and some reflect relatively important issues. It is 
simply not practicable to discuss all of these in sufficient detail to expose 
the bases for the differences and to describe possible tests of them. Brief 
discussions of each major variable are given in order to identify the more 
important contrasts, and one variable, the time of delay of reinforcement, 
is treated in detail. This variable was selected because it has been dealt 
with explicitly in a variety of ways and therefore offers a particularly 
good illustration of the range of alternatives contained within the ap- 
proach and the interaction of these with the maturing body of psycho- 
logical knowledge. 

Stimulus. The external stimulus complex, called the “distal stimulus” 
by Hull and the “situational stimulus” by Spence, can be described in 
such physical terms as frequency, amplitude, wavelength, etc. Any energy 
impinging on a suitable receptor is presumed to initiate a stimulus trace 
which persists with decaying effect for an interval of time. Of the total 
external stimulus complex, that fraction which is perceived at any one 
time is called the “proximal” or “effective” stimulus. In addition, there 
are intraorganic stimuli based partly on proprioceptive feedback and 
partly on cues to the state of the organism (e.g., drive stimuli). The 
response is made to the stimulus complex composed of the intraorganic 
and effective stimuli. 

The effective and situational stimuli are not isomorphic, but the rules 
by which one determines the effective stimuli from knowledge of the 
situational stimuli are not yet very fully understood. One aspect of this 
problem that has been emphasized by Spence as a result of his work 
on discrimination learning is the importance of receptor-orienting acts. 
Particularly in vision, the effective stimuli depend upon the orientation 
of the receptor, and many behavior situations involve learning such acts 
as well as the behavior being recorded. For this reason, care must be 
taken to insure reception of the relevant cues. 

TABLE 2. THE INPUT VARIABLES 

Empirical independent                           Systematic independent    Intervening
----------------------------------------------  ------------------------  -------------------------------------
Any physical energy impinging on a suitable     Stimulus                  Stimulus trace
  receptor, as, e.g., a superthreshold tone.
Specification of physical energy in frequency   Stimulus quality
  or location, as, e.g., the pitch of a tone.
Specification of physical energy in amplitude,  Stimulus intensity
  as, e.g., the loudness of a tone.
Physical energies present at the same time,     Patterning
  e.g., a tone and a light.

Hours of food deprivation on a regular          Hunger need               Drive
  feeding rhythm.
Hours of water deprivation on a regular         Thirst need
  drinking rhythm.
Hours of deprivation of appropriate sex         Sex need
  object.
Intensity of noxious stimulus, as, e.g., an     Pain need
  electric shock.

Temporal coincidence of stimulus and response   Reinforced trial          Habit (classical conditioning)
  followed by reinforcement.

Temporal coincidence of stimulus and response.  Trial                     Habit (instrumental conditioning)

Temporal coincidence of stimulus and response   Nonreinforced trial       Inhibition (classical conditioning)
  not followed by reinforcement.

Weight of food object given as reward.          Amount of reinforcement   Incentive (instrumental conditioning)
Time between response and receipt of reward.    Delay of reinforcement
Experience of reward in the situation.          Trial

Differences among stimuli in quality or         Stimulus similarity       Generalization
  intensity.

Cumulated work done since rest.                 Fatigue                   Reactive inhibition
Time since response system was active.          Rest

Another aspect of this problem is the notion of afferent stimulus inter- 
action. Hull assumed that each stimulus element is affected by the con- 
text in which it is embedded, so that, for example, a tone in one environ- 
ment is different from the same physical tone in another environment. 
Spence did not discuss afferent interaction in his book, but has elsewhere 
stated that a stimulus element can be affected by contextual cues. When 
in a discrimination learning situation no single element (such as “white” 
vs. “black”) provides a solution, the subject will learn to respond to 
differentially reinforced stimulus patterns or compounds (such as “white 
on the left” vs. “white on the right”). Spence tentatively assumed that 
patterning does not occur unless it is required, whereas Hull’s stimulus 
interaction is presumably always involved. However, these might not 
lead to different implications because, if the effect of stimulus interaction 
is small, it might be observable only under conditions such as those as- 
sumed by Spence to be necessary for patterning. 

One further aspect of the stimulus deserves brief mention; this is 
stimulus intensity dynamism. In his last postulate set, Hull assumed 
that stimuli not only have cue value, but also motivational properties in 
proportion to their intensity. This is generally recognized to be the case 
for very intense stimuli, but dynamism applies also to stimuli below the 
pain threshold. Other things equal, then, performance should be at a 
higher level the greater the intensity of the stimulus, and stimulus inten- 
sity discrimination should be easier if the more intense stimulus is the re- 
inforced one. The writer [101] has elsewhere shown how such implica- 
tions can be derived from Hull’s theory without dynamism by attending 
to the fact that the subject is also learning not to respond to contextual 
cues. Whether there is a dynamogenic effect of stimulus intensity is an 
important question that can be determined from relatively straight- 
forward empirical designs. Spence did not take an explicit stand on this 
question because the research has not been done and because he did not 
deal with problems that would be materially affected by dynamism. 

Drive. Hull attempted a relatively abstract definition of drive based 
upon survival needs of the organism or species. It is recognized, of course, 
that some needs (e.g., oxygen deprivation) may not produce drive, and 
other conditions (e.g., curiosity) might involve drive without apparent 
survival need. Spence has taken the more direct approach of simply 
enumerating the known need-producing operations. 

Spence did describe a possible mechanism underlying drive in situa- 
tions involving noxious stimuli. This mechanism is an emotional response 
elicited by such stimuli. Not only will more intense stimuli lead to an 
increase in this emotional response, but different subjects may become 
differentially responsive when given the same intensity of stimulation. 
These assumptions have led to a number of studies but Spence has 
nevertheless formally anchored aversive drive directly to the intensity of 
noxious stimulation. 

One important feature of the treatment of drive by both Spence and 
Hull is the assumption that all needs present at any one time summate 
into a generalized drive which then activates all habits. For example, 
in so far as its motivational properties are concerned, thirst will energize 
a response learned under hunger-food conditions as well as one learned 
under thirst-water conditions. Specific effects are handled by the assump- 
tion that each need produces characteristic stimuli. Thus, a subject can 
learn to make one response when hungry and another when thirsty be- 
cause the distinctive need stimuli acquire greater habit to the appropriate 
responses. Similarly, a change from one need to another leads to a gen- 
eralization decrement in habit because of the change in need stimuli. 

Habit. Habit is the variable at which the reinforcement-contiguity 
issue is directly aimed. The traditional Hullian assumption is that rein- 
forcement is a necessary condition for the development of any habit. 
Spence, however, has suggested a two-factor theory in which reinforce- 
ment is necessary for classical conditioning but not for instrumental con- 
ditioning.* Spence stated that his position resulted from the assumption 
that the habit strength of instrumental responses is not related to the 
property of a reinforcer. This assumption was highly tentative and was 
based on studies involving the goal gradient. 

Koch [87] and Seward [155] have noted that Hull’s 1949 postulates 
involved essentially a contiguity approach, since the growth in habit is 
dependent only upon some (unspecified) minimum or threshold rein- 
forcement. One can, however, also argue that Hull’s 1949 system is less 
similar to comparable contiguity theories than his 1943 theory, in which 
the increment in habit was partially a function of the amount of reward. 
The S-R contiguity theory of Estes [43], for example, holds that learning 
is independent of reinforcement, but that the amount of this learning 
which is retained does depend on the amount of reward. Thus, it is 
difficult to see any genuine difference in so far as net change in the asso- 
ciative variable is concerned as long as standard reinforcement proce- 
dures prevail. Hull’s 1949 postulate set is more similar to the kind of 
contiguity theory described by Spence for instrumental conditioning, 
where reward affects an incentive variable and has no effect at all upon 
habit. And, as Spence correctly noted, his contiguity theory is more like 
the expectancy approach suggested by Tolman [187]. 

* Actually, Spence dealt only with classical defense conditioning and instru- 
mental reward conditioning, suggesting reinforcement as necessary for the former 
but not the latter. His two-factor approach may, therefore, depend as much 
on the nature of the need, appetitional or aversive, as on the kind of learning 
situation. 

Inhibition. There are two general ways to handle the phenomenon 
of experimental extinction. One is simply to reduce the habit built up 
during acquisition; the other is to posit an inhibitory variable which 
builds up during nonreinforced trials and which offsets the habit. Fol- 
lowing the lead taken by Pavlov as a result of his studies of spontaneous 
recovery and disinhibition, both Hull and Spence have chosen the latter 
alternative. 

Hull attempted to derive inhibition by use of a “stop-responding” 
mechanism. This analysis assumes that making a response produces 
fatigue, which is a kind of drive condition, and that stopping the response 
is rewarded by a reduction in this fatigue. This stopping of the response 
then becomes anticipatory and inhibits the overt response. There are, 
however, a number of difficulties with this mechanism which Hull did 
not adequately resolve [see, e.g., 49]. 

Spence did not deal extensively with extinction but did postulate an 
inhibition variable as a direct function of the number of nonreinforced 
trials in classical conditioning. This is consistent with his earlier analysis 
of discrimination learning [164] in which inhibition was assumed to be 
a direct function of nonreinforcement. In his Silliman Lectures, Spence 
referred to the familiar contiguity assumption of counterconditioning of 
incompatible responses as underlying inhibition in instrumental condi- 
tioning, but he did not describe the analysis in detail nor did he formally 
introduce this mechanism. 

Incentive. Incentive is affected by two major variables: the amount 
and the time of delay of reinforcement. Since these have been treated 
in very much the same way, only the latter is discussed here. 

The time of delay of reinforcement entered into Hull’s 1943 postulate 
set as one of the determinants of the limit to which the habit strength of 
an instrumental response can grow.* Such an assumption implies that the 
performance of groups receiving different delays of reinforcement will 
diverge with training and stabilize at different asymptotes. A consider- 
able body of data supports this implication [e.g., 54, 135, 136, 137, 141]. 

* Delay of reinforcement was not explicitly identified as being involved in 
classical conditioning in 1943. Recently, Spence has suggested that this might 
be an instance of correlated reinforcement, i.e., the delay of reinforcement will 
depend upon response latency when the CS-UCS interval is constant. However, we 
shall restrict ourselves here to instrumental conditioning, and to nonchaining delay 
defined as the waiting time between completion of the instrumental response and 
receipt of reward. 

However, the assumption that delay partially determines the limit 
of habit, coupled with the restriction that habit is relatively permanent, 
implies further that subjects changed from a short to a longer delay of 
reinforcement will not show a performance decrement, notwithstanding 
the fact that other subjects trained throughout with the longer delay 
perform at a lower level. Although some data [59] are in accord with 
this implication, other results [99, 155] show a drop in performance fol- 
lowing a shift to a longer delay. Of course, the shift to an effectively 
infinite delay produces experimental extinction, but this could occur for 
other reasons, e.g., inhibition due to nonreinforcement. 

In his 1949 postulates, Hull removed delay of reinforcement from 
habit, instead having it affect directly a new incentive variable J. Taking 
this literally, a change in delay would be expected to produce an im- 
mediate shift in performance, an implication which is not consistent 
with any known data. That Hull did not intend this implication,⁸ however,
can be surmised from his treatment of changes in amount of reward
in A Behavior System. Since he there assumed that incentive
changed gradually following a shift in amount of reward, even though
the postulates literally imply an immediate change, one may reasonably
assume that Hull would have treated a change in J in a similar fashion.

Before going further, let us identify the differences between the 1943 
and 1949 positions. If all other variables are zero or unity, as appro- 
priate, then: 


1943 set:  ${}_{S}E_{R} = M\,(1 - 10^{-iN})$
1949 set:  ${}_{S}E_{R} = J\,(1 - 10^{-iN})$


where sEr is reaction potential, M is the limit of habit strength, J is that
part of incentive based on delay of reinforcement, N is the number
of reinforced trials, and i is a constant fixing the rate of growth. The most critical difference is that everything on the
right side of the 1943 equation was called habit and could not decrease, 
while only the parenthetical portion of the 1949 equation was irreducible 
habit and J was a part of incentive and could be reduced. Had Hull 
kept his 1943 equation but let habit decay, the result would have been 
qualitatively identical to the 1949 assumption. Quantitatively, however, 
the 1949 equation provides a new degree of freedom, namely the rate of 
change of J following a change in delay of reinforcement. Letting habit 


⁸ Hull had simply not designed the postulates generally enough to handle
changes in reward, and explicitly restricted their use to situations in which the
reward conditions remained constant throughout training and testing.
decrease in the 1943 equation⁹ would imply approach to the new (lower)
asymptote at the same relative rate as original learning, while the 1949 
equation can permit J to change at a different rate. 
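
The three readings of a shift in delay can be compared numerically. The Python sketch below is an illustration only. It assumes a growth curve of the form limit × (1 − 10^(−rate × N)) and treats every change as a constant fraction of the remaining gap per trial. The function names, the rate constant, and the parameter `j_fraction` (the per-trial fraction by which J closes on its new value) are hypothetical and are not taken from Hull or Spence.

```python
def habit_growth(limit, rate, n_trials):
    """Growth toward `limit` over N reinforced trials: limit * (1 - 10**(-rate * N))."""
    return limit * (1.0 - 10.0 ** (-rate * n_trials))


def approach(level, target, fraction, n_trials):
    """Move `level` toward `target` by a constant fraction of the remaining gap per trial."""
    for _ in range(n_trials):
        level += fraction * (target - level)
    return level


def shift_permanent_habit(old_limit, new_limit, rate, n_before, n_after):
    """1943 reading: delay sets the limit of habit, and habit cannot decrease."""
    level = habit_growth(old_limit, rate, n_before)
    if new_limit <= level:
        return level  # no decrement after a shift to a longer delay
    return approach(level, new_limit, 1.0 - 10.0 ** (-rate), n_after)


def shift_reducible_habit(old_limit, new_limit, rate, n_before, n_after):
    """1943 equation with habit allowed to decay: the new limit is approached
    at the same relative rate as original learning."""
    level = habit_growth(old_limit, rate, n_before)
    return approach(level, new_limit, 1.0 - 10.0 ** (-rate), n_after)


def shift_incentive(old_j, new_j, rate, j_fraction, n_before, n_after):
    """1949 reading: performance = J * habit, with J free to change at its own
    rate. j_fraction = 1.0 reproduces the literal, immediate shift."""
    j = approach(old_j, new_j, j_fraction, n_after)
    return j * habit_growth(1.0, rate, n_before + n_after)


if __name__ == "__main__":
    # Ten trials at the short delay, then a shift to a delay with half the asymptote.
    for n_after in (0, 1, 5, 20):
        print(
            n_after,
            round(shift_permanent_habit(1.0, 0.5, 0.1, 10, n_after), 3),
            round(shift_reducible_habit(1.0, 0.5, 0.1, 10, n_after), 3),
            round(shift_incentive(1.0, 0.5, 0.1, 0.3, 10, n_after), 3),
        )
```

Only the third function carries a rate parameter separate from that of original learning, which is the extra degree of freedom referred to above.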

In 1947, Spence analyzed delayed reward in terms of immediate sec- 
ondary reinforcement and, following publication of Hull’s 1949 postu- 
lates, he supplemented his earlier proposal with the suggestion that the 
incentive variable could be related to the fractional anticipatory goal 
response rG [175]. In this analysis, delay of reinforcement is translated
into the stimulus asynchronism for the classical conditioning of rG to the
cues of the instrumental response, longer delays providing less favorable
conditions. Then, either through the dynamism properties of the response-
produced sG, or simply directly, the amplitude of rG is the mechanism
determining incentive motivation. 

The rG-sG analysis explicitly makes incentive a function of the number
of times the reward has been experienced in the situation, i.e., the
number of conditioning trials. Otherwise, the differences between it and
Hull’s 1949 postulates are of a detailed quantitative sort. Primarily, the
rG-sG analysis implies a relationship between the rate of change in performance
following a shift in delay of reinforcement and the rate of
original learning,¹⁰ and also a relationship to stimulus asynchronism data
from classical conditioning, neither of which is necessary using Hull’s 
approach. 

In Behavior Theory and Conditioning, Spence retained the effect of 
delay of reinforcement on incentive, but noted particular doubt about 
it. He assumed the main effect of delayed reward to be the incompatible 
responses which might occur in the delay period and subsequently com- 
pete with the instrumental response. In support of this position, he re- 
ports that the gradient of delay of reinforcement has a much lower, if 
any, slope when trials containing incompatible responses are not in- 
cluded. He then provides data showing that long delays are more detri- 
mental if the delay is suffered under conditions permitting incompatible 
responses to occur than if the animal is closely confined during the delay. 


⁹ There is a possible ambiguity in Hull’s definitions of habit strength in 1943.
The verbal statement that habit grows at a constant fraction of its yet to be attained
level corresponds to the mathematical statement given above only while M
is constant and trials are continuous from zero. Clearly, the mathematical statement
would imply an immediate shift following a change in M and a complete
shift if N were sufficiently large to make the parenthetical portion approximately 
1, i.e., precisely what the 1949 equation implies. The “constant fractional incre- 
ment” is the prior definition and prevails in case of conflict. 

¹⁰ This relationship is not the same as would be expected from the assumption
that delay affects the limit of habit, and that habit is reducible. In such an analysis,
only one variable is changing with N; in the rG-sG analysis, two variables are
changing during original learning but only one following a change in delay.


Finally, he argues, the change from a short to a longer delay of reward
did not produce a performance decrement in the Harker study in which 
the subjects learned to stay at the response mechanism when initially 
trained with the short delay and then did not perform incompatible re- 
sponses during the subsequently introduced longer delay. 

Now it should be noted that an effect of incompatible responses is 
implicit in the Hullian system, although the analysis has not been made. 
Any responses occurring in a delay period will be reinforced by the sub- 
sequent reward. This provides, in effect, a fixed interval reinforcement 
schedule to the postresponse cues, and the responses which occur, even 
though causally independent of the reward,¹¹ are reinforced and learned.
They will then tend to become anticipatory and compete with the instrumental
response. However, since these anticipatory responses are analogous
to anticipatory errors, there should develop a discrimination between
preresponse and postresponse cues, the limit of which will depend on the 
difference between them. Thus, if one could insure the same amount of 
incompatible behavior during a delay period by two groups, for one of 
which the postresponse cues are more different from the instrumental 
cues than for the other, the effect of the delay should be different. This 
implication, of course, would also follow from Spence’s theory. 

What, then, are the differences between the Spence and Hull treat- 
ments of delayed reward? First, it must be noted that the reinforcement- 
contiguity issue is not critical because of the presence of a reward. The 
analyses could differ on the effect of a delay imposed in the goal box 
after the reward, because incompatible responses occurring in such a 
period would not be reinforced and would, therefore, not be learned ac- 
cording to a reinforcement interpretation. A contiguity theory implies 
that they would be learned, but it can be argued that the prior reward 
will have changed the stimulus complex in order to protect the instru- 
mental responses from competition. Such an analysis might still imply, 
however, relatively greater detrimental effect of such a postreward delay 
the smaller the amount of reward. 

Disregarding those variables that are not relevant to the present ques- 
tion, the equations may be written: 

Hull:    ${}_{S}E_{R} = {}_{S}H_{R} \times D \times J \times K$


Spence:  $E = H\,(D + K) - I_t$


In these equations, sEr and E represent reaction potential, sHr and H
are habit strength, and D is drive. Hull separated incentive into two 
variables, J for delay and K for amount of reward; K in the Spence 


¹¹ Skinner [161] has appropriately called these “superstitious” behaviors; see also [81].
equation is primarily dependent upon the amount of reward,¹² although
delay may also affect it. The major effect of delay acts through It, which
is presumably based on incompatible responses occurring in the delay 
period. 

Comparing these equations, if delay affects Spence’s K, it is closely 
comparable to Hull’s combined J and K. However, as we have already 
noted, Spence is particularly tentative on the matter, suggesting that delay
might work exclusively through It. It should be noted that an rG-sG
analysis of incentive would be most consonant with some effect of delay 
per se, but this is an issue which, while clearly formulatable, is not easily 
answered. From the fact that increasing the opportunity for incompatible 
responses increases the detrimental effect of delayed reward, one cannot 
confidently extrapolate that there would be no effect without any incom- 
patible responses. Attempting to eliminate from the data trials on which 
incompatible responses occur is debatable not only because slightly “in- 
compatible” responses might not be observable but also because of the 
danger of eliminating any inferior performance on the grounds that it 
must have included incompatible behavior. The Harker data, showing 
no drop in performance following an increase in delay of reward, are the 
most embarrassing to the assumption that delay affects incentive. How- 
ever, the fact that subjects will learn to select that response leading to the 
shorter delay of reinforcement would appear to favor the incentive ap- 
proach. If the only effect of delay is through incompatible responses, then 
some analysis would have to be provided to show how these responses 
mediate selection of the shorter delay response. 

Actually, however, Spence has avoided such problems because, al- 
though he discussed incompatible responses as the mechanism through 
which delayed reward affects performance, he formally assigned the 
quantitative effects of delay directly to It. Therefore, his formulation
does not depend on the incompatible response analysis, so long as the 
role of delay of reinforcement in determining reaction potential can be 
described by his equation. 

Several interesting implications follow from this equation if delay 
affects only It. First, the interaction between amount and delay of re- 
ward should be additive, rather than multiplicative as assumed by Hull 
and as would be implied by Spence’s theory if delay were also to affect 
his K variable. This interaction has not been studied experimentally as 
yet. Also implied in Spence’s equation is an additive interaction with 


¹² Spence’s K is also affected by a different kind of delay than that being discussed.
When the instrumental response is broken down into segments, or links of
the behavior chain, early links have a delay determined by the time involved in
completing the remaining links. This chaining delay affects K presumably through
the rG-sG mechanism.
drive, which has received experimental support [142]. Finally, a more
complex interaction is between delay and the instrumental habit. This 
interaction is postulated as additive, but since N also affects It (through 
the incompatible response habit strength), testing of this implication 
must be more roundabout, if at all possible. 

Let us now summarize the various treatments accorded delay of re- 
inforcement. In the several formulations, delay has been conceptualized 
in terms of (1) a limiting factor for habit strength, (2) a direct incentive 
factor, and (3) an inhibitory factor. Spence has suggested the rG-sG
mechanism for the incentive approach, and incompatible responses as 
the mechanism underlying the inhibitory approach. From these several 
alternatives, a number of contrasting implications follow. 

Thus, Hull’s original assumption that delay of reinforcement affected 
the limit of habit strength was changed because of the effects of increas- 
ing the time of delay, although quantitative study of such shifts might 
support the original approach were habit permitted to decrease [38]. 
From the incentive and inhibitory approaches come further questions in- 
viting detailed experimental test: the interactions of delay with amount 
of reward, and with trials of the instrumental habit, the effects of incom- 
patible responses, the relative effects of delay on different responses, and 
the like. It is the answers to such questions which permit the theorist to 
select wisely from among various possible theoretical approaches. The 
active existence of a number of alternative interpretations can be traced 
largely to the deficiency of such systematic data. 

The intervening variable relationships. In his last postulate set, Hull 
combined the intervening variables to determine reaction potential 
substantially as follows:¹³

$({}_{S}H_{R} \times D \times V \times K \times J) - {}_{S}I_{R} - I_{R} - {}_{S}O_{R} - {}_{S}L_{R}$

This equation is applicable to both classical and instrumental conditioning.
In it, sHr (habit) is determined by the number of reinforced trials,
D (drive) by the conditions of drive such as food deprivation, V
(dynamism) by the intensity of the stimulus, and K and J (incentive) 
respectively by the amount and the time of delay of reward. 


¹³ This equation gives the spirit of Hull’s equation, but omits some of the
characteristics which appear to involve problems Hull did not fully resolve. For
example, Hull had two Vs in his system, one V1 for the original training stimulus
and another V2 for the testing stimulus. Not only does this have undesirable mathematical
consequences, but requires additional specification of what to do with
V2 during training, and how long to carry V2 during testing. Also, the delay of
reinforcement has been included through Hull’s J, although his last equation looks
somewhat different. And the inhibitory variables have been removed by arithmetic
subtraction rather than by behavioral subtraction because of the effect the latter
would have on extinction curves.
Opposing these excitatory variables are two inhibitory ones: sIr
(conditioned inhibition) and Ir (reactive inhibition). The latter is
dependent on the work done since rest, and the former is a conditioned
tendency to inhibit the response based on the reinforcement (reduction
in Ir) occasioned by resting at the termination of the response.¹⁴ The
remaining two variables are calculational devices and are not tied to
empirical variables. sOr is a random variable inducing an oscillation of
reaction potential from moment to moment, and sLr is the reaction
threshold below which the response cannot occur. 

Spence employed two combinations for determining reaction potential:


classical conditioning:     $(H \times D) - I_n - I_o - L$
instrumental conditioning:  $(H \times (D + K)) - I_t - I_n - I_o - L$


As in the Hull system, H (habit) is related to the number of reinforced 
trials for classical conditioning, but unlike Hull’s system, the instru- 
mental H is dependent simply on the number of trials. D (drive) is again 
dependent on conventional deprivation and noxious stimulus conditions. 
K (incentive) explicitly includes the number of reinforcement experiences, 
and is partially dependent on the amount of reinforcement. In addition, 
Spence has clearly related K to within-chain delay, which refers to the 
time between the completion of one link of a behavior chain and the 
completion of the remaining links. Nonchaining delay, which is the time 
after the entire instrumental response is completed until the reward is 
received, may affect K but Spence expressed doubt about this. 
Nonchaining delay is the major variable determining It, and since
incompatible responses are the possible mechanism underlying It, it also
includes the opportunities available in the situation for incompatible behavior
to occur and the number of delayed-reward trials on which such
behavior might get learned. In is the analogue of Hull’s sIr and, for
classical conditioning, is tied directly to the number of nonreinforced 
trials. For instrumental conditioning, however, In expresses the quantita- 
tive effects of incompatible responses which occur during extinction and 
are then learned by contiguity; but since this mechanism is not yet 


¹⁴ Those interested in this kind of mediated conditioned inhibition as a derived
principle should note that sHr is no longer affected by the amount of reward,
hence sIr would have to be conceived as having some kind of incentive component
if the amount of Ir reduction is to affect it.

¹⁵ The intensity of the UCS, which determines the amount of reinforcement in
classical conditioning, could enter as affecting the limit of H (the assumption
Hull took in 1943), or as evoking a unique amplitude of response (the micromolar 
assumption). Although leaning toward the latter, Spence followed his characteristic 
approach of reserving the decision until more definitive data have been collected. 




fully developed, In is formally related to the number of extinction trials 
in instrumental conditioning. 

The two calculational devices, Io (oscillation) and L (threshold), 
serve the same function for Spence as for Hull. However, Spence has also 
assumed that both of these can be affected by the experimental condi- 
tions. Thus, oscillation is at least partially dependent upon variations in 
the effective stimulus and in drive which, while never identical from 
trial to trial, may be better controlled in some situations than in others. 
The threshold depends on the nature of the response so that, for example, 
if a rat were required to jump across a gap in order to get to the goal 
box, the threshold would vary with the length of the gap. Nevertheless, 
these variables are constant for groups of subjects in any particular 
situation. 

We have already discussed the major contrasts between the Spence 
and Hull treatments of the input variables. In so far as the relationships 
among these variables are concerned, the most critical difference is that 
Spence has assumed that the motivational variables in instrumental 
conditioning (drive and incentive) interact additively, whereas Hull as- 
sumed that this interaction was multiplicative. In general, a multiplicative 
interaction implies that if any variable is zero, then the product is also 
zero. Thus, both drive and incentive must have some appreciable value 
in order for the response to occur. An additive interaction, on the other 
hand, implies that neither drive nor incentive is a necessary condition of 
performance, although at least one must be present. In the studies 
directed at this question, the data favor the additive relationship assumed 
by Spence. There have not been enough of these studies, however, and 
those conducted have not included near-zero values of either variable. 
Therefore, the answer to this question is still in doubt. 
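
The difference between the two rules of combination shows up clearly in a small calculation. The following Python sketch restricts each equation to habit, drive, and incentive and leaves out the inhibitory, oscillation, and threshold terms. The function names and the numerical values are illustrative assumptions only.

```python
def hull_potential(habit, drive, incentive):
    """Hull: the motivational variables multiply, H * D * K."""
    return habit * drive * incentive


def spence_potential(habit, drive, incentive):
    """Spence: drive and incentive add before multiplying habit, H * (D + K)."""
    return habit * (drive + incentive)


def incentive_effect(model, habit, drive, k_low, k_high):
    """Change in excitatory potential produced by raising incentive at a fixed drive."""
    return model(habit, drive, k_high) - model(habit, drive, k_low)


if __name__ == "__main__":
    habit = 0.8
    # Zero drive with some incentive: only the additive rule predicts a response tendency.
    print(hull_potential(habit, 0.0, 0.5), spence_potential(habit, 0.0, 0.5))
    # The effect of incentive at two drive levels: constant under the additive
    # rule, proportional to drive under the multiplicative rule.
    for drive in (1.0, 3.0):
        print(
            drive,
            incentive_effect(hull_potential, habit, drive, 0.2, 0.6),
            incentive_effect(spence_potential, habit, drive, 0.2, 0.6),
        )
```

A factorial experiment crossing drive with incentive would therefore separate the two rules most sharply when one of the variables is near zero, which is the region the studies mentioned above did not sample.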

The output variables. After the input intervening variables have been 
determined from the appropriate independent variables and have been 
combined into reaction potential, the chain must be completed by relating 
reaction potential to the response. Technically, one calculates the expected
response r of the model which is then related to the response R
of the organism.¹⁶ But since r and R are assumed to be isomorphic, it is
conventional to go directly to R from reaction potential. 

Behavior, as the dependent variable in psychological research, may
be measured by a variety of objective operations: latency, speed, ampli- 
tude, magnitude, force, duration, rate, resistance to extinction, prob- 


¹⁶ The logic of this step is more clear when r and R are not isomorphic. Consider
an electrical representation of a mathematical model. The theorist might not
build the model so as to vary in speed of response, letting some other attribute r,
say the number of flashing lights, indicate the speed R at which the organism is
expected to respond.
ability, errors, trials to criterion, etc. These are the potential systematic
dependent variables, of which four, latency (or speed), amplitude, re- 
sistance to extinction, and probability, have been explicitly related to 
reaction potential. Per cent correct responses in trial-and-error and 
discrimination situations has been related to the difference between the 
competing response tendencies. Presumably, other response measures can 
be derived from these, whereas others may require additional translations 
of reaction potential. 

As with the independent variables, adequate definitions of the 
systematic dependent variables have not been made at a high level of 
abstractness. There are formidable problems to be resolved when one 
attempts to formulate such definitions. For example, latency is the time 
required to initiate a response. If one is running rats in an alley, one 
must decide how rigidly to control the subject’s orientation at the time 
the door is opened, whether to provide any distinctive ready signal, and 
when to define the response as having begun. Certainly one’s measures 
are affected by these decisions. And comparable problems of selecting 
criteria are involved for each response measure. 

These problems are of major significance when one is attempting 
fully to quantify the theory. They are of less importance for greater-than
kinds of studies provided the criteria used are the same for groups being 
compared. Thus, any condition which leads to shorter latencies defined 
as the time to get 2 in. out of the start box will most likely show the same 
relationship if latency is defined as the time to get 4 in. out of the start 
box. Accordingly, one can presumably interpret the definitions of the 
response measures quite liberally for most situations. 

Latency is the time required to make the response after the stimulus 
is presented. Under typical conditions, time can be measured through 
any specified stage of the response, including time to complete it. When, 
however, the response is a fairly long behavior chain, it is frequently 
broken down into time to complete arbitrary links of the chain. It is now 
becoming conventional to take the reciprocals of these time scores and 
report data in terms of speed, since this emphasizes differences in the 
short time range and inverts the measure to one that has a positive 
relation to reaction potential. 

Speed is used predominantly in instrumental conditioning, instances 
being the speed of running an alley, pressing a bar, etc. Hull simply 
postulated a function relating reaction potential to response speed, but 
Spence [179] has since shown how it can be derived in terms of the time 
required for a superthreshold reaction potential to chance to occur [see 
also 183]. That is, the response takes place when, as reaction potential 
oscillates from moment to moment, a superthreshold value occurs. 

There is an advantage in deriving the response measure from prop- 
erties already attributed to reaction potential over introducing a new
postulate for this purpose. Clearly, one gains in parsimony; the theorist 
loses a degree of freedom in that he can no longer correct the theory at 
the terminal end by simply postulating whatever function works best. 
This is very much like the anchoring of the input variable through 
mechanisms. That is, oscillation and the threshold may be viewed as 
mechanisms through which reaction potential gets translated into certain 
measures of performance, in this case, the latency or speed of response. 

These same mechanisms also apply to probability of response. This 
measure is most commonly used in classical conditioning, where per- 
formance is monitored over a finite interval of time to determine whether 
or not a response occurred. Prediction concerning probability is made 
by reference to the likelihood that reaction potential will chance to be 
superthreshold sometime during the selected interval of time. 
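
How oscillation and the threshold can yield both latency and probability may be seen in a simplified calculation. The Python sketch below is not the derivation given in [179]. It assumes, for illustration only, that time is divided into discrete moments, that the oscillation is normally distributed about reaction potential with standard deviation `sigma`, and that successive moments are independent. All names are hypothetical.

```python
from statistics import NormalDist


def p_superthreshold(potential, threshold, sigma):
    """Chance that momentary reaction potential exceeds the threshold."""
    return 1.0 - NormalDist(mu=potential, sigma=sigma).cdf(threshold)


def expected_latency(potential, threshold, sigma):
    """Mean number of moments until the first superthreshold value
    (the mean of a geometric distribution)."""
    return 1.0 / p_superthreshold(potential, threshold, sigma)


def p_response_within(potential, threshold, sigma, n_moments):
    """Chance of at least one superthreshold value in an interval of n moments."""
    p = p_superthreshold(potential, threshold, sigma)
    return 1.0 - (1.0 - p) ** n_moments


if __name__ == "__main__":
    for potential in (0.5, 1.0, 2.0):
        print(
            potential,
            round(expected_latency(potential, 1.0, 1.0), 2),
            round(p_response_within(potential, 1.0, 1.0, 3), 3),
        )
```

Under these assumptions a higher reaction potential shortens the expected latency and raises the probability of a response within the observation interval, with no separate postulate for either measure.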

The third measure which has been explicitly considered is amplitude. 
Both Hull and Spence restricted this measure to classical conditioning, 
where it refers to such things as the amount of change in skin resistance, 
the extent of eyelid closure, etc. Analogous measures in instrumental 
conditioning, such as the force the rat exerts on a bar, were excluded be- 
cause of the possibility that particular amplitudes might be learned as 
different micromolar responses. No mechanism of amplitude has been 
proposed, its relation to reaction potential being simply postulated as a 
linear function of the reaction potential present at the moment of 
evocation. 

Similarly, a separate postulate was given relating reaction potential 
to the number of consecutive nonreinforced trials required to extinguish 
the response. Actually, one mechanism for resistance to extinction is 
already available, namely, the amount of inhibition necessary to reduce 
reaction potential to threshold. However, extinction may involve a 
number of complications: the drive may be changed, frustration may be 
introduced, varying potentiality of competing responses may exist, the 
effective stimulus complex may be different, spontaneous recovery may 
be involved, etc. Extinction must be studied in greater systematic detail 
before a satisfactory mechanism can be proposed. 

The above four measures apply to classical and instrumental condi- 
tioning, where records are taken of only one response class and discrete 
trials are given. In the free-responding situation, where the subject is left 
in the apparatus and can respond whenever he chooses, the typical 
measure is response rate. The interpretation of reaction potential in terms
of this measure is an important undertaking but it has not yet been done. 
Rate would seem most clearly related to speed, since the faster one 
response is made, the sooner the next can occur, and so on. However, 
there are other complications involved in the free-responding situation, 
especially with a ratio schedule of reinforcement. Conditions which pro-
duce higher rates thereby lead to greater reduction in drive and more 
accumulated reactive inhibition as well as more reinforced trials. Clearly, 
a large comparative study will be required to relate response rate to 
measures used in the Hullian approach. 

Turning now to situations in which records are kept concerning 
several response classes, two kinds have been considered. One is trial- 
and-error learning, where the subject selects from among competing 
responses; the other is discrimination learning, where the selection is 
among stimuli. A T maze can be used for both: in trial and error, the 
subject is required to go to the same side consistently; in discrimination, 
the arms of the maze are different in some way (say, white and black) 
and the subject is required to enter, say, the white alley regardless of 
its position. Differential reinforcement is given, typically reward for a 
correct response and no reward for an incorrect response, although other 
differences, such as work, punishment, etc., may be involved. 

Treatment of both situations is handled through the oscillation 
mechanism by assuming that that reaction will occur which has the 
highest potential at the moment. The subject will choose the correct 
response every time only if its reaction potential exceeds the competing 
one by an amount greater than the range of oscillation. Otherwise, 
standard statistical procedures are used to determine the probability of 
the correct response, based on the difference between the reaction 
potentials and the variance of the oscillation distribution. 

Various other measures can be derived from the ones described. 
For example, the number of trials to reach a performance criterion in 
a trial-and-error situation, say nine out of ten correct, is translated into 
the number of trials required to produce reaction potentials which are 
sufficiently different to support the criterion performance level. 
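
The statistical step for competing responses can be sketched in the same way. The Python fragment below assumes, purely for illustration, that the two reaction potentials oscillate independently and normally with a common standard deviation `sigma`. Because a normal distribution has no finite range, the probability of a correct choice approaches but never reaches 1, whereas a bounded oscillation would reach it. The function names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def p_correct(e_correct, e_incorrect, sigma):
    """Chance that the correct response has the higher momentary potential.
    The difference of two independent normal oscillations has s.d. sigma * sqrt(2)."""
    return NormalDist().cdf((e_correct - e_incorrect) / (sigma * sqrt(2.0)))


def difference_for(p_criterion, sigma):
    """Difference in reaction potential needed to support a given proportion correct."""
    return sigma * sqrt(2.0) * NormalDist().inv_cdf(p_criterion)


if __name__ == "__main__":
    print(p_correct(1.0, 1.0, 1.0))            # equal potentials give chance performance
    print(round(difference_for(0.9, 1.0), 3))  # separation supporting nine out of ten correct
```

Trials to criterion then follow from whatever learning function states how many trials are needed to build up the required separation.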

In principle, a completely adequate, comprehensive behavior theory 
would be able to make predictions concerning any response measure an 
experimenter might find interesting. The Hullian approach can, indeed, 
already deal with a wide range of measures in the simpler learning 
situations. However, if one considers such a complex response as talking, 
there are not only probability, speed, and force, but also intonation, 
rhythm, volume, pitch, and all the other attributes by which one 
effectively communicates. It is, therefore, clear that, as with all other 
aspects of the theory, the anchoring on the output side is a continuing 
task. 

Degree of formalization. Psychologists face a special hazard arising 
from the fact that they are themselves behaving organisms. Physical and 
natural scientists do not have quite the same familiarity with their sub- 
ject matter and so can more easily avoid intuitive and implicit ideas 
based upon their own introspective experiences. In contrast, the success
of most psychological theories, including the ones here under discussion, 
depends partly on the theorist’s shrewd guesses in analyzing a situation 
and weighing the various factors involved. 

There is only one way to insure that there are no critical implicit 
assumptions, and this is to insist upon a formal statement of the theory. 
If all terms are unequivocally defined, all postulates precisely formulated,
and all steps in the derivation of theorems given in detail, then there can 
be no serious doubt as to what the theory implies. To the extent that 
these goals are not met, there is room for side-stepping embarrassing 
phenomena, predicting on grounds other than provided by the theory 
and providing attractive postdictions. 

There are other undesirable attributes of relatively informal theories.
A notion can be used to advantage in one place and omitted, or at least 
given a quite different interpretation, in another place that is equally 
relevant. An example in Hull’s theory is the need for short-lived stimulus 
traces in classical conditioning, but persisting traces for various sequential 
effects of repeated trials. Another result is that inconsistencies may seem 
more tolerable. For example, Hull derived conditioned inhibition as a 
habit, but whereas other habits are permanent, conditioned inhibition 
cannot be. And, in general, it is difficult to detect the degrees of incom- 
pleteness and redundancy. 

However, even granting all these things, it is simply not practicable 
at the present time to write a fully formalized behavior theory of any 
general significance. Certainly none of the theories we have considered 
meets this goal, although the deficiencies are less apparent if one restricts 
them to the limited range of situations for which they were originally 
constructed. 

There would be no particular virtue in attempting to identify all 
the difficulties in the various postulate sets without, at the same time, 
offering steps toward their resolution. The fact that the theories are not 
fully formalized, however, has the unfortunate result that it is difficult 
to master them solely on the basis of extant writings. Nevertheless, it does 
permit the theorist to fractionate the task of theory construction since 
tentative conventions can be used concerning problems that have not 
been resolved. For example, it is perfectly clear that animals can develop 
specific hungers based upon a deficient diet, but it is currently conventional
to provide a balanced diet and consider hunger as general
rather than specific. Furthermore, a less formal system provides the
flexibility and modifiability necessary at the present stage of psychological 
knowledge, since one can hold a number of alternatives concerning the 
elements in which he is primarily interested while collecting the data
designed to select among them.
It is impossible to hold completely in mind a theory about such
complex phenomena as behavior provides; the theory must be built, 
written, and read in some order, with only a hazy impression of what is 
going to happen in the next section. Almost necessarily, new problems 
will develop as one goes along, and these frequently reflect back upon 
earlier solutions. The theorist must, therefore, come full circle and begin 
again, ideally bringing new data as well as new problems to the task. 
Hull saw particularly clearly that this cycle can go on a great many times 
without the theorist’s achieving a fully satisfactory formulation and that 
one must simply accept the existence of inadequacies if one is to publish 
at all. 

All other contributions considered, the most significant heritage of the 
Hullian approach is that it got the enterprise vigorously under way. 
Probably every author has experienced the sense of accomplishment 
when his work is first roughly laid out. In the present larger setting, 
neither Hull nor Spence has claimed even to have finished this phase, 
much less to have gone beyond it.


THE HULLIAN APPROACH AND EMPIRICAL RESEARCH 


We turn now to a consideration of the Hullian approach in action, 
to the role it has played in stimulating research, and to the degree of 
success with which it has handled the resulting data. To do this com- 
pletely, however, would require considerably more time and space than 
presently available. Most of the articles concerning simple learning 
phenomena that have been published during the past decade have made 
explicit reference to Hull’s theory or to Spence’s many theoretical ideas.
Accordingly, few areas in the experimental psychology of learning would 
not reasonably be included in an exhaustive survey of work relevant to 
the S-R reinforcement approach. 

Therefore, all relevant research areas cannot be treated even briefly. 
It seems appropriate to select those which have been foci of controversy 
between the Hullian approach and its various alternatives. These contro- 
versies have been discussed by a number of writers, and this survey will 
lean heavily on citing the references where a more detailed treatment 
of the issues can be found. The problem of correlated reinforcement is
included because of its importance and the writer’s interest in it and 
is discussed in somewhat greater detail because it has not yet been dealt 
with very much in the literature. Admittedly, this selection leaves out 
such complex phenomena as perception, problem solving, etc., with which 
the theory currently is less concerned. Indeed, even presumably simpler 
phenomena such as exploratory behavior are omitted because they have 
not yet been definitely argued among learning theorists. One can object, 
therefore, that the most embarrassing data have been excluded. 

Perhaps, in reply, the writer’s attitude on such matters should be
made explicit. It is an attitude which emphasizes disproving (and thence 
improving) one’s own theory rather than someone else’s. That a theory 
can or cannot adequately handle an apparent contradiction today may 
arise from so many sources that it is hardly wise to single out one with 
confidence. Thus the postulates may be in error, but in quite another part 
than apparent at first glance. The derivational procedures may be faulty 
or may not yet have been worked out in detail. The analysis of the sit- 
uation may have incompletely specified the important boundary condi- 
tions. Indeed, the phenomena may not even be genuine, but involve 
various artifacts subsequently to be discovered. Our theories are not 
sufficiently explicit nor our laws sufficiently certain to support con- 
clusive disproofs. For that matter, Conant [33] has correctly observed 
that theories are overthrown by better theories, but not by contrary 
evidence. 

The areas described are, therefore, not presented as being crucial but 
rather as having occupied a prominent place in the recent history of 
learning theories. They are concerned with important questions, the an- 
swers to which partially characterize the theoretical approach. It is not 
intended, however, to discount the alternative positions, but only to sug- 
gest that, viewed across all of the issues, the S-R reinforcement approach 
appears to be at least holding its own. 

Reinforcement (as drive reduction) vs. contiguity in classical condi- 
tioning. A one-factor S-R reinforcement theory assumes that reinforce- 
ment is a necessary condition for successful classical conditioning. An 
alternative approach holds that the temporal contiguity of the condi- 
tioned stimulus CS, or its trace, with the unconditioned stimulus UCS 
is not only necessary but itself sufficient. Since Hull adopted a specific 
drive-reduction hypothesis about the nature of reinforcing events, one 
issue that has received considerable attention concerns the hypothesized 
necessity for drive reduction in classical conditioning. 

Among the theorists proposing the alternative position are Guthrie 
[e.g., 57], Tolman [187], Maier and Schneirla [114], Birch and Bitter- 
man [11], and Mowrer [e.g., 128] in so far as autonomic conditioning is 
concerned. In addition to these writers, the issue has been discussed by 
Hilgard and Marquis [65], and by Spence [175] and Kendler and 
Underwood [84].

Data favoring the drive-reduction view have been provided by 
Loucks [106] and Loucks and Gantt [107]. If the unconditioned leg 
flexion in dogs was evoked by direct electrical stimulation on the motor 
side of the central nervous system, then there was no conditioning to a 
buzzer CS. Conditioning did occur under these conditions only if a re- 
ward (food) was given each trial. Furthermore, conditioning occurred 
without this extra reward if the electrical stimulus was applied on the 
afferent side, where irradiation to pain tracts could provide subsequent
pain reduction. 

An argument opposing the drive-reduction view stems from work by 
Mowrer [127], Mowrer and Aiken [129], Mowrer and Solomon [132], 
and Bitterman, Reed, and Krauskopf [13]. They have concluded that the 
form and temporal properties of the termination of a noxious UCS and 
presumably, therefore, the drive reduction do not affect classical defense 
conditioning. This argument has been challenged, however, by Miller 
[121, 122] and Kendler [83]. Further, using the drive-reduction princi- 
ple, Zeaman and Wegner [206] have shown that whether the conditioned 
heart response in humans is acceleration or deceleration depends on how 
the heart rate was changing at the time of the shock (UCS) offset. 

Spence [180] reported data favoring drive reduction in the condi- 
tioning of the human eyeblink. For example, under conditions which 
equate level of drive, conditioning is superior with a strong rather than 
with a weak UCS (airpuff). Thus, it appears that the evidence con- 
sistent with the drive-reduction hypothesis is sufficient to justify its con- 
tinued consideration. 

Reinforcement (as some reinforcing event) vs. contiguity in classical 
conditioning. The theorist need not adopt any specific hypothesis about 
the nature of reinforcement (identifying reinforcing events by previous 
demonstrations of their reinforcing property), yet still assume reinforce- 
ment to be a necessary condition for successful classical conditioning. 
Spence has adhered to this position, pointing to the occurrence of the 
UCS as the normal source of reinforcement. Accordingly, the
reinforcement-contiguity issue in this sense revolves around the hypothesized
necessity of the UCS, or some other identified reinforcing event, in 
classical conditioning. 

That the UCS need not be given each trial is now a clearly estab- 
lished fact. Since the early experiments of Humphreys [75, 76], a num- 
ber of experiments have involved partial reinforcement as summarized by 
Jenkins and Stanley [79]. That partial reinforcement produces only 
moderately slower learning, only a slightly lower asymptote, and a con- 
siderably greater resistance to extinction, appears to contradict a rein- 
forcement interpretation. Several possible solutions to each of these, how- 
ever, have been provided. 

In so far as the rate of learning is concerned, the reinforcement 
theorist points to other identified sources of reinforcement on trials with- 
out the primary reinforcement. Mowrer [125] identified the derived pri- 
mary reinforcement (reduction of the fear conditioned to the CS) in 
avoidance conditioning, an analysis equally applicable to defense con- 
ditioning. In so far as asymptotic performance is concerned, one can ap- 
peal to the factor of varied amount of reinforcement, or one can assume 
that the inhibition produced by nonreinforced trials is partially reduced
on subsequent reinforced trials. Hull [69] provided an analysis of
resistance to extinction following partial reinforcement based on the
persisting traces of nonreinforcement being a component of the stimulus
complex. Accordingly, subjects trained with partial reinforcement have 
learned to respond to a stimulus complex which includes traces of a 
preceding nonreinforcement and which is, therefore, more similar to the 
stimulus complex during extinction. An additional possibility was sug- 
gested by Logan, Beier, and Kincaid [105] based on the number of non- 
reinforced responses which may occur on any one trial. Following con- 
tinuous reinforcement, the subject may respond several times to the trace 
of the CS during an extinction trial, and accumulate inhibition from 
each such nonreinforced response. Partial reinforcement provides extinc- 
tion experience during training to these older ages of the CS trace, 
thereby reducing the number of responses per trial and increasing the 
number of trials necessary to inhibit the response. This can be accom- 
plished not only by partial but by varied reinforcement, as reported by 
these writers, by Crum, Brown, and Bitterman [35], and by Peterson 
[138]. 

It should be noted that each of these analyses identifies the occurrence
of the UCS as a necessary condition in the sense that its continued omis- 
sion eventually results in extinction. That is, for example, the presumed 
fear reduction which reinforces the CR on non-UCS trials in partial 
defense conditioning depends on fear which is itself a conditioned re- 
sponse based on the UCS and accordingly subject to extinction. 

This is illustrated in the special case of partial reinforcement in which 
the UCS is omitted on just those trials when an anticipatory CR occurs. 
This avoidance conditioning procedure was found by Brogden, Lipman, 
and Culler [25] to be superior to nonavoidance conditioning, a fact 
which appears to be a problem for reinforcement theories [see 66]. The 
fear analysis of avoidance conditioning, described by Mowrer [125] and 
Mowrer and Lamoreaux [131] and challenged by Eglash [39] and 
Ritchie [143], is dependent on the reinforcement provided by the UCS 
and implies that the CR will suffer extinction during successful avoidance 
trials. This implication has received support by Schlosberg [146] and 
Sheffield [156]. Sheffield also found that the poorer performance by the 
nonavoidance subjects in the Brogden, Lipman, and Culler experiment 
resulted from counterconditioning, an analysis consistent with the ex- 
perimental findings of Logan [98] where counterconditioning was not 
involved and avoidance was inferior to nonavoidance.*

The reinforcement interpretation of classical conditioning, where
reinforcement is viewed in general as some reinforcing event and in
particular as the occurrence of the unconditioned stimulus, therefore seems
consistent with the findings in these research areas to date.

* Avoidance conditioning might also be viewed as a case of correlated
reinforcement in classical conditioning, i.e., short latency responses are nonreinforced.

Reinforcement vs. contiguity in instrumental conditioning. The theorist
can take one position on the reinforcement-contiguity issue for classical
conditioning and a different position for instrumental conditioning.
Accordingly, a separate problem concerns the necessity of reinforcement 
in instrumental conditioning. 

This issue has been among the most popular for discussion since the 
initial formulation of the law of effect. Among the theorists holding the 
contiguity alternative are Guthrie [57, 58], Estes [43], Tolman [187], 
and Birch and Bitterman [10]. Among the many analyses of the issue are 
Allport [1], Carr [32], Hilgard and Marquis [66], Hilgard [64, 65], 
Koffka [89], and Meehl [120], and also Maltzman [115, 116], Miller 
[122], Mowrer [126], and Spence [175]. Waters [195] and, more 
recently, Postman [140] have provided larger historical surveys of the 
issue.

The research area that has been most deeply involved in the contro- 
versy between reinforcement and contiguity interpretations of instru- 
mental conditioning is so-called latent learning, although this phenom- 
enon probably is more relevant to the response-learning S-R vs. stimulus- 
learning S-S issue. Since the early study by Blodgett [16], over fifty 
studies of latent learning in its several forms have appeared in the psy- 
chological literature, half or more of which have reported positive re- 
sults in the sense that when subjects were for the first time appropriately 
motivated and/or rewarded, their performance suggested some adaptive 
effect of prior experience. 

We shall not attempt to summarize here this large body of data, since 
summaries and analyses may be found in articles by Birch and Bitterman 
[10], MacCorquodale and Meehl [110], Seward [148], Spence [175],
Thistlethwaite [184], Tolman [188], and Wolpe [204]. Despite this large 
amount of research and analytical activity, a systematic delineation of 
just what the empirical phenomena are has not yet been clearly formu- 
lated. 

The S-R reinforcement theory requires some reward for learning to 
occur. But animals who have been given experience in a maze without 
the presence of a known primary reward may, when reward is later 
given in the goal box, very quickly learn to run the maze without error. 
This could be treated as extinction of the exploratory response to the 
culs of the maze, thereby facilitating learning by reducing the incorrect 
competing response tendencies. The more common interpretation is that 
learning (rather than unlearning) occurs during the preliminary exposure,
and the reinforcement theorist must, in this case, assume that
some (admittedly still unspecified) source of reinforcement is available
[see 110]. The fact that performance typically improves without apparent
reward is consistent with this assumption. Following this tack,
a change from no apparent reward to reward is a change in amount
of reward which would now be handled through an increase in the
incentive variable. Consistent with this, Barry and Logan [4] found that
the rat’s speed of running down a straight alley increased at the same 
relative rate following introduction of reward as following an increase 
in amount of reward. 

Latent learning has also been demonstrated by running rats under 
some identified motive-reward condition, and letting them see an 
irrelevant goal object somewhere in the maze. When subsequently 
motivated for this goal object, rats sometimes go directly to the place
where it had been seen. This phenomenon has been handled by the use 
of the fractional anticipatory goal response, assuming that even an 
irrelevant goal object evokes some r_g [181]. When the drive conditions
are changed, this r_g is enhanced and mediates incentive for responses
leading to the now desired goal object. 

There is no question but that such latent learning phenomena have
forced a more detailed elaboration of the S-R reinforcement theory. It
appears to this writer that what were taken as fundamental differences 
between learning theories are becoming markedly attenuated as such 
elaborations are made. 

Response learning vs. stimulus learning in classical conditioning. The 
present approach assumes that the association formed in classical condi- 
tioning is between the trace of the CS and the response evoked by the 
UCS. Other responses may, of course, occur to the CS through response 
generalization, but the direct learning is assumed to be an S-R con- 
nection. An alternative approach argues that the association formed in 
classical conditioning is between the stimuli, and that the CR will, there- 
fore, be anticipatory behavior appropriate to the UCS. 

Although theorists have not been inclined to do so, there is no necessary
restriction against adopting an S-S theory for classical and an S-R
theory for instrumental conditioning, or vice versa. Accordingly, the issues 
are separated here. Spence [174, 175] has analyzed the distinction be- 
tween S-S and S-R theories, and discussions of the general question may 
be found in Kendler [82], Campbell [31], Hilgard [63], Tolman [188, 
189], Smedslund [163], and Hilgard and Marquis [66]. 

One sort of argument concerns the similarity between the CR and 
the UCR. If the association is with the response evoked by the UCS, 
it is argued, then the CR should be a reproduction, at least in fractional 
form, of the UCR. Although this is frequently the case, some experi- 
menters [e.g., 21, 22] have noted qualitative differences between the 
CR and UCR, and Wickens [198, 199] has shown that reorientation of
the subject’s hand may change the CR in human finger withdrawal con- 
ditioning from flexion to extension. However, until adequate definitions 
of the response—including mediating responses as emphasized by Dollard 
and Miller [37]—and of response generalization have been given, it is 
impossible to determine whether such results are incompatible with an 
S-R approach. 

Another area of empirical research called forth in opposition to an 
S-R theory is that of sensory preconditioning. If two stimuli, A and B, 
neither of which evokes the response subsequently introduced, are pre- 
sented contiguously a number of times, and then a response is con- 
ditioned to stimulus B, one may find that stimulus A will also evoke this 
response. Such sensory preconditioning has been reported by Bitterman 
et al. [13], Brogden [20, 23, 24], and Karn [80]. Silver and Meyer 
[159] reported that the asynchronism between A and B follows a 
function similar to that found in conventional classical conditioning, and 
Wickens and Briggs [200] have shown that sensory preconditioning may 
involve mediated or secondary stimulus generalization. More detailed 
discussions of the area may be found in Miller [122] and Sheffield [157]. 
It seems clear that further formalization of the definition of the response 
is necessary before the approach can adequately handle the data from 
these areas. 

Response learning vs. stimulus learning in instrumental conditioning. 
The present approach assumes that the responses instrumental in obtain- 
ing reward become attached to the stimuli coincident with their 
occurrence. As in classical conditioning, then, the direct learning is 
assumed to be an S-R connection. An alternative approach assumes that 
the subject learns a field map of the environment and then behaves in a 
manner appropriate to his needs. 

It should be noted that, even in an S-S theory, response learning 
must occur someplace, since the subject must “know” what to do. 
Tolman [189], for example, following the suggestion of MacCorquodale 
and Meehl [112], explicitly proposed an S-R-S theory, the expectation
that R to one S will lead to another S, where R is a performance defined 
by at least two test situations. 

This is illustrated by studies in spatial learning. In one variant, after 
a response down one alley has been learned, that alley is blocked and a 
number of alternative alleys are made available. Tolman et al. [191] 
reported that subjects chose the most direct path to food, and argued 
that the learning therefore involved approaching the goal place rather 
than the specific motor response of entering the original alley. Gentry et 
al. [46] have reported contrasting results. Another variant is to com- 
pare the relative ease with which subjects can learn to make different 
motor responses to get to the same place as opposed to making the same
motor response to get to different places. Tolman et al. [191] found 
place learning easier, although Blodgett and McCutchan [17, 18], Hill 
and Thune [67], and Thompson and Thompson [185] have found the 
opposite. Galanter and Shaw [45] have shown the importance of an en- 
vironment rich in external cues for successful place learning. 

Perhaps, as Seward [153] has argued, there is no real issue involved 
here. We have already noted that Spence’s and Hull’s formulations in- 
volving incentive motivation approach closely an expectancy position, 
and Tolman’s specification of his principles of performance reduces the 
primarily sensory appearance of his approach. From an S-R point of 
view, questions such as spatial learning help to determine what response 
was learned, not whether a response was learned, i.e., to what extent did 
the subject learn to make a specific motor response and to what extent 
a more molar response of approaching a distinctive cue. The S-R as- 
sumption is that he learns both, which suggests quantitative analyses of 
their relative strengths based upon such information as the distinctiveness 
of the external cues. 

Absolute vs. relational and configurational views of the stimulus in 
discrimination learning. Whenever an organism is required to respond 
selectively to the larger, brighter, etc., member of a pair of stimuli, the 
present approach assumes that each stimulus acquires a separate response 
tendency. These may, of course, be affected by stimulus generalization 
between them as well as by afferent stimulus interaction. One alternative 
approach assumes that the subject compares the stimuli and responds to 
the relationship (e.g., brightness) between them. Another alternative 
approach holds that the subject learns to respond appropriately to each 
stimulus configuration facing it. 

The research area most directly concerned with the absolute vs. 
relational view of the stimulus has been that of transposition. If the sub- 
ject is first trained to select (say) the brighter of a pair of stimuli, 
transposition is evidenced if he subsequently selects the brighter of a new 
pair of test stimuli differing in absolute brightness from the original 
pair. Spence [164, 166, 169] has shown that his absolute position not
only implies transposition based on the generalization of excitatory and 
inhibitory tendencies, but also that the degree of transposition should 
vary in gradient fashion as the difference between the training and test 
stimuli is increased. 
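
The derivation can be illustrated with a minimal numerical sketch. It is not Spence's own computation. The stimulus scale, the bell-shaped form of the gradients, their heights and spread, and the function names are all assumptions chosen for illustration. The sketch sums an excitatory gradient centered on the positive training stimulus and an inhibitory gradient centered on the negative one, and then compares the net strengths of the two members of a test pair.

```python
import math

# Hypothetical stimulus scale: the negative training stimulus sits at 0
# and the positive (brighter) training stimulus at 1.
S_MINUS, S_PLUS = 0.0, 1.0


def gradient(x, centre, height, spread=2.0):
    """Generalization gradient around a training stimulus.

    The bell shape, heights and spread are assumptions, not values
    taken from the text.
    """
    return height * math.exp(-((x - centre) ** 2) / (2 * spread ** 2))


def net_strength(x):
    """Excitation generalized from S+ minus inhibition generalized from S-."""
    return gradient(x, S_PLUS, 10.0) - gradient(x, S_MINUS, 6.0)


def preference_for_brighter(dimmer):
    """Net advantage of the brighter member of a test pair one step apart."""
    return net_strength(dimmer + 1.0) - net_strength(dimmer)


if __name__ == "__main__":
    for dimmer in (0.0, 1.0, 2.0, 3.0):
        print(dimmer, round(preference_for_brighter(dimmer), 3))
```

With these assumed values the preference for the brighter stimulus is positive at the training pair, smaller one step away, and negative two steps away, which is the gradient of transposition the absolute position implies.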

Studies of transposition, e.g., Ehrenfreund [41], Hunter [77], Kendler 
[83], and Thompson [186], have yielded data generally consistent with 
the Spence analysis. An opposing analysis can be found in James [78], 
and some of the studies reported by Kluver [87] are not currently sub- 
sumable under the absolute schema. These are ones in which subjects 
who have learned to select (say) the larger of two circles subsequently
select the larger of two test triangles. Presumably, however, the single- 
dimension analysis by Spence could be extended to deal with qualitative 
as well as quantitative stimulus changes, and this form of transposition 
would then also follow from stimulus generalization. 

Turning to the absolute vs. configurational issue, the opposing 
position has been stated by Gulliksen and Wolfle [56] and argued by 
Bitterman [12] and Weise and Bitterman [197]. This argument revolves 
around a comparison of the ease of learning simultaneous and successive 
discriminations.† In a simultaneous discrimination situation, two alleys
(say, white vs. black) are presented together equally often in the right 
and left positions. The subject is required to consistently select the 
positive (white or black) alley regardless of its position. In the successive 
problem, white and black are never paired, both alleys being white on 
half the trials and black on the other half. The subject is here required 
to go left when the alleys are white and right when they are black (or 
vice versa). 

The successive discrimination problem must be carefully distinguished 
from the single-presentation form of discrimination learning. The latter 
also does not involve joint presentation of the cues. But in that form, 
there is only one alley and the subject is rewarded for responding when 
it is one brightness and not rewarded if he responds when it is the other 
brightness. Therefore, in successive discrimination, each alley separately 
involves single-presentation discrimination, i.e., the left alley is positive 
when it is white and negative when it is black, while the right alley is 
black-positive and white-negative.

Spence’s [178] analysis of the successive discrimination problem 
pointed out that, since no solution based on single cues is available, 
patterning is involved. That is, the problem cannot be solved by always 
going left or right, or white or black. Instead, the subject must learn the 
patterns, “white on the right” and “black on the left.” Spence assumed 
that the subject can learn such a configurational discrimination if re- 
quired to do so, but that such learning should be relatively difficult from 
an absolute point of view.
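
The claim that no single-cue solution exists can be checked by enumeration. The sketch below is offered only as an illustration under stated assumptions: approach tendencies to a color and to a position are taken to combine by simple addition, the candidate strengths are restricted to three levels, and all names are hypothetical. It searches for element strengths that solve each problem and then tries strengths attached to the two patterns.

```python
from itertools import product

# Each trial: (colour of left alley, colour of right alley, correct side).
SIMULTANEOUS = [
    ("white", "black", "left"),
    ("black", "white", "right"),
]
SUCCESSIVE = [
    ("white", "white", "right"),  # "white on the right"
    ("black", "black", "left"),   # "black on the left"
]


def choose(left_colour, right_colour, strength):
    """Pick the alley with the stronger approach tendency (None on a tie)."""
    left = strength(left_colour, "left")
    right = strength(right_colour, "right")
    if left == right:
        return None
    return "left" if left > right else "right"


def solves(problem, strength):
    """True if the strength function yields the correct choice on every trial."""
    return all(choose(l, r, strength) == correct for l, r, correct in problem)


def element_solutions(problem, levels=(-1, 0, 1)):
    """Search additive colour-plus-position strengths (an assumed rule)."""
    found = []
    for w_white, w_black, w_left, w_right in product(levels, repeat=4):
        colour = {"white": w_white, "black": w_black}
        position = {"left": w_left, "right": w_right}
        if solves(problem, lambda c, p: colour[c] + position[p]):
            found.append((w_white, w_black, w_left, w_right))
    return found


# Strengths attached to whole patterns rather than to separate elements.
PATTERN = {("white", "right"): 1, ("black", "left"): 1}


def pattern_strength(colour, position):
    return PATTERN.get((colour, position), 0)


if __name__ == "__main__":
    print(len(element_solutions(SIMULTANEOUS)) > 0)  # element strengths suffice
    print(element_solutions(SUCCESSIVE))             # no element solution
    print(solves(SUCCESSIVE, pattern_strength))      # pattern strengths suffice
```

Under the additive assumption the successive problem requires the right position to be both stronger and weaker than the left, so only strengths attached to the patterns can solve it.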

Studies of the relative ease of simultaneous and successive discrimination
[15, 29, 30, 178] have yielded conflicting results, and related studies
[62, 97] suggest that the successive discrimination problem is a difficult
one for rats. Accordingly, the question is still in doubt, and one can
reasonably expect that additional work will force a further elaboration
of the ways by which the absolute approach handles various kinds of
patterning and relational learning.

† The discussion of this issue has emphasized the nature of the response while
it is here discussed in terms of the stimulus. The absolute view holds that each
stimulus element acquires an approach response tendency, and choice behavior can
be predicted from these combined approach tendencies. Even when configurational
learning is required, the absolute view assumes separate response tendencies to
patterns or compounds of stimulus elements. In contrast, the configurational view
holds that the total stimulus complex cannot meaningfully be decomposed, and that
the subject learns distinctive responses to each total configuration.

Continuous vs. noncontinuous interpretations of discrimination 
learning. Whenever a response is followed by reward, this approach 
assumes that an increment in excitatory strength is added to each effective
stimulus trace coincident with the response. Thus, if the subject
responds to the brighter of two stimuli, and is rewarded, then his tendency
to approach the brighter stimulus on the next trial is increased. The
effective stimulus (that actually impinging on the appropriate sensorium),
however, may change in the course of discrimination learning if the subject
must also learn to orient his receptors in such a way as to receive
maximal differential stimulation. An alternative approach assumes that
the subject tests “hypotheses” in a discrimination problem, and that he
learns nothing except the correctness or incorrectness of the hypothesis
then being tested.

The alternative approach was initially suggested by Lashley [94] 
and has been adopted by Krechevsky [89, 90, 91]. According to this 
position, during the early trials when the subject may be testing an 
inappropriate hypothesis, his response tendencies to the relevant cues are 
unaffected. For example, if the problem is a black-white discrimination,
and the subject is testing a left-going hypothesis, then he is learning 
nothing about the correctness of white or black. Therefore, no detrimental 
effect will result from reversing the discrimination (i.e., making the
initially positive cue negative, and vice versa). Further analyses favoring 
the noncontinuous interpretation may be found in Lashley [95] and 
Tolman [188].

Spence’s theory of discrimination learning [164, 167] clearly im- 
plies a detrimental effect of such a reversal unless the problem requires 
the learning of receptor-orienting acts in order to maximize differential 
stimulation and hence permit discrimination. His analyses of the issue 
[168, 175] have emphasized this point since it may account for the 
apparently contradictory findings of Krechevsky [93] and Lashley and 
Wade [96]. Indeed, Ehrenfreund [41] found a detrimental effect of 
reversal when differential stimulus reception was insured, but not when 
receptor-orienting acts were required. And the large bulk of studies [19,
52, 113, 141, 144, 165, 171, 193] have yielded results favorable to the
continuous interpretation.
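
The contrast between the two interpretations can be put in a deliberately simplified sketch. It is an assumption-laden illustration and not a model taken from either side: the subject is supposed to go left on every presolution trial, white is the rewarded cue and alternates sides, and strengths change by a fixed step. The names and the linear increment rule are hypothetical.

```python
def presolution_cue_strengths(trials, continuous, step=1.0):
    """Cue strengths after presolution trials run under a position habit.

    The subject always enters the left alley; white is the rewarded cue and
    alternates between the left and right positions (assumed schedule).
    """
    strength = {"white": 0.0, "black": 0.0}
    for trial in range(trials):
        left_cue = "white" if trial % 2 == 0 else "black"
        rewarded = left_cue == "white"
        if continuous:
            # Continuity: the cue actually approached gains strength when
            # rewarded and loses strength when not rewarded.
            strength[left_cue] += step if rewarded else -step
        # Noncontinuity: only the position hypothesis is being tested, so
        # the strengths of the relevant cues are left untouched.
    return strength


def reversal_handicap(trials, continuous):
    """Advantage of the original positive cue that a reversal must overcome."""
    strength = presolution_cue_strengths(trials, continuous)
    return strength["white"] - strength["black"]


if __name__ == "__main__":
    print(reversal_handicap(20, continuous=True))
    print(reversal_handicap(20, continuous=False))
```

On the continuity assumption the presolution trials leave a difference favoring the original positive cue, so a reversal should retard learning; on the noncontinuity assumption the difference is zero and no detriment is expected.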

Correlated reinforcement. Correlated reinforcement refers to condi- 
tions in which some dimension of the reward is varied systematically 
with some quantitative dimension of the response. For example, a rat 
can be given more food the harder he presses a bar in a Skinner box.
Because such conditions have not been widely studied or described, and
because they provide a particularly important and interesting research
area, a somewhat more detailed description is necessary here. In doing
so, we shall use the force a rat exerts on a bar, and the amount of reward
he gets for it, as illustrative, but it should be apparent that a similar
description could be made for other response dimensions, such as speed,
other responses, such as running an alley, and other reinforcement
dimensions, such as the time of delay of reward.


Under conventional conditions, the amount of reward is related to
the force the rat exerts on a bar in a minimal, all-or-none fashion. The
rat must press the bar hard enough to be considered a response and
to activate the feeding mechanism, but increasing the force above this
level has no further effect on the amount of reward. This minimum force
can be varied, requiring different subjects to exceed different forces in
order to get reward. A maximum may also be applied, so that force
must fall within set limits in order for the reward to be received. All these
kinds of conditions have been studied, and generally, rats show reasonably
good adjustment to the requirements which prevail.
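
These all-or-none conditions can be written as simple functions from force to amount of reward. The following is a sketch only; the function names, the units of force, and the reward amount are hypothetical and are not taken from any particular apparatus.

```python
def minimum_force(force, minimum, amount=1.0):
    """All-or-none reward: the full amount once force reaches the minimum.

    Force above the minimum has no further effect on the amount.
    """
    return amount if force >= minimum else 0.0


def force_band(force, lower, upper, amount=1.0):
    """Reward only when force falls within set limits (minimum and maximum)."""
    return amount if lower <= force <= upper else 0.0


if __name__ == "__main__":
    # Hypothetical forces in arbitrary units.
    for force in (10, 20, 80):
        print(force, minimum_force(force, minimum=20), force_band(force, 20, 60))
```

Varying the `minimum` argument across subjects corresponds to requiring different subjects to exceed different forces.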

If one permits amount of reward to vary in more than an all-or- 
none fashion, a large number of additional possibilities appear. The rat 
can be given more reward the harder he presses the bar, or the softer he 
presses it, or for intermediate forces. More complex conditions can be 
used if reward varies for the same force of response. One example would 
be when hard presses get a large average reward, but with a large 
variance in amount, whereas soft presses get less reward on the average, 
but with little variance. 

This already enormous number of possible conditions of correlated 
reinforcement can be doubled by adding another dimension of the re- 
ward, say its delay. Delay can be varied with force in all of the ways 
that amount can: harder forces get the reward sooner, etc. And, of 
course, both delay and amount can be correlated simultaneously with 
force in any combination. 
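The conditions just enumerated can be stated compactly as functions from the force of a press to the reward it earns. The following is only an illustrative sketch: the function names, the gram values, and the reward units are assumptions introduced here, not quantities taken from any of the studies discussed.

```python
import random
import statistics

def all_or_none(force, minimum=10.0, maximum=None, amount=1.0):
    """Conventional condition: a fixed amount if force lies within the limits."""
    if force < minimum:
        return 0.0
    if maximum is not None and force > maximum:
        return 0.0
    return amount

def harder_gets_more(force, slope=0.05):
    """Amount of reward increases with force (hypothetical linear rule)."""
    return slope * force

def harder_gets_less(force, top=2.0, slope=0.05):
    """Amount of reward decreases with force, never below zero."""
    return max(0.0, top - slope * force)

def variable_for_hard(force, rng, boundary=20.0):
    """Hard presses: large mean, large variance. Soft presses: small mean, small variance."""
    if force >= boundary:
        return max(0.0, rng.gauss(2.0, 1.0))
    return max(0.0, rng.gauss(1.0, 0.1))

def harder_gets_sooner(force, longest=10.0, slope=0.2):
    """Delay of reward (seconds) shortens as force increases."""
    return max(0.0, longest - slope * force)
```

Amount and delay can be correlated with force simultaneously simply by pairing one amount rule with one delay rule for the same press.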

Finally, if the bar response is performed on a discrete trial basis, 
another response dimension can be added, response speed. Reward can 
then be correlated with force-speed combinations. And there are many 
other quantitative characteristics of the response, such as the pattern of 
forces over a behavior chain on a ratio schedule, with which the reward 
can be correlated; but since few of even the simpler conditions have 
been systematically studied, it would be inappropriate to speculate about 
the more complex ones. 

The brief descriptions given, however, do emphasize the very small 
range of reinforcement conditions which has thus far been studied in
the laboratory. This in turn is important because the more complex 
conditions of correlated reinforcement are common in the natural en- 
vironment. This is most apparent in the case of skills where one must 
adjust the response so as to maximize reward: the force of a golf swing, 
the speed of a billiard shot, the timing of a musical performance, etc. 
But the rewards for many other responses depend upon quantitative char- 
acteristics of the way they are performed: the loudness with which one 
delivers a lecture, the vigor with which one punishes his children, the 
speed with which one applies the brakes of his car, etc. Indeed, most 
everyday behavior is geared to conditions of correlated reinforcement. 

Beyond this, correlated reinforcement is particularly relevant to the 
question of response definition. The nature of this problem and the role 
of correlated reinforcement in answering it are discussed in the next 
section. 

The macromolar-micromolar distinction. Hull’s 1943 postulates were 
explicitly restricted to conditions in which reinforcement was not 
systematically correlated with any response dimension. In A Behavior 
System, however, Hull briefly described a micromolar level of analysis 
with which to handle correlated reinforcement. At the micromolar level, 
quantitatively different behaviors (e.g., different forces) are treated as 
different responses rather than different strengths of the same response. 
Hull’s proposed solution was to consider different forces to be different 
responses whenever reinforcement is correlated with force, and to con- 
tinue using the classical macromolar approach otherwise. 

The writer [103] has argued exactly the reverse, that the classical 
macromolar approach is appropriate only for conditions of correlated 
reinforcement. Thus, the macromolar system can deal with response 
speed because how soon the subject gets the reward is correlated with
how fast he responds. However, since Hull’s proposal was based on an 
analogy with the treatment accorded qualitatively different behaviors, it 
will be useful to review the logic of this latter treatment. 

In predicting the behavior of a rat in a Skinner box, a relatively
molar response class, bar depression, is typically employed. All behaviors
which involve an adequate movement of the bar are aggregated together
notwithstanding any qualitative differences among them. The basis of
aggregation is that all such behaviors are indiscriminately and equally
reinforced. Should, however, one choose selectively to reinforce only
responses by the right paw, the appropriate response class then would be
“right paw bar depression.” Although this latter class aggregates a
narrower range of behaviors, the aggregation is on precisely the same
basis, namely that all members of the class are equally reinforced. Thus,
whether responses with the right paw belong in the same class with left
paw responses depends upon whether they are differentially reinforced.*

Hull proposed to deal with quantitatively different behaviors in an 
analogous fashion. Thus, different forces are treated as different re- 
sponses only if they are differentially reinforced. This is the micromolar 
level of analysis for conditions of correlated reinforcement. 

The analogy, however, breaks down under further comparison. Re- 
call that, on the qualitative side, unless right paw and left paw responses 
are treated as different, one does not distinguish between them either 
in recording data or in making predictions. Instead, because they are 
aggregated into the same response class, predictions and observations both 
concern the probability of occurrence of either paw behaviors indis- 
criminately. Contrariwise, on the quantitative side, when different forces 
are not treated as different responses, the macromolar analysis neverthe- 
less permits observations and predictions about force, now viewed as an 
index of response strength. Consistent application of the analogy would 
require that, at the macromolar level, one does not distinguish between 
forces at all and is concerned only with the joint probability of occurrence 
of behaviors of any force. 

At first thought, one might attempt to justify these divergent ways 
of treating quantitative and qualitative variations in behavior with the 
observation that force does, in point of fact, vary systematically with 
training. Is it not reasonable, then, to hold that this systematic variation 
reflects changes in response strength unless distorted by differential rein- 
forcement of force? Further thought, however, will show that analogous 
changes occur on the qualitative side with training, i.e., the response be-
comes progressively more skillful and efficient. These latter changes, in
so far as one wishes to study them, are dealt with by the theory as dif-
ferent responses, learned selectively on the basis of differential reinforce-
ment. There appears to be no logical distinction, on the grounds of
systematic change with practice, between qualitative and quantitative
dimensions of behavior, and the preponderance of quantitative response
measures can be attributed to the availability of convenient, ordered
units.

There would seem to be two approaches which are consistent with
this analysis. The one that is most apparent in the above comments may
be summarized as follows: the response class to be used is jointly de-
pendent upon the interests of the experimenter and the conditions of
reinforcement. One may classify in a single macromolar class all be-
haviors, differing both quantitatively and qualitatively, which are not dif-
ferentially reinforced, and predict only about the probability of occur-
rence of any member of this aggregation indiscriminately. Within these
upper bounds, if an experimenter wishes to observe and predict about
qualitatively different behaviors within the macromolar class, then he
must shift to a relatively macromolecular level, treating these behaviors
as different responses. Similarly, should he wish to observe or predict
about quantitatively different behaviors within the macromolar class,
then he must shift to a relatively micromolar level in which these varia-
tions are treated as different responses.

*What is meant by behaviors belonging in different response classes is that
separate excitatory tendencies must be calculated. There would presumably be
response generalization between classes, depending upon the degree of difference.

The second alternative is more in keeping with the classical Hullian 
approach and involves treating quantitative and qualitative variations in 
behavior differently. Qualitative variations are handled in the same 
manner as above, i.e., the level of molarity is jointly dependent upon 
the conditions of reinforcement and the interests of the experimenter. 
Quantitative variations, however, are never treated as different responses, 
but instead are viewed as response measures which may be taken as in- 
dices of response strength. 

In following the second alternative, Hull excluded conditions of cor-
related reinforcement, and Skinner [162] has used correlated reinforce-
ment as one argument against the use of measures other than response
rate. It is, therefore, necessary to describe in some detail the equilibrium
solution to the problem of predicting behavior under conditions of cor-
related reinforcement within the classical Hullian approach.

The equilibrium solution. In correlated reinforcement situations,
one must deal with the mutual interdependence of response and reward,
i.e., not only does reinforcement affect response strength, and hence per-
formance, but performance also affects the reinforcement received. These
dependence relations, however, are temporally isolated, performance on
one trial partially determining reward on that trial,* and this reward,
in turn, affecting performance on subsequent trials. An equilibrium
solution is appropriate for such situations.

A system is in stable equilibrium when the predicted variable
simultaneously satisfies the several constraints acting upon it and when
any deviation from such a simultaneous solution leads to forces driving
the variable back toward it. Consider the force a rat exerts on a bar
under conditions in which increasing forces provide decreasing amounts
of reward. His performance is constrained by the two functions shown
in Fig. 1. On the one hand there is the incentive function relating force
to amount, determined under conditions of uncorrelated reinforcement,
which shows that the force the rat exerts on the bar increases as the
amount of reinforcement increases. On the other hand, there is the
function describing the terms available, i.e., the amount that will be
given depending on the force exerted. Simultaneous solution of these two
functions enables determination of an equilibrium.

*We shall ignore the more complex conditions in which response on one trial
determines reward on some subsequent trial, where reward on one trial is deter-
mined by cumulative performance over a series of previous trials, etc.

Suppose, for example, that the rat should exert a force equal to F₁
in the figure. By referring to the terms function, we determine that he
will receive M₁ amount of reinforcement. But referring an M₁ amount
back to the incentive function, we observe that the rat will exert a force
less than F₁ for that amount and so his performance will tend to shift
down toward a lower force more in keeping with the obtained reward.
In an inversely analogous fashion, should the rat exert a force of F₂, the
obtained amount M₂ will be seen to be appropriate to a response force
greater than F₂. It is only force F₀ which will receive just that amount,
M₀, which provides incentive for precisely that F₀ force.*

Fig. 1. Graphic presentation of the equilibrium solution
for correlated reinforcement. See text for a description
of this solution. [Figure not reproduced; recovered axis labels: “Weak” and “Amount of reward.”]
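The graphic argument can also be followed numerically. The sketch below assumes two hypothetical linear functions (an incentive function rising with amount and a terms function falling with force); the coefficients, units, and function names are illustrative assumptions, not estimates from data. It locates the force at which the two functions are simultaneously satisfied and shows that gradual trial-by-trial adjustment from either side moves toward that force.

```python
def incentive_force(amount):
    """Hypothetical incentive function: force sustained by a given amount of reward."""
    return 10.0 + 8.0 * amount

def terms_amount(force):
    """Hypothetical terms function: amount given for a force (harder presses get less)."""
    return max(0.0, 5.0 - 0.1 * force)

def equilibrium_force(lo=0.0, hi=100.0, tol=1e-9):
    """Bisection on the gap between the force the obtained reward supports and the force exerted."""
    def gap(force):
        return incentive_force(terms_amount(force)) - force
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if gap(mid) > 0:
            lo = mid   # reward supports more force than was exerted: equilibrium lies higher
        else:
            hi = mid   # reward supports less force than was exerted: equilibrium lies lower
    return (lo + hi) / 2.0

def adjust(force, rate=0.2, trials=200):
    """Gradual adjustment: each trial moves force part way toward the level the reward supports."""
    for _ in range(trials):
        target = incentive_force(terms_amount(force))
        force += rate * (target - force)
    return force

f0 = equilibrium_force()
m0 = terms_amount(f0)
```

Starting above the equilibrium, as with `adjust(40.0)`, the obtained reward supports a lower force and performance drifts down; starting below, as with `adjust(10.0)`, it drifts up. The `rate` parameter stands in for the assumption that the change in performance is gradual.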

*In a situation in which the organism’s behavior shifts immediately to pre-
cisely the level appropriate for the reward obtained on the preceding trial, one
would have to determine whether the system is an “exploding” one. Since, how-
ever, the change in a rat’s performance is relatively gradual, this problem need
not be considered here.

A variety of further complications arises if this solution is generalized
to other situations. One must deal with individual differences in incentive
functions, discontinuous terms functions, and other possibilities which
complicate the analysis but which do not alter its basic logic. As a single,
more complex example, let us briefly consider a situation in which the
rat must press below a certain force in order to get reward, harder presses
being nonreinforced.

Now since the rat’s force will vary from trial to trial, he is viewed 
as being on a partial reinforcement schedule; as his average force in- 
creases, the percentage of reinforcements decreases. Clearly, then, if his 
average force is so high as to never involve presses below the cutoff, ex-
tinction would occur and force would decrease. If, however, his average
force were so low as to never involve presses above the cutoff, he would 
get reward every time and thus be led to increase his force. Between these 
two extremes, there is an average force which gets reward just frequently 
enough to maintain that force. 
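The same reasoning can be sketched for the cutoff condition. The example assumes that force varies normally about its average from trial to trial and that the average force maintained rises linearly with the proportion of reinforced trials; both assumptions, and every number used, are hypothetical and serve only to make the logic concrete.

```python
import math

def p_reinforced(mean_force, cutoff=20.0, sd=4.0):
    """Proportion of presses falling below the cutoff, for normally distributed force."""
    z = (cutoff - mean_force) / (sd * math.sqrt(2.0))
    return 0.5 * (1.0 + math.erf(z))

def sustained_force(p, floor=5.0, span=30.0):
    """Hypothetical rule: average force maintained by reinforcement on proportion p of trials."""
    return floor + span * p

def equilibrium_mean(lo=0.0, hi=60.0, tol=1e-9):
    """Average force that earns reward just often enough to maintain itself."""
    def gap(mean_force):
        return sustained_force(p_reinforced(mean_force)) - mean_force
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if gap(mid) > 0:
            lo = mid   # rewarded often enough to support a higher average force
        else:
            hi = mid   # rewarded too seldom to support this average force
    return (lo + hi) / 2.0

mean_at_equilibrium = equilibrium_mean()
share_rewarded = p_reinforced(mean_at_equilibrium)
```

With these made-up values the equilibrium average happens to fall at the cutoff itself, so that half the presses are rewarded; other values of `floor` and `span` would place it elsewhere.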

This kind of condition is a particularly good one to point out the 
distinction between the macromolar and micromolar approaches. Take 
the instance where the reward is received on half the trials, which would 
occur when the median force is equal to the cutoff. By the classical 
macromolar approach, the response of “bar depression” is then rein- 
forced on a 50 per cent partial reinforcement schedule. By the micro- 
molar approach, “strong bar depressions” receive 100 per cent nonrein- 
forcement, and “weak bar depressions” receive 100 per cent reinforce- 
ment. In the former analysis there is a single response tendency and per- 
formance must stabilize at a level insuring only enough weak responses to 
provide a sufficient probability of reinforcement to maintain the response. 
By the latter analysis, weak and strong responses have separate response 
tendencies and the performance level must insure only enough strong 
responses to maintain the response discrimination. 
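The difference between the two bookkeepings can be made explicit with a small tally. The forces and the cutoff below are invented for illustration, and the function name is an assumption of this sketch; the point is only that the same record of presses yields one partial schedule under the macromolar description and two consistent schedules under the micromolar one.

```python
def reinforcement_rates(forces, cutoff):
    """Return (macromolar, micromolar) reinforcement rates for one record of presses."""
    rewarded = [force < cutoff for force in forces]   # presses below the cutoff are rewarded

    def rate(flags):
        return sum(flags) / len(flags) if flags else None

    macromolar = {"bar depression": rate(rewarded)}
    micromolar = {
        "weak bar depression": rate(
            [r for f, r in zip(forces, rewarded) if f < cutoff]
        ),
        "strong bar depression": rate(
            [r for f, r in zip(forces, rewarded) if f >= cutoff]
        ),
    }
    return macromolar, micromolar

# Hypothetical record whose median force equals the cutoff of 20 gm.
macro, micro = reinforcement_rates([12, 16, 18, 19, 21, 22, 24, 28], cutoff=20)
```

Here `macro` records a single response reinforced on half the trials, whereas `micro` records weak presses as always reinforced and strong presses as never reinforced.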

This description has been intended to show that the classical macro- 
molar system can deal, in principle at least, with a wide variety of 
conditions of correlated reinforcement by means of an equilibrium 
solution. Indeed, there are other ways to view correlated reinforcement
situations by means of macromolar theory. For example, in the last
mentioned illustration, one could assume that the nonreinforcements 
which occur whenever the rat presses above the cutoff produce inhibition, 
and that the effective excitatory potential is in this way maintained at a 
low level which produces only weak responses. Accordingly, there is no 
a priori reason why correlated reinforcement should require alteration 
of the macromolar assumption. Instead, one must ask how well the 
analysis fits the known data. 

Evaluation. To evaluate properly the equilibrium solution for cor- 
related reinforcement, it is necessary to have a quantitative estimate of 
the appropriate incentive function together with obtained performance 
under some condition of correlated reinforcement. While there are a few
studies providing the latter kind of information, the absence of the in- 
centive function obtained on those apparatus puts the evaluation at a 
lower level of quantitative specificity where success is less demanding. 

All the studies known to the writer have used conditions in which re- 
ward was given only if performance fell within a specified range. One 
type has involved varying the minimum performance that will be re- 
inforced, as in the study by Arnold [3] where only bar responses by the 
rat above 5 to 30 gm were rewarded. Another type has involved setting 
a maximum performance above which the response is nonreinforced. 
Early work by Skinner [160], as recently extended by Wilson and Keller 
[201], required the subject to delay at least a specified interval (i.e.,
have a long latency) between responses in a free-responding situation. 
Finally, the Arnold study also included a combination of the two, where 
both a minimum and maximum were specified and only bar responses 
within a restricted range of forces were rewarded. 

The equilibrium analysis would predict that the subject’s adjustment 
to such conditions would show a shift in average performance appropriate 
to any shift in the minimum and/or maximum performance specified, 
but that variability of behavior around this average would have to in- 
sure sufficient nonreinforcements to keep response tendency at the 
equilibrium level. For example, in the Skinner situation, requiring say 
a 20-sec latency between responses, the amount and delay of reinforce- 
ment might, if received continuously, produce incentive for 5-sec latency. 
However, this amount and delay of reward, if received on only part of 
the trials, would presumably produce less incentive and hence a longer 
latency. Without knowledge of performance under various patterns of 
uncorrelated reinforcement, the most that can be said is that adjustment 
cannot be perfect, i.e., at least some nonreinforced responses will have to 
occur to reduce response tendency. 

At this level of quantitative specificity, all of the results by these 
experimenters appear consistent with the equilibrium solution. All have 
obtained distributions of performance showing some but imperfect ad-
justment to the specified conditions.

The only research designed to permit quantitative evaluation of the 
equilibrium analysis is currently in progress in the writer’s laboratory.
Correlating amount and delay of reward with the rat’s speed of running 
down a straight alley, we have found that some kinds of conditions seem 
to fit the equilibrium analysis quite well; other conditions, however, show 
clear deviations in that performance is affected not only by what reward 
is received but by how it is correlated with the response dimension. 
Generally, so far it appears that speed can best be predicted by attend- 
ing to the differential reinforcement given different speeds. That is, the 
likelihood that the rat will run at a particular speed depends upon what
he gets for it compared with what he can get for running at some other 
speed. This is, of course, the kind of result a micromolar theory would 
predict. 

The reader should recognize that the entire discussion of correlated 
reinforcement has been cast at a level not unique to a Hullian theory. 
The functions have not been broken down into specific intervening 
variables with particular anchoring operations. Instead, the equilibrium 
solution has been left at a level appropriate to any stimulus-response 
approach which might choose to use it. What appears to be the singular, 
unique underlying assumption is one that is shared by most such systems, 
namely that quantitative variations in performance reflect variations in 
a molar response tendency. If this is true, then failure of the solution to 
fit empirical data would strongly favor adoption of the micromolar 
approach in which quantitatively different behaviors are viewed as dif- 
ferent responses with separate response tendencies. 

However, a micromolar theory is enormously complex as a deriva- 
tional instrument. It would, therefore, seem sensible to resist abandoning 
the simpler and more conventional level of analysis until the accumulated 
evidence is more conclusive and until any alternative answers to the 
problem have been found wanting. At the same time, transition to a 
micromolar approach may ultimately prove fruitful. 


SUMMARY AND CONCLUSION 


It is hoped that bringing together under one cover even brief
descriptions of a number of diverse but related matters concerning the
Hullian approach will serve some value. But it must be emphasized that 
many of these have been dealt with by a number of others and that it has 
been assumed that interested readers will also avail themselves of the 
listed references for a more complete perspective of the Hullian approach 
at mid-twentieth century. 
This paper has attempted to reflect a dynamic approach to theory
in psychology, constantly adapting to expanding knowledge. We have 
briefly described what the goals of the approach are and how they are 
conceptualized in a larger behavior science. We have discussed the 
present methods for attaining these goals, emphasizing the necessity of
reliable systematic data for improving the theory. We have described
some of the theoretical ideas which have been suggested, particularly
those by Spence and Hull, and have pointed out how these have been 
and are being affected by detailed, quantitative investigations. 

We have seen that the Hullian approach is initially characterized by 
the use of intervening variables in a postulate system. It has emphasized 
not only the importance of formalizing the postulates, but also the pos-
sibility of quantifying them. It is more finely distinguished from other 
quantitative intervening variable approaches by particular assumptions 
about the notions of stimulus and of response, about the nature of the 
associative variable, and about the conditions necessary for learning. 

We have discussed these latter assumptions in the context of con- 
troversies in which the approach has been involved and have seen that 
most of the difficulties encountered by it have arisen when one leaves the 
simpler situations with which it was originally concerned. The allegedly 
embarrassing situations are more complex either in the sense that one can- 
not immediately identify all the boundary conditions which must be used 
along with the postulates in making deductions, or in the sense that new 
variables which have not yet been explicitly formulated are involved. 
Historically, one can see that, as these situations have been analyzed for 
their boundary conditions and possible new variables, the present ap- 
proach has not only been found to be consistent with the phenomena, but 
has extended our ability to predict the results of further systematic 
studies. It is perhaps not unreasonable to extrapolate this trend and 
believe that problems not now amenable to Hullian analysis will, in due 
time, be resolved. 
INTRODUCTION


A scientist is an extremely complex organism, and his behavior still 
resists a successful empirical analysis. Nevertheless, if anything useful is 
to be said about him, either in trying to understand his behavior or in 
inculcating similar behavior in others, it will be by way of an empirical, 
rather than a formal, analysis. As an antiformalist it would be incon- 
sistent of me to describe my own scientific activity in the formal frame- 
work of Project A. I have therefore reacted to the proposal of the director 
by illustrating my own philosophy of science with a personal history. 


It has been said that college teaching is the only profession for which 
there is no professional training, and it is commonly argued that this is 
because our graduate schools train scholars and scientists rather than 
teachers. We are more concerned with the discovery of knowledge than 
with its dissemination. But can we justify ourselves quite so easily? It is 
a bold thing to say that we know how to train a man to be a scientist. 
Scientific thinking is the most complex and probably the most subtle of 
all human activities. Do we actually know how to shape up such be-
havior, or do we simply mean that some of the people who attend our
graduate schools eventually become scientists?

Except for a laboratory course which acquaints the student with
standard apparatus and standard procedures, the only explicit training in
scientific method generally received by a young psychologist is a course
in statistics—not the introductory course, which is often required of so
many kinds of students that it is scarcely scientific at all, but an advanced
course which includes “model building,” “theory construction,” and
“experimental design.” But it is a mistake to identify scientific practice
with the formalized constructions of statistics and scientific method.
These disciplines have their place, but it does not coincide with the place
of scientific research. They offer a method of science but not, as is so
often implied, the method. As formal disciplines they arose very late in
the history of science, and most of the facts of science have been dis-
covered without their aid. It takes a great deal of skill to fit Faraday with
his wires and magnets into the picture which statistics gives us of scien-
tific thinking. And most current scientific practice would be equally re-
fractory, especially in the important initial stages. It is no wonder that
the laboratory scientist is puzzled and often dismayed when he discovers
how his behavior has been reconstructed in the formal analyses of scien-
tific method. He is likely to protest that this is not at all a fair representa-
tion of what he does.

*This paper, originally conceived in response to the invitation to contribute to
the present study, was presented in substantially similar form as the presidential
address to the Eastern Psychological Association, April, 1955, and published in
The American Psychologist, May, 1956.

But his protest is not likely to be heard. For the prestige of statistics 
and scientific methodology is enormous. Much of it is borrowed from the 
high repute of mathematics and logic, but much of it derives from the 
flourishing state of the art itself. Some statisticians are professional people 
employed by scientific and commercial enterprises. Some are teachers and 
pure researchers who give their colleagues the same kind of service for 
nothing—or at most a note of acknowledgment. Many are zealous peo- 
ple who, with the best of intentions, are anxious to show the nonstatistical 
scientist how he can do his job more efficiently and assess his results more 
accurately. There are strong professional societies devoted to the ad-
vancement of statistics, and hundreds of technical books and journals are 
published annually. 

Against this, the practicing scientist has very little to offer. He cannot 
refer the young psychologist to a book which will tell him how to find 
out all there is to know about a subject matter, how to have the good 
hunch which will lead him to devise a suitable piece of apparatus, how to 
develop an efficient experimental routine, how to abandon an unprofit- 
able line of attack, how to move on most rapidly to later stages of his 
research. The work habits which have become second nature to him have 
not been formalized by anyone, and he may feel that they possibly never 
will be. As Richter [5] has pointed out, “Some of the most important 
discoveries have been made without any plan of research,” and “there 
are researchers who do not work on a verbal plane, who cannot put into
words what they are doing.” 

If we are interested in perpetuating the practices responsible for the 
present corpus of scientific knowledge, we must keep in mind that some 
very important parts of the scientific process do not now lend themselves
to mathematical, logical, or any other formal treatment. We do not 
know enough about human behavior to know how the scientist does what 
he does. Although statisticians and methodologists may seem to tell us, 
or at least imply, how the mind works—how problems arise, how hy- 
potheses are formed, deductions made, and crucial experiments designed 
—we as psychologists are in a position to remind them that they do not 
have methods appropriate to the empirical observation or the functional 
analysis of such data. These are aspects of human behavior, and no one 
knows better than we how little can at the moment be said about them. 

Some day we shall be better able to express the distinction between 
empirical analysis and formal reconstruction, for we shall have an alter- 
native account of the behavior of Man Thinking. Such an account will 
not only plausibly reconstruct what a particular scientist did in any 
given case, it will permit us to evaluate practices and, I believe, to teach 
scientific thinking. But that day is some little distance in the future. 
Meanwhile we can only fall back on examples. 

Some time ago the director of Project A of the American Psycho- 
logical Association asked me to describe my activities as a research psy- 
chologist. I went through a trunkful of old notes and records and, for 
my pains, reread some of my earlier publications. This has made me all 
the more aware of the contrast between the reconstructions of formalized 
scientific method and at least one case of actual practice. Instead of
amplifying the points I have just made by resorting to a generalized 
account which is not available, I should like to discuss a case history. It 
is not one of the case histories we should most like to have, but what it 
lacks in importance is perhaps somewhat offset by accessibility. I there- 
fore ask you to imagine that you are all clinical psychologists—a task 
which becomes easier and easier as the years go by—while I sit across the 
desk from you or stretch out upon this comfortable leather couch. 


A CASE HISTORY 


The first thing I can remember happened when I was only twenty-
two years old. Shortly after I had graduated from college, Bertrand
Russell published a series of articles in the old Dial magazine on the
epistemology of John B. Watson’s behaviorism. I had had no psychology
as an undergraduate but I had had a lot of biology, and two of the books
which my biology professor had put into my hands were Loeb’s
Physiology of the Brain and the newly published Oxford edition of
Pavlov’s Conditioned Reflexes. And now here was Russell extrapolating
the principles of an objective formulation of behavior to the problem of 
knowledge. Many years later when I told Lord Russell that his articles 
were responsible for my interest in behavior, he could only exclaim,
“Good Heavens! I had always supposed that those articles had de- 
molished Behaviorism!” But at any rate he had taken Watson seriously, 
and so did I. 

When I arrived at Harvard for graduate study, the air was not 
exactly full of behavior, but Walter Hunter was coming in once a week 
from Clark University to give a seminar, and Fred Keller, also a graduate 
student, was an expert in both the technical details and the sophistry of 
behaviorism. Many a time he saved me as I sank into the quicksands of 
an amateurish discussion of “What is an image?” or “Where is red?” I 
soon came into contact with W. J. Crozier, who had studied under Loeb. 
It had been said of Loeb, and might have been said of Crozier, that
he “resented the nervous system.” Whether this was true or not, the
fact was that both these men talked about animal behavior without
mentioning the nervous system and with surprising success. So far as I
was concerned, they canceled out the physiological theorizing of
Pavlov and Sherrington and thus clarified what remained of the work
of these men as the beginnings of an independent science of behavior.
My doctoral thesis was in part an operational analysis of Sherrington’s
synapse, in which behavioral laws were substituted for supposed states
of the central nervous system.

[Fig. 1]

But the part of my thesis at issue here was experimental. So far as 
I can see, I began simply by looking for lawful processes in the behavior 
of the intact organism. Pavlov had shown the way; but I could not 
then, as I cannot now, move without a jolt from salivary reflexes to 
the important business of the organism in everyday life. Sherrington and 
Magnus had found order in surgical segments of the organism. Could 
not something of the same sort be found, to use Loeb’s phrase, in “the
organism as a whole”? I had the clue from Pavlov: control your 
conditions and you will see order. 

It is not surprising that my first gadget was a silent release box,
operated by compressed air and designed to eliminate disturbances when 
introducing a rat into an apparatus. I used this first in studying the 
way a rat adapted to a novel stimulus. I built a soundproofed box con- 
taining a specially structured space. A rat was released, pneumatically, 
at the far end of a darkened tunnel from which it emerged in exploratory
fashion into a well-lighted area. To accentuate its progress and to
facilitate recording, the tunnel was placed at the top of a flight of steps,
something like a functional Parthenon (Fig. 1). The rat would peek out
from the tunnel, perhaps glancing suspiciously at the one-way window 
through which I was watching it, then stretch itself cautiously down the 
steps. A soft click (carefully calibrated, of course) would cause it to pull 
back into the tunnel and remain there for some time. But repeated clicks 
had less and less of an effect. I recorded the rat’s advances and retreats 
by moving a pen back and forth across a moving paper tape. 

The major result of this experiment was that some of my rats had 
babies. I began to watch young rats. I saw them right themselves and 
crawl about very much like the decerebrate or thalamic cats and rabbits 
of Magnus. So I set about studying the postural reflexes of young rats. 


Here was a first principle not formally recognized by scientific
methodologists: when you run onto something interesting, drop everything else
and study it. I tore up the Parthenon and started over. 

If you hold a young rat on one hand and pull it gently by the 
tail, it will resist you by pulling forward and then, with a sudden 
sharp spring which usually disengages its tail, it will leap out into space. 
I decided to study this behavior quantitatively. I built a light platform
covered with cloth and mounted it on tightly stretched piano wires (Fig.
2). Here was a version of Sherrington’s torsion-wire myograph, originally
designed to record the isometric contraction of the tibialis anticus of a
cat, but here adapted to the response of a whole organism. When the
tail of the young rat was gently pulled, the rat clung to the cloth floor
and tugged forward. By amplifying the fine movement of the platform,
it was possible to get a good kymograph record of the tremor in this
motion and then, as the pull against the tail was increased, of the 
desperate spring into the air (Fig. 3). 

Now, baby rats have very little future, except as adult rats. Their 
behavior is literally infantile and cannot be usefully extrapolated to 
everyday life. But if this technique would work with a baby, why not 
try it on a mature rat? To avoid attaching anything to the rat, it should 
be possible to record, not a pull against the substrate, but the ballistic 
thrust exerted as the rat runs forward or suddenly stops in response to 
my calibrated click. So, invoking the first principle of scientific practice
again, I threw away the piano-wire platform and built a runway, 8 ft
long. This was constructed of light wood, in the form of a U girder, 
mounted rigidly on vertical glass plates, the elasticity of which permitted 
a very slight longitudinal movement (Fig. 4). The runway became the 
floor of a long tunnel, not shown, at one end of which I placed my 
soundless release box and at the other end myself, prepared to reinforce 
the rat for coming down the runway by giving it a bit of wet mash, to 
sound a click from time to time when it had reached the middle of the 
runway, and to harvest kymograph records of the vibrations of the 
substrate. 

Now for a second unformalized principle of scientific practice: some 
ways of doing research are easier than others. I got tired of carrying 
the rat back to the other end of the runway. A back alley was therefore
added (Fig. 5). Now the rat could eat a bit of mash at point C, go down
the back alley A, around the end as shown, and back home by runway 
B. The experimenter at E could collect records from the kymograph at 
D in comfort. In this way a great many records were made of the forces 
exerted against the substratum as rats ran down the alley and occasionally 
stopped dead in their tracks as a click sounded (Fig. 6). 


There was one annoying detail, however. The rat would often wait
an inordinately long time at C before starting down the back alley on the 
next run. There seemed to be no explanation for this. When I timed 
these delays with a stop watch, however, and plotted them, they seemed 
to show orderly changes (Fig. 7). This was, of course, the kind of thing 
I was looking for. I forgot all about the movements of the substratum 
and began to run rats for the sake of the delay measurements alone. But 
there was now no reason why the runway had to be 8 ft long and, as
the second principle came into play again, I saw no reason why the rat
could not deliver its own reinforcement.

A new apparatus was built. In Fig. 8 we see the rat eating a piece of
food just after completing a run. It produced the food by its own
action. As it ran down the back alley A to the far end of the
rectangular runway, its weight caused the whole runway to
tilt slightly on the axis C and this movement turned the wooden disc 
D, permitting a piece of food in one of the holes around its perimeter 
to drop through a funnel into a food dish. The food was pearl barley, 
the only kind I could find in the grocery stores in reasonably uniform 
pieces. The rat had only to complete its journey by coming down the
home stretch B to enjoy its reward. The experimenter was able to enjoy 
his reward at the same time, for he had only to load the magazine, 
put in a rat, and relax. Each tilt was recorded in a slowly moving 
kymograph. 

A third unformalized principle of scientific practice: some people
are lucky. The disc of wood from which I had fashioned the food
magazine was taken from a storeroom of discarded apparatus. It 
happened to have a central spindle, which fortunately I had not bothered 
to cut off. One day it occurred to me that if I wound a string around 
the spindle and allowed it to unwind as the magazine was emptied (Fig. 
9), I would get a different kind of record. Instead of a mere report of the 
up-and-down movement of the runway, as a series of pips as in a
polygraph, I would get a curve. And I knew that science made great use of
curves, although, so far as I could discover, very little of pips on a
polygram. The difference between the old type of record at A (Fig. 10)
and the new at B may not seem great, but as it turned out the curve
revealed things in the rate of responding, and in changes in that rate,
which would certainly otherwise have been missed. By allowing the
string to unwind rather than to wind, I had got my curve in an awkward
Cartesian quadrant, but that was easily remedied. Psychologists have
adopted cumulative curves only very slowly, but I think it is fair to
say that they have become an indispensable tool for certain purposes
of analysis.
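
The difference between a record of pips and a cumulative curve can be
sketched in a few lines of Python. This is only an illustration: the
response times are made up, and the function names are assumptions
introduced here, not part of the original apparatus. Each response
moves the pen one step, so the slope of the curve over any stretch of
time is the rate of responding in that stretch.

```python
from bisect import bisect_right

def cumulative_record(response_times):
    """Return (time, count) points: the pen moves one step per response."""
    times = sorted(response_times)
    return [(t, i + 1) for i, t in enumerate(times)]

def rate_between(response_times, start, end):
    """Mean rate (responses per unit time) in the window (start, end],
    which is the slope of the cumulative record across that window."""
    if end <= start:
        raise ValueError("end must be later than start")
    times = sorted(response_times)
    n = bisect_right(times, end) - bisect_right(times, start)
    return n / (end - start)

# Hypothetical response times in minutes: rapid at first, then slowing.
pips = [1, 2, 3, 4, 6, 8, 12, 18]
record = cumulative_record(pips)
early = rate_between(pips, 0, 4)   # slope of the first stretch
late = rate_between(pips, 4, 20)   # slope of the later stretch
```

Read as pips, the eight responses are just eight marks on a tape. Read
as a curve, the same data show a steep early slope flattening into a
shallow one, which is the kind of change in rate the text describes.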

Eventually, of course, the runway was seen to be unnecessary. The 
rat could simply reach into a covered tray for pieces of food, and each 
movement of the cover could operate a solenoid to move a pen one step 
in a cumulative curve. The first major change in rate observed in this 
way was due to ingestion. Curves showing how the rate of eating declined 
with the time of eating comprised the other part of my thesis. But a
refinement was needed. The behavior of the rat in pushing open the 
door was not a normal part of the ingestive behavior of Rattus rattus. 
The act was obviously learned but its status as part of the final perform- 
ance was not clear. It seemed wise to add an initial conditioned response 
connected with ingestion in a quite arbitrary way. I chose the first 
device which came to hand—a horizontal bar or lever placed where it 
could be conveniently depressed by the rat to close a switch which 
operated a magnetic magazine. Ingestion curves obtained with this
initial response in the chain were found to have the same properties as
those without it. 

Now, as soon as you begin to complicate an apparatus, you neces- 
sarily invoke a fourth principle of scientific practice: apparatus some- 
times breaks down. I had only to wait for the food magazine to jam to get 
an extinction curve. At first I treated this as a defect and hastened to 
remedy the difficulty. But eventually, of course, I deliberately discon- 
nected the magazine. I can easily recall the excitement of that first com- 
plete extinction curve (Fig. 11). I had made contact with Pavlov at 
last! Here was a curve uncorrupted by the physiological process of 
ingestion. It was an orderly change due to nothing more than a special 
contingency of reinforcement. It was pure behavior! I am not saying 
that I would not have got around to extinction curves without a break- 
down in the apparatus; Pavlov had given too strong a lead in that 
direction. But it is still no exaggeration to say that some of the most
interesting and surprising results have turned up first because of similar
accidents. Foolproof apparatus is no doubt highly desirable, but Charles
Ferster and I in recently reviewing the data from a five-year program of
research found many occasions to congratulate ourselves on the fallibility
of relays and vacuum tubes. 

I then built four soundproofed ventilated boxes, each containing a 
lever and a food magazine and supplied with a cumulative recorder, 
and was on my way to an intensive study of conditioned reflexes in 
skeletal behavior. I would reinforce every response for several days and 
then extinguish for a day or two, varying the number of reinforcements, 
the amount of previous magazine training, and so on. 

At this point I made my first use of the deductive method. I had 
long since given up pearl barley as too unbalanced a diet for steady 
use. A neighborhood druggist had shown me his pill machine, and I 
had had one made along the same lines (Fig. 12). It consisted of a 
fluted brass bed across which one laid a long cylinder of stiff paste 
(in my case a MacCollum formula for an adequate rat diet). A similarly 
fluted cutter was then lowered onto the cylinder and rolled slowly 
back and forth, converting the paste into about a dozen spherical pellets.
These were dried for a day or so before use. The procedure was
painstaking and laborious. Eight rats eating a hundred pellets each per day
could easily keep up with production. One pleasant Saturday afternoon 
I surveyed my supply of dry pellets and, appealing to certain elemental 
theorems in arithmetic, deduced that unless I spent the rest of that 
afternoon and evening at the pill machine, the supply would be ex- 
hausted by 10:30 Monday morning. 

Since I do not wish to deprecate the hypothetico-deductive method, 
I am glad to testify here to its usefulness. It led me to apply our second 
principle of unformalized scientific method and to ask myself why every 
press of the lever had to be reinforced. I was not then aware of what 
had happened at the Brown laboratories, as Harold Schlosberg later told 
the story. A graduate student had been given the task of running a cat 
through a difficult discrimination experiment. One Sunday the student 
found the supply of cat food exhausted. The stores were closed and so, 
with a beautiful faith in the frequency theory of learning, he ran the 
cat as usual and took it back to its living cage unrewarded. Schlosberg re- 
ports that the cat howled its protest continuously for nearly forty-eight 
hours. Unaware of this I decided to reinforce a response only once every 
minute and to allow all other responses to go unreinforced. There were 
two results: (1) my supply of pellets lasted almost indefinitely and (2) 
each rat stabilized at a fairly constant rate of responding. 

Now, a steady state was something I was familiar with from physical 
chemistry, and I therefore embarked upon the study of periodic rein- 
forcement. I soon found that the constant rate at which the rat stabilized 
depended upon how hungry it was. Hungry rat, high rate; less hungry
rat, lower rate. At that time I was bothered by the practical problem of 
controlling food deprivation. I was working half time at the Medical 
School (on chronaxie of subordination!) and could not maintain a good 
schedule in working with the rats. The rate of responding under periodic 
reinforcement suggested a scheme for keeping a rat at a constant level
of deprivation. The argument went like this: suppose you reinforce 
the rat, not at the end of a given period, but when it has completed the 
number of responses ordinarily emitted in that period. And suppose you 
use substantial pellets of food and give the rat continuous access to the 
lever. Then, except for periods when the rat sleeps, it should operate the 
lever at a constant rate around the clock. For, whenever it grows slightly 
hungrier, it will work faster, get food faster, and become less hungry, 
while whenever it grows slightly less hungry, it will respond at a lower 
rate, get less food, and grow hungrier. By setting the reinforcement at a
given number of responses it should even be possible to hold the rat at 
any given level of deprivation. I visualized a machine with a dial which 
one could set to make available, at any time of day or night, a rat in
a given state of deprivation. Of course, nothing of the sort happens. This
is “fixed-ratio” rather than “fixed-interval” reinforcement, and as I soon
found out, it produces a very different type of performance. This is an
example of a fifth unformalized principle of scientific practice, but one
which has at least been named. Walter Cannon described it with a word
invented by Horace Walpole: serendipity—the art of finding one thing
while looking for something else. 
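
The two contingencies named above can be stated compactly in code. The
following Python sketch is an illustration under stated assumptions,
not a description of the original relay circuits: the function names
and the session data are hypothetical, and "fixed interval" is read
here as reinforcing the first response made once the interval has
elapsed since the last reinforcement.

```python
def fixed_interval(response_times, interval):
    """Reinforce the first response made once `interval` has elapsed
    since the last reinforcement (or since the start of the session)."""
    reinforced = []
    last = 0.0
    for t in sorted(response_times):
        if t - last >= interval:
            reinforced.append(t)
            last = t
    return reinforced

def fixed_ratio(response_times, ratio):
    """Reinforce every `ratio`-th response, regardless of elapsed time."""
    times = sorted(response_times)
    return [t for i, t in enumerate(times, start=1) if i % ratio == 0]

# Hypothetical session: one response every 10 seconds for 5 minutes.
slow = [10.0 * k for k in range(1, 31)]
# The same 5 minutes at twice the rate of responding.
fast = [5.0 * k for k in range(1, 61)]

fi_slow = fixed_interval(slow, 60.0)
fi_fast = fixed_interval(fast, 60.0)
fr_slow = fixed_ratio(slow, 6)
fr_fast = fixed_ratio(fast, 6)
```

With these made-up numbers, doubling the rate of responding leaves the
number of reinforcements unchanged under the fixed-interval rule but
doubles it under the fixed-ratio rule. The sketch shows only that the
two contingencies differ; it does not model the behavior they produce.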

This account of my scientific behavior up to the point at which I 
published my results in a book called The Behavior of Organisms is as 
exact in letter and spirit as I can now make it. The notes, data, and 
publications which I have examined do not show that I ever behaved 
in the manner of Man Thinking as described by John Stuart Mill or 
John Dewey or as in reconstructions of scientific behavior by other 
philosophers of science. I never faced a Problem which was more than 
the eternal problem of finding order. I never attacked a problem by 
constructing a Hypothesis. I never deduced Theorems or submitted them 
to Experimental Check. So far as I can see, I had no preconceived 
Model of behavior—certainly not a physiological or mentalistic one, and 
I believe, not a conceptual one. The “reflex reserve” was an abortive,
though operational, concept which was retracted a year or so after 
publication in a paper at the Philadelphia meeting of the APA. It lived 
up to my opinion of theories in general by proving utterly worthless 
in suggesting further experiments. Of course, I was working on a basic 
Assumption—that there was order in behavior if I could only discover 
it—but such an assumption is not to be confused with the hypotheses 
of deductive theory. It is also true that I exercised a certain Selection of 
Facts, not because of relevance to theory but because one fact was more 
orderly than another. If I engaged in Experimental Design at all, it was 
simply to complete or extend some evidence of order already observed. 

Most of the experiments described in The Behavior of Organisms 
were done with groups of four rats. A fairly common reaction to the
book was that such groups were too small. How did I know that other 
groups of four rats would do the same thing? Keller, in defending the 
book, countered with the charge that groups of four were too big. 
Unfortunately, however, I allowed myself to be persuaded of the 
contrary. This was due in part to my association at the University of
Minnesota with W. T. Heron. Through him I came into close contact 
for the first time with traditional animal psychology. Heron was in- 
terested in inherited maze behavior, inherited activity, and certain 
drugs—the effects of which could then be detected only through the use 
of fairly large groups. We did an experiment together on the effect of 
starvation on the rate of pressing a lever and started the new era
with a group of 16 rats. But we had only 4 boxes, and this was so
inconvenient that Heron applied for a grant and built a battery of 24
lever-boxes and cumulative recorders. I supplied an attachment which would
record not only the mean performance of all 24 rats in a single averaged
curve, but mean curves for 4 subgroups of 12 rats each and 4 subgroups
of 6 rats each [3]. We thus provided for the design of experiments
according to the principles of R. A. Fisher, which were then coming into
vogue. We had, so to speak, mechanized the Latin square.

With this apparatus Heron and I published a study of extinction in 
maze-bright and maze-dull rats using 95 subjects. Later I published
mean extinction curves for groups of 24, and W. K. Estes and I did 
our work on anxiety with groups of the same size. But although Heron 
and I could properly voice the hope that “the possibility of using large 
groups of animals greatly improves upon the method as previously re- 
ported, since tests of significance are provided for and properties of 
behavior not apparent in single cases may be more easily detected,” in 
actual practice that is not what happened. The experiments I have just 
mentioned are almost all we have to show for this elaborate battery of 
boxes. Undoubtedly more work could be done with it and would have 
its place, but something had happened to the natural growth of the 
method. You cannot easily make a change in the conditions of an
experiment when 24 apparatuses have to be altered. Any gain in rigor is more
than matched by a loss in flexibility. We were forced to confine ourselves 
to processes which could be studied with the base lines already developed 
in earlier work. We could not move on to the discovery of other processes 
or even to a more refined analysis of those we were working with. No 
matter how significant might be the relations we actually demonstrated, 
our statistical Leviathan had swum aground. The art of the method 
had stuck at a particular stage of its development. 

Another accident rescued me from mechanized statistics and brought 
me back to an even more intensive concentration on the single case. In 
essence, I suddenly found myself face to face with the engineering prob- 
lem of the animal trainer. When you have the responsibility of making 
absolutely sure that a given organism will engage in a given sort of 
behavior at a given time, you quickly grow impatient with theories of 
learning. Principles, hypotheses, theorems, satisfactory proof at the .05 
level of significance that behavior at a choice point shows the effect of 
secondary reinforcement—nothing could be more irrelevant. No one 
goes to the circus to see the average dog jump through a hoop signif- 
icantly oftener than untrained dogs raised under the same circumstances, 
or to see an elephant demonstrate a principle of behavior. 

Perhaps I can illustrate this without giving aid and comfort to the 
enemy by describing a Russian device which the Germans found quite 
formidable. The Russians used dogs to blow up tanks. A dog was trained
to hide behind a tree or wall in low brush or other cover. As a tank
approached and passed, the dog ran swiftly alongside it, and a small 
magnetic mine attached to the dog’s back was sufficient to cripple the 
tank or set it afire. The dog, of course, had to be replaced. 

Now I ask you to consider some of the technical problems which the 
psychologist faces in preparing a dog for such an act of unintentional 
heroism. The dog must wait behind the tree for an indefinite length of
time. Very well, it must, therefore, be intermittently reinforced for wait- 
ing. But what schedule will achieve the highest probability of waiting? 
If the reinforcement is to be food, what is the absolutely optimal 
schedule of deprivation consistent with the health of the dog? The dog 
must run to the tank—that can be arranged by reinforcing it with a 
practice tank—but it must start instantly if it is to overtake a swift tank, 
and how do you differentially reinforce short reaction times, especially in 
counteracting the reinforcement for sitting and waiting? The dog must 
react only to tanks, not to a refugee driving his oxcart along the road, but 
what are the defining properties of a tank so far as a dog is concerned? 

I think it can be said that a functional analysis proved adequate in 
its technological application. Manipulation of environmental conditions 
alone made possible a wholly unexpected practical control. Behavior 
could be shaped up according to specifications and maintained in- 
definitely almost at will. One behavioral technologist who worked with 
me at the time (Keller Breland) is now specializing in the production of 
behavior as a salable commodity and has described this new profession 
in the American Psychologist [2].

There are many useful applications within psychology itself. Ratliff and 
Blough have recently conditioned pigeons to serve as psychophysical 
observers. In their experiment a pigeon may adjust one of two spots of
light until the two are equally bright or it may hold a spot of light 
at the absolute threshold during dark adaptation. The techniques which 
they have developed to induce pigeons to do this are only indirectly 
related to the point of their experiments and hence exemplify the ap- 
plication of a behavioral science [4]. The field in which a better tech- 
nology of behavior is perhaps most urgently needed is education. I can- 
not describe here the applications which are now possible, but perhaps I 
can indicate my enthusiasm by hazarding the guess that educational 
techniques at all age levels are on the threshold of revolutionary changes. 

The effect of a behavioral technology on scientific practice is the
issue here. Faced with practical problems in behavior, you necessarily
emphasize the refinement of experimental variables. As a result, some of
the standard procedures of statistics appear to be circumvented. Let me
illustrate. Suppose that measurements have been made on two groups
of subjects differing in some detail of experimental treatment. Means
and standard deviations for the two groups are determined, and any
difference due to the treatment is evaluated. If the difference is in the 
expected direction but is not statistically significant, the almost universal 
recommendation would be to study larger groups. But our experience 
with practical control suggests that we may reduce the troublesome 
variability by changing the conditions of the experiment. By discovering, 
elaborating, and fully exploiting every relevant variable, we may 
eliminate in advance of measurement the individual differences which 
obscure the difference under analysis. This will achieve the same result 
as increasing the size of groups, and it will almost certainly yield a bonus 
in the discovery of new variables which would not have been identified 
in the statistical treatment. 

The same may be said of smooth curves. In our study of anxiety, Estes
and I published several curves, the reasonable smoothness of which was
obtained by averaging the performance of 12 rats for each curve. The
individual curves published at that time show that the mean curves do
not faithfully represent the behavior of any one rat. They show a certain
tendency toward a change in slope which supported the point we were
making, and they may have appeared to justify averaging for that reason.

But an alternative method would have been to explore the individual 
case until an equally smooth curve could be obtained. This would have 
meant, not only rejecting the temptation to produce smoothness by 
averaging cases, but manipulating all relevant conditions as we later 
learned to manipulate them for practical purposes. The individual curves 
which we published at that time do not point to the need for larger 
groups but for improvement in experimental technique. Here, for ex- 
ample, is a curve the smoothness of which is characteristic of current 
practice. Such curves were shown in the making in a demonstration 
which Ferster and I arranged at the Cleveland meeting of the American 
Psychological Association (Fig. 13). Here, in a single organism, three 
different schedules of reinforcement are yielding corresponding per- 
formances with great uniformity under appropriate stimuli alternating 
at random. One does not reach this kind of order through the application 
of statistical methods.
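
The point about smoothness obtained by averaging can be illustrated
with a small Python sketch. The data are entirely made up and are not
the curves from the anxiety study; the function names are assumptions
introduced for the illustration. Each hypothetical subject changes
abruptly, but at a different moment, and the mean of the group changes
gradually.

```python
def step_curve(change_at, length, before=1.0, after=0.0):
    """A hypothetical individual whose rate drops abruptly at `change_at`."""
    return [before if t < change_at else after for t in range(length)]

def mean_curve(curves):
    """Point-by-point average across individuals."""
    n = len(curves)
    return [sum(values) / n for values in zip(*curves)]

def max_step(curve):
    """Largest change between two successive points of a curve."""
    return max(abs(b - a) for a, b in zip(curve, curve[1:]))

# Four made-up subjects, each changing abruptly but at a different time.
individuals = [step_curve(c, 10) for c in (2, 4, 6, 8)]
average = mean_curve(individuals)
```

In this toy case every individual curve contains a full-sized jump,
while the mean curve descends in small steps and matches none of the
individuals. The smoothness belongs to the averaging, not to any one
subject.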

In The Behavior of Organisms I was content to deal with the over- 
all slopes and curvature of cumulative curves and could make only a 
rough classification of the properties of behavior shown by the finer grain.
The grain has now been improved. The resolving power of the
microscope has been increased manyfold, and we can see fundamental processes
of behavior in sharper and sharper detail. In choosing rate of responding 
as a basic datum and in recording this conveniently in a cumulative 
curve, we make important temporal aspects of behavior visible. Once 
this has happened, our scientific practice is reduced to simple looking. 
A new world is opened to inspection. We use such curves as we use 
a microscope, X-ray camera, or telescope. This is well exemplified by 
recent extensions of the method. These are no longer part of my case 
history, but perhaps you will permit me to consult you about what some 
critics have described as a folie à deux or group neurosis.

An early application of the method to the behavior of avoidance and 
escape was made by Keller in studying the light aversion of the rat. This 
was brilliantly extended by Murray Sidman in his shock-avoidance 
experiments. It is no longer necessary to describe avoidance and escape
by appeal to “principles,” for we may watch the behavior develop when 
we have arranged the proper contingencies of reinforcement, as we later 
watch it change as these contingencies are changed. 

Hunt and Brady have extended the use of a stable rate in the study 
of anxiety-producing stimuli and have shown that the depression in rate 
is eliminated by electroconvulsive shock and by other measures which 
are effective in reducing anxiety in human patients. O. R. Lindsley has 
found the same thing for dogs, using insulin-shock therapy and sedatives. 
Brady has refined the method by exploring the relevance of various 
schedules of reinforcement in tracing the return of the conditioned de- 
pression after treatment. In these experiments you see the effect of a 
treatment as directly as you see the constriction of a capillary under the 
microscope. 

Early work with rats on caffeine and Benzedrine has been extended 
by Lindsley with dogs. A special technique for evaluating several effects 
of a drug in a single short experimental period yields a record of behavior 
which can be read as a specialist reads an electrocardiogram. Dr. Peter 
Dews of the Department of Pharmacology at the Harvard Medical 
School is investigating dose-response curves and the types and effects of 
various drugs, using pigeons as subjects. In the Psychological
Laboratories at Harvard additional work on drugs is being carried out by Morse,
Herrnstein, and Marshall, and the technique is being adopted by drug 
manufacturers. There could scarcely be a better demonstration of the 
experimental treatment of variability. In a single experimental session 
with a single organism one observes the onset, duration, and decline of 
the effects of a drug.

The direct observation of defective behavior is particularly important. 
Clinical or experimental damage to an organism is characteristically
unique. Hence the value of a method which permits the direct
observation of the behavior of the individual. Lindsley has studied the effects
of near-lethal irradiation, and the effects of prolonged anesthesia and 
anoxia are currently being examined by Thomas Lohr in cooperation 
with Dr. Henry Beecher of the Massachusetts General Hospital. The 
technique is being applied to neurological variables in the monkey by 
Dr. Karl Pribram at the Hartford Institute. The pattern of such research 
is simple: establish the behavior in which you are interested, submit the 
organism to a particular treatment, and then look again at the behavior.
An excellent example of the use of experimental control in the study of 
motivation is some work on obesity by J. E. Anliker in collaboration with 
Dr. Jean Mayer of the Harvard School of Public Health, where
abnormalities of ingestive behavior in several types of obese mice can be
compared by direct inspection.

There is perhaps no field in which behavior is customarily described more
indirectly than psychiatry. In an experiment at the Massachusetts State
Hospital, under the sponsorship of Dr. Harry Solomon and myself, O. R.
Lindsley is carrying out an extensive program which might be characterized
as a quantitative study of the temporal properties of psychotic behavior.
Here again it is a question of making certain characteristics of the
behavior visible.

The extent to which we can eliminate sources of variability before 
measurement is shown by a result which has an unexpected significance 
for comparative psychology and the study of individual differences. 
Figure 14 shows tracings of three curves which report behavior in re- 
sponse to a multiple fixed-interval fixed-ratio schedule. The hatches 
mark reinforcements. Separating them in some cases are short, steep
lines showing a high constant rate on a fixed-ratio schedule and, 
in others, somewhat longer “scallops” showing a smooth acceleration as 
the organism shifts from a very low rate just after reinforcement to a
higher rate at the end of the fixed interval. The values of the intervals 
and ratios, the states of deprivation, and the exposures to the schedules 
were different in the three cases, but except for these details the curves 
are quite similar. Now, one of them was made by a pigeon in some 
experiments by Ferster and me, one was made by a rat in an experiment 
on anoxia by Lohr, and the third was made by a monkey in Karl 
Pribram’s laboratory at the Hartford Institute. Pigeon, rat, monkey; 
which is which? It doesn’t matter. Of course, these three species have
behavioral repertoires which are as different as their anatomies. But once
you have allowed for differences in the ways in which they make contact 
with the environment, and in the ways in which they act upon the en- 
vironment, what remains of their behavior shows astonishingly similar 
properties. Mice, cats, dogs, and human children could have added other 
curves to this figure. And when organisms which differ as widely as this 
nevertheless show similar properties of behavior, differences between
members of the same species may be viewed more hopefully. Difficult
problems of idiosyncrasy or individuality will always arise as products of 
biological and cultural processes, but it is the very business of the ex- 
perimental analysis of behavior to devise techniques which reduce their 
effects except when they are explicitly under investigation, 

We are within reach of a science of the individual. This will be 
achieved, not by resorting to some special theory of knowledge in which 
intuition or understanding takes the place of observation and analysis, 
but through an increasing grasp of relevant conditions to produce order 
in the individual case. 

A second consequence of an improved technology is the effect upon
behavior theory. As I have pointed out elsewhere, it is the function of
learning theory to create an imaginary world of law and order and thus 
to console us for the disorder we observe in behavior itself. Scores on a T 
maze or jumping stand hop about from trial to trial almost capriciously. 
Therefore we argue that if learning is, as we hope, a continuous and 
orderly process, it must be occurring in some other system of dimensions,
perhaps in the nervous system, or in the mind, or in a conceptual
model of behavior. Both the statistical treatment of group means and
the averaging of curves encourage the belief that we are somehow going 
behind the individual case to an otherwise inaccessible, but more funda- 
mental, process. The whole tenor of our paper on anxiety, for example, 
was to imply that the change we observed was not necessarily a property 
of behavior, but of some theoretical state of the organism (“anxiety”) 
which was merely reflected in a slight modification of performance.

When we have achieved a practical control over the organism, 
theories of behavior lose their point. In representing and managing 
relevant variables, a conceptual model is useless; we come to grips with
behavior itself. When behavior shows order and consistency, we are much
less likely to be concerned with physiological or mentalistic causes. A
datum emerges which takes the place of theoretical phantasy. In the
experimental analysis of behavior we address ourselves to a subject
matter which is not only manifestly the behavior of an individual and
hence accessible without the usual statistical aids but also “objective”
and “actual” without recourse to deductive theorizing.

Statistical techniques serve a useful function, but they have acquired
a purely honorific status which may be troublesome. Their presence or
absence has become a shibboleth to be used in distinguishing between 
good and bad work. Because measures of behavior have been highly 
variable, we have come to trust only results obtained from large numbers 
of subjects. Because some workers have intentionally or unconsciously re- 
ported only selected favorable instances, we have come to put a high 
value on research which is planned in advance and reported in its en- 
tirety. Because measures have behaved capriciously, we have come to 
value skillful deductive theories which restore order. But although large 
groups, planned experiments, and valid theorizing are associated with 
significant scientific results, it does not follow that nothing can be 
achieved in their absence. Here are two brief examples of the choice 
before us. 

Fig. 15 (horizontal axis: minutes in dark)

How can we determine the course of dark adaptation in a pigeon?
We move a pigeon from a bright light to a dark room. What happens?
Presumably the bird is able to see fainter and fainter patches of light
as the process of adaptation takes place, but how can we follow this
process? One way would be to set up a discrimination apparatus in which
choices would be made at specific intervals after the beginning of
dark adaptation. The test patches
of light could be varied over a wide range, and the percentages of 
correct choices at each value would enable us eventually to locate the 
threshold accurately. But hundreds of observations would be needed 
to establish only a few points on the curve and to prove that these 
show an actual change in sensitivity. In the experiment by Blough al- 
ready mentioned, the pigeon holds a spot of light close to the threshold 
throughout the experimental period. A single curve, such as the one 
sketched in Fig. 15, yields as much information as hundreds of read- 
ings, together with the means and standard deviations derived from 
them. The information is more accurate because it applies to a single 
organism in a single experimental session. Yet many psychologists who 
would accept the first as a finished experiment because of the tables 
of means and standard deviations would boggle at the second or call it a
preliminary study. The direct evidence of one’s senses in observing a
process of behavior is not trusted.
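
The logic of holding a stimulus close to the threshold can be sketched in a small simulation. This is an illustration only, not a reconstruction of Blough's apparatus: it assumes a hypothetical exponential fall of the threshold in the dark (`true_threshold`), a simulated observer who sees the spot whenever it is at or above threshold, and a simple up-down rule (`track_threshold`) that dims the spot when it is seen and brightens it when it is not. All names and parameter values are invented for the example.

```python
import math


def true_threshold(minutes, start=5.0, floor=1.0, rate=0.15):
    """Hypothetical threshold (arbitrary log units) after `minutes` in the dark."""
    return floor + (start - floor) * math.exp(-rate * minutes)


def track_threshold(total_minutes=40.0, step_minutes=0.1,
                    step_size=0.1, start_intensity=6.0):
    """Up-down tracking: dim the spot when it is seen, brighten it when not.

    Returns a list of (time, intensity) pairs, i.e. one continuous curve
    from a single simulated session.
    """
    intensity = start_intensity
    record = []
    steps = int(round(total_minutes / step_minutes))
    for i in range(steps):
        t = i * step_minutes
        seen = intensity >= true_threshold(t)  # assumed noiseless observer
        intensity += -step_size if seen else step_size
        record.append((t, intensity))
    return record


if __name__ == "__main__":
    curve = track_threshold()
    for t, x in curve[::50]:
        print(f"{t:5.1f} min  spot {x:5.2f}  threshold {true_threshold(t):5.2f}")
```

Under these assumptions the recorded intensity, once it has caught up with the falling threshold, stays within about one step of it, so the single record traces the whole course of adaptation.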
As another example, consider the behavior of several types of obese 
mice. Do they all suffer from a single abnormality in their eating
behavior or are there differences? One might attempt to answer this with
some such measure of hunger as an obstruction apparatus. The numbers 
of crossings of a grid to get to food, counted after different periods of 
free access to food, would be the data. Large numbers of readings would 
be needed, and the resulting mean values would possibly not describe 
the behavior of any one mouse in any experimental period. A much 
better picture may be obtained with one mouse of each kind in single 
experimental sessions, as Anliker has shown [1]. In an experiment re- 
ported roughly in Fig. 16, each mouse was reinforced with a small piece 
of food after completing a short “ratio” of responses. The hypothalamic- 
obese mouse shows an exaggerated but otherwise normal ingestion curve. 
The hereditary-obese mouse eats slowly but for an indefinite length of 
time and with little change in rate. The gold-poisoned obese mouse shows
a sharp oscillation between periods of very rapid responding and no
responding at all. These three individual curves contain more information
than could probably ever be generated with measures requiring
statistical treatment, yet they will be viewed with suspicion by many
psychologists because they are single cases.

Fig. 16 (responses plotted against time; curves labeled hypothalamic obese, goldthioglucose obese, and hereditary obese)

It is perhaps natural that psychologists should awaken only slowly to 
the possibility that behavioral processes may be directly observed, or that 
they should only gradually put the older statistical and theoretical tech- 
niques in their proper perspective. But it is time to insist that science does 
not progress by carefully designed steps called “experiments” each of
which has a well-defined beginning and end. Science is a continuous and
often a disorderly and accidental process. We shall not do the young
psychologist any favor if we agree to reconstruct our practices to fit the
pattern demanded by current scientific methodology. What the statistician
means by the design of experiments is design which yields the kind
of data to which his techniques are applicable. He does not mean the
behavior of the scientist in his laboratory devising research for his own
immediate and possibly inscrutable purposes. 

Fig. 17 (cartoon from the Columbia Jester)

The organism whose behavior is most extensively modified and most
completely controlled in research of the sort I have described is the
experimenter himself. The point was well made by a cartoonist in the
Columbia Jester (Fig. 17). The caption read: “Boy, have I got this guy
conditioned! Every time I press the bar down he drops in a piece of
food.” The subjects we study reinforce us much more effectively than we
reinforce them. I have been telling you simply how I have been
conditioned to behave. And of course it is a mistake to argue too much
from one case history. My behavior would not have been shaped as it was
were it not for personal characteristics which all psychologists
fortunately do not share. Freud has had something to say about the
motivation of scientists and has given us some insight into the type of
person who achieves the fullest satisfaction from precise experimental
design and the intricacies of deductive systems. Such a person tends to
be more concerned with his success as a scientist than with his subject
matter, as is shown by the fact that he often assumes the role of roving
ambassador. If this seems unfair, let me hasten to characterize my own
motivation in equally unflattering terms. Several years ago I spent a
pleasant summer writing a novel called Walden Two. One of the
characters, Frazier, said many things which I was not yet ready to say
myself. Among them was this:


I have only one important characteristic, Burris: I’m stubborn. I’ve had 
only one idea in my life—a true idée fixe . . . to put it as bluntly as
possible, the idea of having my own way. “Control” expresses it, I think. The
control of human behavior, Burris. In my early experimental days it was 
a frenzied, selfish desire to dominate. I remember the rage I used to feel 
when a prediction went awry. I could have shouted at the subjects of my 
experiments, “Behave, damn you, behave as you ought!” Eventually I 
realized that the subjects were always right. They always behave as they 
ought. It was I who was wrong. I had made a bad prediction. 


(In fairness to Frazier and the rest of myself, I want to add his next 
remark: “And what a strange discovery for a would-be tyrant, that the 
only effective technique of control is unselfish.” Frazier means, of course, 
positive reinforcement.)

We have no more reason to say that all psychologists should behave
as I have behaved than that they should all behave like R. A. Fisher.
The scientist, like any organism, is the product of a unique history. The 
practices which he finds most appropriate will depend in part upon this 
history. Fortunately, personal idiosyncrasies usually leave a negligible 
mark on science as public property. They are important only when we 
are concerned with the encouragement of scientists and the prosecution 
of research. When we have at last an adequate empirical account of the 
behavior of Man Thinking, we shall understand all this. Until then, it 
may be best not to try to fit all scientists into any single mold. 


REFERENCES 


1. Anliker, J. E. Personal communication. 

2. Breland, K., & Breland, Marion. A field of applied animal psychology. 
Amer. Psychologist, 1951, 6, 202-204. 

3. Heron, W. T., & Skinner, B. F. An apparatus for the study of be- 
havior. Psychol. Rec., 1939, 3, 166-176. 

4. Ratliff, F., & Blough, D. S. “Behavioral studies of visual processes 
in the pigeon,” Report of Contract N5ori-07663, Psychological Laboratories, 
Harvard Univer., Sept., 1954. 

5. Richter, C. P. Free research versus design research. Science, 1953, 
118, 91-93. 


THE STATISTICAL APPROACH TO
LEARNING THEORY¹

W. K. ESTES


Indiana University 


Part I. Orienting Attitudes and Systematic Viewpoint {4}
Part II. The Development of Statistical Learning Theory {2+, 3, 9}
Origins and Initial Formulation
Emergence of the statistical viewpoint
Formulation of a probabilistic response model
Stimulus-response relationships and learning
Response variables
Stimulus variables
[illegible]
Reinforcement and contiguity
Growth of Statistical Learning Theory since 1950
Tests of theoretical assumptions {8, 9}
Invariance of parameters
Effects of reinforcement
Effects of nonreinforcement
Independence of learning rates and asymptotes
[illegible]
Transfer and stimulus compounding
Generalizations of the original statistical model {2, 8, 9}
Variation in stimulus sampling probabilities
Stimulus fluctuation model for time-dependent processes
Learning models as stochastic processes
Extension to new empirical areas {2+, 3, 8, 9, 12}
Elementary discrimination learning
Classical conditioning and habituatory decrement
Parameters of reinforcement
Partial reinforcement and resistance to extinction
Escape and avoidance learning
Drive and conditioning
Verbal learning
Critically needed research
Part III. Systematic and Comparative Analysis
Formal Structure of Statistical Learning Theory {2-6}
Itemization of systematic concepts
Mode of definition of systematic concepts
Major interrelations among constructs
Response model
Stimulus sampling model
Stimulus fluctuation model
Stimulus-response relations
Learning
Itemization of empirical dependent and independent variables
Dependent variables
Independent variables
Interpretation of formal theory in terms of empirical variables
Rules of correspondence between set-theoretical constructs and empirical variables
Stimuli
Responses
Reinforcing events
Experimenter-determined variables
Derivation of experimental laws
Relations among response measures
Time-dependent phenomena
Stimulus manipulation
Simple learning
Discrimination learning
Methodology {2-4+}
Stages in theory construction
Exploration
Conceptualization
Axiomatization
Mathematical analysis
Empirical test
Exploitation
Generalization and extension
Formal organization
Mensurational and quantificational procedures {5}
[illegible]
Evaluation of the System {8-10}
Interrelations with other systems {7, 12}
Classical learning theories
[illegible]
[illegible]


1 Although responsibility for the present report must rest with the writer, the
theory under discussion is to an important extent a cooperative enterprise. Specific
citations in the text cannot do full justice to the contributions of present and former
associates at Indiana University who have collaborated in many researches, both
experimental and theoretical; Cletus J. Burke and Donald W. Lauer must especially
be mentioned. Also, I have profited by opportunities to discuss technical problems
and exchange ideas and results with a number of individuals who have been
conducting related research at other institutions, particularly Robert R. Bush, Frederick
Mosteller, and Patrick Suppes. Preparation of this manuscript was facilitated by a
fellowship period at the Center for Advanced Study in the Behavioral Sciences.
Finally, I am indebted to Douglas G. Ellson and to the editor of this volume for
detailed critical comments on the manuscript.


Part I. Orienting Attitudes and Systematic Viewpoint


Just as in any form of psychological investigation, the conclusions 
that will emerge from a study of theorists and theories are bound to be 
influenced by the choice of methods. As usual, we have available both 
the behavioral and the introspective approaches. We can look at what 
a theorist has done and try to infer his rules of operation, or we can 
simply ask him what he is trying to do and why. The difficulty with the 
behavioral approach in this instance is that we can rarely get adequate 
protocols; a theorist is usually happy enough to communicate his 
successes, but he can hardly be expected to rush into print with his 
failures, not, at least, if he recognizes them in time. The weakness of 
the introspective approach is that we cannot be sure of the causal 
relations between a man’s scientific activities and his systematic views. 
It is my suspicion that methodologies and philosophies of science rarely 
guide either research or theory construction and in fact usually take 
form only when we try in retrospect to give a plausible rationale for 
what we have been doing. We ask, in effect, from what principles it can 
be deduced that our methods have been sound and our results good. 

Does it follow that an exposition of one’s systematic views on theory 
and method can serve no purpose but to cover up the trail of trial and 
error and make the process of theory construction appear unrealisti- 
cally logical and systematic? This would be too hasty a conclusion. If 
the explorer of a new territory has been thoughtful enough to prepare 
us a map, we may find it useful even though we would not assume 
for a minute that the map played any guiding role in the original 
exploration. 

What is the import of these remarks for our present assignment, 
which is to give an account of the development and present status, the 
characteristic assumptions, methods, and results of the statistical 
approach to behavior theory? Firstly, since I believe that this theoretical
enterprise represents a response to particular circumstances
rather than the unfolding of a philosophical position, I shall be inclined
to emphasize the behavioral rather than the introspective
approach. The major part of the job will be to fill in the fragmentary
and unsystematic protocol available in the published record. This I 
hope to accomplish by chronicling as objectively as possible the 
circumstances that led to the first concrete attempts at formulating 
a statistical theory of learning and the events that have been responsi- 
ble for its subsequent growth and modification. By way of over-all 
orientation, I shall preface this account with a summary sketch of the 
systematic viewpoint that I associate with the statistical approach— 
but with the warning that the systematic viewpoint may be more a 
result than a cause of this line of theoretical development. And to 
further the comparative aims of the Project A committee, I shall 
conclude the account with an analytical summary of the morphological 
features of statistical learning theory as it appears frozen in cross sec- 
tion at the time of writing. 

To locate my systematic viewpoint in a familiar reference frame, 
we might characterize it as physicalistic, operational, and empirical. 
Individuals vary in their interpretation of these categorizing adjec- 
tives, however, so we had better pause for a bit of elucidation. 

By physicalism, I refer to a frank preference for carrying over the 
methods and goals of physical science into psychology and maintaining 
a common language in so far as it is possible to do so. Like all psy- 
chologists I have been duly exhorted to beware of modeling psychology 
on physics and regaled with example after example of once promising 
psychological theorists who succumbed to the fatal disease. But who 
has never detected a hollow ring to these exhortations or noticed possi- 
ble reinterpretations of the horrible examples? Perhaps in looking to 
physics we have taken the wrong models for psychological theory 
construction; perhaps, on the other hand, we have taken excellent
models but have been distracted by some of the glittering superficialities
from carrying over their more essential features. As a matter
of fact I am inclined to suspect that our habits of thinking about
natural phenomena are so deeply grounded in the history of physics,
and to some extent biology, that we cannot even communicate what
we mean by scientific progress, law, or theory except by reference to
these disciplines. If this is true, then we really have little choice in the
initial stages of behavioral theory construction but to start with the
concepts and techniques of the more mature sciences and modify or
replace these as need arises. The alternative, to strike off across a new
area armed only with faith and intuition, would make a fine show of
independence, but it might also minimize our chances of profiting by
experience.




“Operationism” is not, so far as I can tell, a very well-defined con- 
cept, but in common usage it generally indicates a persisting desire to 
make one’s ideas clear and a preference for concepts with direct experi- 
mental reference. Although subscribing cheerfully to these connota- 
tions, I intend further to emphasize the utility of distinguishing be- 
tween the basic descriptive terms, or “data language,” and the theo- 
retical terms associated with a scientific system or theory. It does not 
seem likely that anyone will question the importance of this distinction 
in principle. Unless we are to set a theory above both competition 
and empirical test, we must have some means of describing compara- 
tive judgments and tests without prejudicing the results; i.e., the 
theorist, the critic, and the experimenter must have a common lan- 
guage which is theoretically neutral in the sense that it does not involve 
tacit assumptions favoring either the given theory or its competitors. 

Implementing the distinction is no trivial matter, however, for our 
everyday language is saturated with informal psychological theory. 
Everyman’s theory is not necessarily always bad, but its terms and 
concepts—perception, motivation, purpose, expectation, goal, senti- 
ment, emotion, cognition, and all the rest—are so bound up with long- 
established linguistic habits that our every application of them in- 
evitably and inextricably confounds description and interpretation. 
If we are to know when we are describing and when interpreting 
behavior, we need to begin our systematizing with a clear view of 
experimental facts, a view free of distortion by unrecognized theo- 
retical assumptions. The descriptive system developed by Skinner and 
his followers [61, 85] and the deductive system of Hull’s later period 
[55, 56], despite differences on other points of methodology, attest to 
the possibility of basing fruitful psychological theories directly upon 
descriptions of observed behavior, without the intervention of intui- 
tively familiar “psychological” categories. 

It is instructive to note that both Hull and Skinner began their 
theoretical programs by attempting to remodel inherited vocabularies. 
For Hull it was purpose, goal, and idea; for Skinner, reflex termi- 
nology, which gradually faded out of the theoretical picture as empha- 
sis shifted from refurbishing jaded concepts to formulating new ones 
better attuned to the demands of experimental data. In this aspect of 
their development, these systems have recapitulated a sequence of 
events which is to be found in the history of every branch of natural 
science. Each new theoretical enterprise necessarily begins with terms 
and concepts carried over either from the vernacular or from other 
disciplines. Initially, the connotations these terms bring with them are 
often fruitful in suggesting new lines of investigation; but like other 
servants they are apt to outlive their usefulness without loosening their
hold on our affections. Permitted to linger on because of habit or
sentimental attachments, they become restrictive and misleading, and 
at the same time more firmly entrenched, so that eventually we can 
dislodge them only at the cost of diverting much energy from con- 
structive research. 

These comments are not intended to set the stage for an announce- 
ment that the statistical approach to learning theory has a magic 
formula for avoiding all the travails of its predecessors. They may, how- 
ever, serve to explain why many familiar psychological terms will be 
conspicuously missing from the following pages. The omission of terms 
having to do with motivation, perception, and cognition from the 
system developed by myself and my coworkers will undoubtedly 
qualify our theories automatically as narrowly behavioristic, rigidly 
mechanical, and blindly associationistic. Still, it is not unheard of for a 
theory to prove fruitful despite these sinister attributes. Progress in 
many disciplines has rewarded those who were fortunate enough to 
be narrow and rigid in the right place at the right time. In most 
matters of form and method, the statistical approach can, I think, 
claim to have been guided more by facts than by preconceptions. In 
one respect, real inflexibility must be conceded. The inductive bases of 
statistical learning theories are strictly limited to theoretically neutral 
descriptions of behavior. It will be interesting to see whether phenom- 
ena associated with motivation, perception, and cognition can be 
reached “from below” by an approach which builds upon objective 
descriptions of behavior or whether, as many psychologists appear to 
believe, these phenomena will be reserved exclusively for more intuitive
or holistic approaches.

In characterizing my systematic position as empirical I mean to 
signify more than an intention to keep one foot in the laboratory. I 
shall take no stand on a number of the most debated issues of systematic
psychology simply because I regard them as matters to be settled
by experiment, not by argument. These include questions as to the
proper level of analysis for psychological constructs, the reducibility
of psychological to physiological theory, limitations on the level of
psychological measurement, and the degree of quantitative specificity
psychological theories should aim for. On such points I prefer to think
in terms of testing hypotheses rather than defending opinions. Indeed,
in a sense, the theoretical enterprise to be described may be regarded
as an experiment intended to test the hypothesis that learning data are
amenable to interpretation by abstract quantitative theories comparable
in predictive power to those of the physical sciences. The progress
report to be given here will try primarily to make clear the “apparatus
and procedure” of this experiment. Some preliminary results are
available, but not enough, I am afraid, to warrant the customary
section on “conclusions.”


Part II. The Development of Statistical Learning 
Theory 


ORIGINS AND INITIAL FORMULATION 


Emergence of the Statistical Viewpoint 


At times in the history of a science, the empirical situation seems 
uncompromisingly to demand a particular kind of theory. The de- 
mand becomes evident, of course, only after the fact when one observes 
that several very similar theories have blossomed simultaneously and 
independently. From evidence of this sort, a reader of the contem- 
porary literature on learning must conclude that the state of the field 
in the late 1940s was ripe for a probabilistic or statistical type of learn- 
ing theory. 

In retrospect, one can discern several lines of influence pointing 
toward quantitative learning theory, and specifically toward theory of 
a probabilistic rather than deterministic character. Pressures came 
from both theoretical and empirical developments of the immediate 
prewar and postwar periods. Attempts to decide experimentally among 
the qualitatively stated postulates or principles of reinforcement, con- 
tiguity, and expectancy theories had reached a virtual impasse. It 
became gradually clear that effective differential testing of assumptions 
concerning the necessary and sufficient conditions of elementary 
learning had to wait upon more precise formulations of hypotheses and 
more effective machinery for deriving their testable consequences. At 
the same time, experimental programs associated with the systems of 
Hull and Skinner were beginning to accumulate sets of orderly and 
reproducible data concerning functional relationships in elementary 
learning that appeared suitable for quantitative analysis. 

The first major attempt to formulate a general quantitative theory 
of learning, that of Hull [55], followed the classical tradition in propos- 
ing essentially a deterministic growth model. Learning was to be 
represented by the growth and decay of two hypothetical entities, 
habit strength and inhibition. Specifically, it was assumed that for any 
fixed set of conditions, each reinforcement produces a definite, predict- 
able increment in habit strength, the increments summating to an 
exponential growth curve over a series of trials, and that each response 
evocation produces a definite, predictable increment in inhibition,
the total amount of inhibition decaying as an exponential function of
time following the response. A simple growth and decay model was 
clearly inadequate, however, to cope with the observed variability of 
behavior, and Hull supplemented it by postulating an autonomous 
oscillatory process which permitted the effective excitatory strength of 
a response to vary from moment to moment around the mean value 
determined by such independent variables as preceding reinforce- 
ments. The resulting hybrid model has been applied to instrumental 
conditioning data with some success [57, 68, 86, 87], but in situations
of only slightly greater complexity, e.g., multiple choice problems with 
partial reinforcement, it becomes all but unmanageable for calcula- 
tional purposes. Inevitably an alternative strategy to Hull’s had to be 
tried. The obvious alternative was to assume that learning is basically 
a probabilistic rather than a deterministic process and to formulate 
theoretical assumptions in terms of changing behavioral probabilities 
rather than in terms of an underlying growth process. 
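
The growth-and-decay assumptions described above can be written out in a few lines. The following is a minimal sketch and not Hull's own notation: it assumes that each reinforcement adds a fixed fraction `growth` of the remaining distance to a ceiling `h_max`, which yields a curve of exponential form over trials, and that inhibition falls off as `exp(-decay * t)` after the response. The function names and parameter values are illustrative only, and the oscillatory process is left out.

```python
import math


def habit_strength(n_reinforcements, h_max=1.0, growth=0.2):
    """Habit strength after n reinforcements, built up one increment at a time."""
    h = 0.0
    for _ in range(n_reinforcements):
        # Assumed rule: each increment is a fixed fraction of what remains.
        h += growth * (h_max - h)
    return h


def habit_strength_closed_form(n_reinforcements, h_max=1.0, growth=0.2):
    """The same curve in closed form: h_max * (1 - (1 - growth) ** n)."""
    return h_max * (1.0 - (1.0 - growth) ** n_reinforcements)


def inhibition(t, initial=0.5, decay=0.3):
    """Inhibition remaining t time units after the response that produced it."""
    return initial * math.exp(-decay * t)


if __name__ == "__main__":
    for n in (0, 1, 5, 20):
        print(n, round(habit_strength(n), 4))
    for t in (0, 1, 5):
        print(t, round(inhibition(t), 4))
```

Because every quantity here is fixed once the conditions are fixed, the sketch produces the same value on every run; that is the sense in which such a model is deterministic and why a separate source of variability had to be added to it.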

During the period from about 1948 to 1951, there appeared at least 
four distinct and independent attempts to recast learning theory in a
probabilistic form. These were Mueller’s application of a Poisson 
model to measures of conditioning [75], Miller and Frick’s application 
of information theory to operant behavior [73], Bush and Mosteller’s 
stochastic model [12], and my own first approximation to a statistical 
model for associative learning [19]. All four of these approaches are 
alike in taking probabilities of observed responses as primary theo- 
retical dependent variables and deriving relationships among experi- 
mental measures (e.g., response times, rates, relative frequencies) by 
straightforward applications of probability theory. The last approach 
attempts also to represent the independent variables of learning experi- 
ments in terms of statistical distributions of events and may appropri- 
ately be termed statistical learning theory to distinguish it within the 
more general class of probabilistic or stochastic theories.
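
One simple way in which experimental measures can be derived from a response probability is sketched below. It is offered as an illustration rather than as the formulation of any of the four approaches: it assumes that time is divided into equal intervals and that the response occurs independently with the same probability `p` in each interval. Under that assumption the waiting time to the first response is geometrically distributed, the mean latency is `1 / p` intervals, and the expected rate is `p` responses per interval. The function names are hypothetical.

```python
def expected_latency(p):
    """Mean number of intervals up to and including the first response."""
    if not 0.0 < p <= 1.0:
        raise ValueError("p must lie in (0, 1]")
    return 1.0 / p


def expected_rate(p, interval=1.0):
    """Expected responses per unit time when each interval has duration `interval`."""
    return p / interval


def latency_distribution(p, max_intervals):
    """P(first response falls in interval k) for k = 1 .. max_intervals."""
    return [(1.0 - p) ** (k - 1) * p for k in range(1, max_intervals + 1)]


if __name__ == "__main__":
    for p in (0.1, 0.25, 0.5):
        print(p, expected_latency(p), expected_rate(p))
```

A rise in `p` over trials therefore shows up at once as a shorter latency and a higher rate, which is the kind of relation among measures that a probabilistic theory is meant to supply.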

The interrelationships of statistical learning theory and its con- 
temporaries will have to be studied in some detail if we are to attain a 
reasonable perspective with respect to the current theoretical scene; 
this we shall try to accomplish. For convenience in exposition, I shall 
first review the development and present status of the particular 
theory with which I have had firsthand experience, and then under- 
take a comparative analysis in a final section. 

In one sense, statistical learning theory is by no means new. Many 
of the basic ideas can be traced back at least through a generation or 
two of behavioral theorists, and some of them have been developed 
along different lines by other contemporary theorists. The question of 
genealogy I shall not try to push very far. The more remote origins of 
the present theory would be hard to track down, and the task of
separating the influences of heredity and environment is well known 
to be always difficult and frequently unrewarding, even when one is 
not working under the handicap of personal involvement. 

The mathematical model which provided the formal basis for my 
1950 paper on statistical learning theory [19] was initially formulated 
in the summer of 1948. During the preceding year, I and another 
Skinner-trained experimentalist, Norman Guttman, had finished 
constructing a fairly elaborate apparatus for studying operant behavior
in the rat [47], and by way of breaking in the apparatus we
had replicated several of Skinner’s basic experiments on acquisition 
and extinction of the bar-pressing habit. The acquisition curves ob- 
tained during this “warm-up” period furnished the immediate instiga- 
tion to formulate a statistical model for learning. Some of the proper- 
ties of these curves which seemed to cry out for quantitative treatment 
will be apparent from the examples shown in Fig. 1. Let us now at- 
tempt to reconstruct the reasoning which led from data to theory. 
Perhaps the most striking features of the acquisition curves are 
their orderliness and reproducibility. The course of acquisition is 
typically a smooth acceleration to a stable asymptotic response rate; 
the form of the curve is very similar from one animal to another, and
(note, e.g., curves A and B or C and D in Fig. 1) from one occasion to
the next for a given animal. But where does one go from here? The 
cause of science will not be properly served if we do no more than mount 
our fine graphs on the laboratory wall where they are readily avail- 
able for rapturous contemplation. If the important features of the 
course of acquisition are to be summarized, communicated, and inte- 
grated with other findings, we must have a mathematical description 
of these curves. And since the curves are smooth and reproducible, 
obtaining such a description should not be difficult. In fact, upon try- 
ing our hand at a bit of curve fitting we find that on the contrary, it is 
too easy. Any number of simple mathematical functions can be fitted to
the observed acquisition curves. In Fig. 2 we see the result of fitting a 
single empirical curve with two different mathematical functions. If we 
require only descriptive adequacy, then our choice of functions is 
clearly going to be quite arbitrary. If we are not willing to settle for 
an arbitrary choice, then some further reflection is in order.
Why are we so reluctant to make an arbitrary choice of functions? 
Obviously because we are hoping eventually to achieve more than an 
accurate description of acquisition in one particular situation. We 
would like to arrive at a description which will hold for a variety of 
situations. And we would like to find that the parameters in our func- 
tions have theoretical significance, i.e., that their values reflect 
manipulations of variables in some reasonable way. These aspirations 
bring us up against a conservation law which has been recognized more 
or less explicitly since the time of John Locke to hold for all theoretical
undertakings. Theories transform information; they do not create it. 
If we hope to get more out of our mathematical model than a descrip- 
tion of a particular set of facts, then we will have to take a broader 
range of facts into account when formulating the model. 

One additional source of relevant information concerning the 
acquisition of bar pressing is immediately at hand. Although we were 
initially most impressed by the smoothly accelerated acquisition curves 
generated by the majority of animals, we have not overlooked the fact 
that exceptions occur. The exceptions are of two main types, as illustrated
in Fig. 3. Some animals, e.g., those represented by curves A and
C in Fig. 3, exhibit an almost instantaneous shift from near-zero to
near-maximum rate of responding. On the other hand, some animals,
e.g., those represented by curves B and C in Fig. 3, make a few responses
and appear to be following the normal course of acquisition,
then inexplicably cease responding, sometimes for considerable 
periods. Clearly all these cases cannot readily be handled by a model 
which assumes that acquisition is described by the same simple func- 
tion in all animals. We might get around the difficulty by assuming 
that there are several types of learners, each with its own learning 
function. But this way out would yield little satisfaction. We have no 
independent criteria for classifying types of learners; and once started
on such a course we could look forward to nothing but an endless series 
of ad hoc classifications as the same problem recurred in new situations. 

Before giving serious consideration to any classificatory scheme, we 
might well examine our working assumptions. Conceivably our initial 
efforts at model building have been guided by some tacit assumptions 
which reflect traditional views about the nature of learning rather 
than the properties of the data we are trying to account for. This line of 
inquiry brings an immediate flood of light. In looking for a simple func- 
tion to describe acquisition in all animals, we have followed the lead of 
numerous predecessors [e.g., 41, 45, 55] to the extent of tacitly assuming
that learning may be treated as some sort of deterministic growth
process. Once this assumption is made explicit, we see that it is by no
means a necessary one. Possibly the learning process is basically
statistical. The changing correlations between empirical response and
stimulus variables which provide the observable symptoms of learning
might result either from the formation of many associative relations,
bonds, or connections at a more “molecular” level, or from the formation
of unitary “molar” connections on a probabilistic basis. If there
were trial-to-trial sampling variations at either the “molecular” or the
“molar” level, the typical smooth acquisition curve might be a resultant
of averaging. At the same time the atypical results shown in Fig. 3
might also be accounted for, since in any statistical process exceptional
sequences are bound to occur occasionally.

Closer examination of the “typical” acquisition curves brings further
evidence in support of a statistical hypothesis. Initially we considered
only cumulative curves of the sort shown in the first three
figures. If we amplify these records, or better yet take noncumulative 
records on a faster tape, and measure the times between successive 
responses, we find that even the smooth curves of Fig. 1 are the result 
of an averaging process. In Fig. 4 we can see what very different im- 
pressions concerning the nature of the learning process are generated 
by a cumulative response curve as compared to a frequency distribu- 
tion of interresponse times for the same data. 

If we yield to the demands of the data and adopt provisionally the 
view that acquisition is a basically statistical process, then we must also 
change our whole approach to the problem of model construction. We 
shall no longer expect to find any one single equation that will describe 
learning in all animals. Rather than continue what is apparently 
destined to be a fruitless search for “the learning function” we must ask 
what particular set of concepts and assumptions will enable us both to 
deduce an average curve descriptive of the over-all course of acquisi- 
tion and at the same time to account for deviations from group trends. 


Formulation of a Probabilistic Response Model 


Since we wish first of all to arrive at precise descriptions of ele- 
mentary forms of learning, we have little latitude in our initial choice 
of dependent variables. In the kind of experimental situation we have 
taken as our empirical starting point, the response variable to be used 
in tracing the course of learning is chosen at will by the experimenter. 
The behaviors available to the organism at any given time are classi- 
fied in accordance with objective criteria. In this context, then, the 
term response refers not to a physiologically or anatomically defined 
unit, but to a class of activities, e.g., all behaviors which result in 
depression of a bar, all behaviors which result in cutting a photocell 
beam or entering an end box of a maze, all behaviors which eventuate 
in the appearance of the syllable VEC in a particular blank space of 
an answer sheet. Testable predictions of behavior in the Skinner box, 
T maze, or paired-associate learning situation must refer to frequencies 
or times of these operationally defined “responses.” Yet we can only 
afford to follow the lead of the experimenter up to a point. It would be 
extremely inconvenient for theoretical purposes to have one set of 
acquisition laws relating to response frequencies, another set relating 
to latencies, and so on. Some unifying construct is necessary. 

We could at this point simply introduce a new construct, similar to 
Hull’s sER, and relate it to the various experimentally defined response
measures by special postulates. Such a course would have the obvious 
attraction of allowing us to get on quickly to the more central problems
of learning; but it would have the equally obvious drawback of
leaving us with a number of arbitrary functions intervening between 
our learning theory and the testable predictions derivable from the 
theory. We would have to look forward to continual difficulties in 
deciding whether discrepancies between theoretical predictions and 
experimental results reflect inadequacies in our assumptions about 
learning or merely unfortunate choices of the intervening functions. 

Our unwillingness to face an unending sequence of ad hoc assump- 
tions about response measures in each new situation, together with 
the fact that we have found strong reasons to regard behavior as 
essentially probabilistic, suggests a possible way out. Suppose that we 
take as our basic theoretical dependent variable the probability of 
occurrence associated with any given operationally defined response
class. The desired unification can then be achieved by formulating 
laws of learning in terms of response probability as a dependent 
variable. At the same time the need for a multiplicity of postulates 
relating to response measures is minimized. By taking time out for 
some paper and pencil investigations, we find that by applying the
mathematical theory of probability we can derive once and for all the 
relations between response probability and such empirically definable 
measures as frequency, rate, and latency or reaction time. Further, it 
becomes clear that once we have laws of learning stated in terms of 
changes in response probability, continued application of the same 
mathematical theory will enable us to generate predictions of empirical 
relationships involving such measures as trials to a criterion of learning, 
errors to a criterion, responses to extinction, and so on.

Let us digress for a moment in order to illustrate the last point.
Suppose that the minimum time required to execute a given response
is h min. Then in a free responding situation the maximum rate of
occurrence of the response is certainly 1/h responses per minute; and
it is fairly obvious that if the probability of the response during any one
interval h is p, then the average rate of responding will be

$$ r = p/h \qquad (1) $$

For example, if h = 0.05 min and p = .75, then the maximum rate of
responding is 20 responses per minute and the average predicted rate
is 15 responses per minute. Further it can be shown [19] that if p is
constant within a trial, the mean predicted latency of the response will
be given by

$$ L = h/p \qquad (2) $$
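The two relations just stated, average rate p/h and mean latency h/p, can be checked numerically. The following is a minimal sketch, not part of the original derivation. It assumes that time is divided into successive intervals of length h, that a response occurs in each interval independently with probability p, and that latency is counted in whole intervals up to and including the one containing the response. The function names and the simulation helper are illustrative only.

```python
import random


def mean_rate(p: float, h: float) -> float:
    """Average responses per minute: one response per interval h with probability p."""
    return p / h


def mean_latency(p: float, h: float) -> float:
    """Expected waiting time (min) to the first response when p is constant."""
    return h / p


def simulate_latency(p: float, h: float, trials: int, seed: int = 0) -> float:
    """Monte Carlo estimate of mean latency (hypothetical helper).

    Assumption: latency is the number of intervals up to and including
    the first interval that contains a response, times h.
    """
    rng = random.Random(seed)
    total_intervals = 0
    for _ in range(trials):
        intervals = 1
        while rng.random() >= p:  # no response in this interval
            intervals += 1
        total_intervals += intervals
    return h * total_intervals / trials


if __name__ == "__main__":
    p, h = 0.75, 0.05  # values from the worked example in the text
    print("maximum rate:", 1.0 / h)
    print("average rate:", mean_rate(p, h))
    print("mean latency (formula):", mean_latency(p, h))
    print("mean latency (simulated):", simulate_latency(p, h, 20000))
```

With the values of the text's example the formula gives an average rate of about 15 responses per minute, and the simulated latency should fall close to h/p.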
Now suppose that a learning theory yields the following acquisition
law for a given situation,

$$ p_n = 1 - (1 - \theta)^{n-1} \qquad (3) $$

where $p_n$ is response probability after $n - 1$ reinforced trials and $\theta$ is a
parameter, $0 < \theta \le 1$. We cannot test the law directly, since probabilities
are purely theoretical entities, but we can readily obtain
predicted relationships among observables. Thus for the situation in
question we must predict that the rate of responding will be related to
number of reinforcements by the equation

$$ r_n = \frac{p_n}{h} = \frac{1 - (1 - \theta)^{n-1}}{h} $$

where $r_n$ is to be read as “average rate after $n - 1$ reinforcements,”
and that mean latency will be related to number of reinforcements by

$$ L_n = \frac{h}{1 - (1 - \theta)^{n-1}} $$
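To make the substitution concrete, the sketch below tabulates the predicted rate and latency trial by trial from the acquisition law $p_n = 1 - (1 - \theta)^{n-1}$. The parameter values are arbitrary illustrations, not estimates from any experiment, and the function names are hypothetical. Note that the law gives $p_1 = 0$, so the predicted latency on the first trial is unbounded; the code returns infinity in that case.

```python
import math


def p_acq(n: int, theta: float) -> float:
    """Response probability after n - 1 reinforced trials (acquisition law)."""
    return 1.0 - (1.0 - theta) ** (n - 1)


def rate(n: int, theta: float, h: float) -> float:
    """Predicted average rate after n - 1 reinforcements: p_n / h."""
    return p_acq(n, theta) / h


def latency(n: int, theta: float, h: float) -> float:
    """Predicted mean latency after n - 1 reinforcements: h / p_n."""
    p = p_acq(n, theta)
    return math.inf if p == 0.0 else h / p


if __name__ == "__main__":
    theta, h = 0.2, 0.05  # illustrative values only
    for n in (1, 2, 5, 10, 20):
        print(n, round(rate(n, theta, h), 2), round(latency(n, theta, h), 3))
```

Rate rises toward its ceiling of 1/h while latency falls toward its floor of h, both governed by the single parameter $\theta$.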


These expressions result simply from substituting Eq. (3) into Eqs. (1)
and (2), respectively. A little algebra yields still other relationships.
If $p_n$ represents probability of a correct turn in a T maze, then $1 - p_n$
must represent probability of an error. Therefore e(K), the predicted
number of errors in K trials, can be obtained by adding the values of
$1 - p_n$ over trials 1 to K:

$$
\begin{aligned}
e(K) &= \sum_{n=1}^{K} (1 - p_n) \\
     &= \sum_{n=1}^{K} (1 - \theta)^{n-1} \\
     &= \frac{1 - (1 - \theta)^{K}}{\theta}
\end{aligned}
$$

The technique for performing the summation is given in [82], and in
almost any text on college algebra. As K becomes large, $(1 - \theta)^K$
approaches zero, so the expected total number of errors during learning
is equal to $1/\theta$. Thus given the form of the acquisition function and
the value of the parameter $\theta$ for a given subject, we can make predictions
about rate, latencies, total errors, and many other measures. How
we can ascertain the value of $\theta$ is a pertinent question, but to deal with
it now would be getting ahead of the story. We shall come back to this
question later. For the present, I wish only to emphasize the fact that
once response probability has been chosen as our basic dependent
variable, it can be related to a variety of observables by derivations
which depend only on the assumptions common to all applications of
probability theory.
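The error-count derivation is a finite geometric series, and it can be verified by brute force. The sketch below sums the error probabilities $1 - p_n = (1 - \theta)^{n-1}$ directly and compares the result with the closed form $[1 - (1 - \theta)^K]/\theta$ and with the limit $1/\theta$. The value of $\theta$ is an arbitrary illustration and the function names are hypothetical.

```python
def expected_errors(K: int, theta: float) -> float:
    """Direct sum of error probabilities (1 - theta)**(n - 1) for n = 1..K."""
    return sum((1.0 - theta) ** (n - 1) for n in range(1, K + 1))


def expected_errors_closed(K: int, theta: float) -> float:
    """Closed form of the same geometric sum."""
    return (1.0 - (1.0 - theta) ** K) / theta


if __name__ == "__main__":
    theta = 0.1  # illustrative value only
    for K in (1, 5, 20, 100):
        print(K, expected_errors(K, theta), expected_errors_closed(K, theta))
    print("limit as K grows:", 1.0 / theta)
```

The two columns agree to rounding error, and both approach $1/\theta$ as K increases.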

One may also note that our response model is in all essentials the 
same as the one assumed tacitly or explicitly in many other branches of 
psychology, e.g., psychophysics, test theory, communication theory. 
In one salient respect our strategy is almost exactly the reverse of that 
which has typified many preceding learning theories. The more com- 
mon procedure has been to start by formulating assumptions or 
postulates about learning and only at a later stage to begin worrying 
about making contact between the theory and observables. One of the 
distinctive features of the present approach is that in the early stages 
of theory building we have concentrated our efforts primarily on 
establishing a secure foundation. We have no guarantee that our 
approach will lead to anything passable as a learning theory, but we 
can have some confidence that if we do arrive at a theory, it will not 
be found floating idly in mid-air while the constructors try belatedly 
to drop a few ad hoc mooring lines down to the plane of observables. 


Stimulus-Response Relationships and Learning


Construction of what appears to be a serviceable response model 
represents tangible progress, but it does not quite complete our 
theoretical foundation. Before we can proceed with the superstructure, 
we will have to deal with one additional preliminary matter, namely, 
the question of precisely what we are going to mean by “learning.” 
So long as we thought only in terms of a particular set of data this 
question did not arise. Everyone will agree that acquisition of the bar- 
pressing habit, as represented by the curves of Fig. 1, is an instance 
of learning. Starting from this example, we might surmise that learn- 
ing can be provisionally defined as a systematic change in response 
probability. Not all systematic changes in response probability will 
qualify, however. If the experimental sessions had been sufficiently 
prolonged, the curves shown in Fig. 1 would have begun to decelerate
and rate of bar pressing would have fallen nearly to zero even though 
all responses were being rewarded. We would want to attribute the 
decline in response probability to satiation, however, not to unlearn- 
ing. Similarly, if we changed the situation radically, e.g., by turning 
off the lights in the boxes, by adding buzzers or vibrators, or by chang- 
ing the inertia of the bar, we would certainly produce a change in 
probability of bar pressing; but again we would want to distinguish
this from the change in probability that occurred during acquisition 
and from the change that would occur if we disconnected the magazine 
and permitted extinction to occur.

These distinctions are apparently based on the extent to which the 
changes in response probability can be reversed without additional
exposure to the given experimental situation. If response probability 
declines as a result of feeding, the original level can be restored by a 
period of fasting; if probability is changed as a result of a shift in 
stimulus conditions, the original probability will again obtain if we 
restore the original stimulus conditions. But the changes produced by 
a series of reinforced learning trials evidently cannot be entirely 
undone except by extinction trials, and the changes produced by 
unreinforced trials cannot be entirely undone except by reacquisition. 

Further, it seems clear that we want to include under “learning” 
only changes in response probability relative to given stimulus variables. 
In the elementary operant conditioning situation, response probability 
is defined relative to the entire stimulating situation. But we can foresee 
that later we will want to deal with experiments in which learning is 
represented by changes in response probability relative to some inde- 
pendently manipulable component or property of a situation. In the 
Skinner box, the rat may learn to press the bar only when a tone is 
sounding; in a paired-associate experiment the subject learns to make 
a given response only in the presence of one particular stimulus word, 
and so on. We can allow for all these possibilities if we agree to con- 
ceptualize learning in terms of probability relations (referred to in the 
sequel as conditional relationships, or, for brevity, connections) 
between operationally defined response classes and operationally 
defined classes of stimulating situations. 

These preliminaries disposed of, we move into an area of consider- 
ably greater uncertainty. The decisions we have made up to this point 
have been dictated closely by empirical considerations. Barring major 
changes in our over-all objective or our underlying philosophy, the 
probabilistic framework we have developed should remain relatively 
fixed while we proceed to fill in the outline with detailed assumptions 
concerning the nature of stimulus-response relations and the manner of
their formation and abolishment. We shall try to continue following 
the facts, but we may not always be sure where they are trying to lead 
us. Probably the most we should hope for is that by trying out reason- 
able appearing assumptions, testing their consequences, and modifying 
them as necessary, we may converge through a series of approximations 
toward an adequate theory. 

In order to make further progress toward a quantitative learning 
theory, we now must make some specific and detailed assumptions 
about the form of the probability relations between stimulus and
response variables. One of the first questions to be settled is whether we
shall conceive the relation between a response class and a stimulating
situation to be unitary or to be composed of more elementary component
relations. Two considerations nudge us toward the latter
alternative. First, a considerable body of data from elementary learn- 
ing experiments (e.g., experiments on “continuity” in discrimination 
learning, stimulus compounding, transfer of training, retroactive 
inhibition, stimulus generalization) seems to demand a property which 
we might call “transposability.” By this I refer to the fact that when a
period of learning has increased the probability that a complex 
stimulus will evoke a given response, then, other things equal, various 
components or aspects of that stimulus will exhibit an increased 
probability of evoking the response if tested in a new situation. This 
property of transposability will characterize our theoretical model if 
we assume that the conditional relations between a response class and 
the various components or aspects of a stimulating situation are 
mutually independent. In order to be noncommittal, for the moment, 
about the nature of the components or aspects, let us refer to them 
simply as elements. It will be a task for future research to determine 
just what features of a stimulating situation may correspond to the 
stimulus elements defined in the model. 

The second consideration favoring a “molecular” hypothesis is the 
necessity of accounting both for the typical gradual course of learning 
illustrated in Fig. 1 and the abrupt changes seen in Fig. 3. The appear- 
ance of both types of curves under apparently identical circumstances 
suggests that some kind of sampling process might be involved. And 
the term sampling at once brings to mind two ideas which prove ex- 
tremely helpful in the touchy business of converting our somewhat 
vague and qualitative notions into an explicit quantitative model— 
one is a chemical analogy and the other is Guthrie’s treatment of 
learning by repetition. 

The chemical analogy will become apparent if we pause a moment 
to summarize the properties which appear to be demanded of an 
acquisition model:

1. The average course of acquisition is given by a curve which is
negatively accelerated in its terminal phase and which approaches
a stable asymptote.

2. The smooth mean curve is the result of averaging over a Poisson-like
distribution of latencies or interresponse times, and the
asymptote of the observed learning curve represents merely a point of
statistical equilibrium.

3. A relation between observed stimulus and response variables is
decomposable into numerous component relations which are formed
via some kind of sampling process.

We see that the quantitative properties which characterize simple 
acquisition are the same as those which characterize mass-action
chemical processes. Therefore it is reasonable to hope that the mathematical
model which is known to describe the latter will provide at
least a good starting point for a description of the former. 

With our preliminary analyses completed and the chemical 
analogy to guide us through the next stage, we are finally ready to set 
down our assumptions in quantitative form. We recall that the simplest 
mass-action chemical process is described by a relation of the form: 


$$ \Delta x = q(S - x) \qquad (4) $$


where x is the momentary concentration of a given reaction product, S
is the maximum concentration, q is a proportionality constant, and
$\Delta x$ is the change in x during a unit time interval. The formula can be
reinterpreted in terms of the learning situation. If we represent a
stimulating situation by a set, or population, of stimulus elements,
then S will be the number of elements in the set and q will be the
proportion of these elements representing the effective stimulation on
any one trial.

We have already decided that the organism’s possible behaviors in 
the situation are to be categorized into operationally defined classes. 
For simplicity we shall limit ourselves at the start to experiments in 
which only two response classes, call them $A_1$ and $A_2$, are recognized.
(In the operant conditioning experiment, for example, $A_1$ will be any
behavior which results in depression of the bar during a given time
interval and $A_2$ will be any behavior which does not result in depression
of the bar during this interval.) Continuing the interpretation,
we assume that at any time each element in the stimulus set S bears
a conditional relation to (is connected to) one or the other of these
response classes. This relation is such that if all elements are equally
likely to be sampled on any trial, the probability of $A_1$ is equal to the
proportion of elements connected to $A_1$. If $A_1$ is the response that is
being recorded during learning, then the quantity x in Eq. (4) represents
the number of stimulus elements in S that are connected to $A_1$ at a
given time, and $\Delta x$ represents the average change in the number of
elements connected to $A_1$ on any one learning trial. (In the bar-pressing
experiment a “trial” will correspond to the occurrence and reinforcement
of a bar-pressing response.) Since by our assumptions the
ratio x/S is equal to p, the probability of response $A_1$, we can divide
both sides of Eq. (4) by S and obtain the relation


$$ \Delta p = q(1 - p) = \theta(1 - p) $$

or equivalently

$$ p_{n+1} = p_n + \theta(1 - p_n) \qquad (5) $$

where n represents ordinal number of the trial and the constant of
proportionality q has been relabeled $\theta$ in accord with the notation used
in later work.

Before discussing the empirical implications of Eq. (5), it may be
helpful to summarize the concepts and assumptions underlying it. 

Response variables. The organism’s possible behaviors in the 
given experimental situation are categorized into mutually exclusive 
and exhaustive response classes on the basis of objective criteria. With
each response class, we associate a probability. In the operant condi- 
tioning situation, the momentary probability of the bar-pressing 
response is the theoretical dependent variable. Relations between 
response probabilities and experimentally defined response measures 
are derivable by application of probability theory. 

Stimulus variables. The stimulating situation is represented by a 
set of S stimulus elements and the effective stimulation on any trial is 
represented by a randomly drawn subset of elements. At any time 
each element in S is connected to exactly one response class. As a first
approximation it is assumed that all stimulus elements are equally
likely to be sampled and that the probability of a response at any time
is equal to the proportion of elements in S that are connected to it.

Learning. On any acquisition trial all stimulus elements sampled 
by the organism become connected to the response reinforced on that 
trial.

It will be noted that our rudimentary theory shares several impor- 
tant properties with Guthrie’s [46] treatment of associative learning. 
On any one trial, learning occurs on an all-or-none basis relative to the 
components or aspects of the stimulating situation which actually 
affect the organism, but not all components are available on any one 
trial. Over a series of trials, response probability changes by a series of 
discrete jumps, the size of the jump being a random variable; but if we 
average over a number of replications of an experiment (i.e., over the 
set of empirical learning functions generated by a group of organisms 
or by a series of learnings and relearnings by the same organism), the 
random variation will be smoothed by the averaging. The mean curve 
may turn out to be as simple in form as that arising from a continuous-growth
postulate such as Hull’s.
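The contrast between jumpy individual records and a smooth mean curve can be illustrated by simulation. The sketch below is one possible reading of the assumptions summarized above, not the author's own computation. It assumes a set of S elements, each sampled independently with probability $\theta$ on every trial, with every sampled element becoming connected to the reinforced response $A_1$. All names and parameter values are hypothetical. Under independent sampling, the expected proportion of connected elements before trial n is $1 - (1 - p_1)(1 - \theta)^{n-1}$, which is the curve the group mean is compared with.

```python
import random


def simulate_subject(S, theta, trials, rng):
    """One simulated learner; returns p_n (proportion connected) for n = 1..trials."""
    connected = [False] * S  # assumption: no element starts connected to A1
    curve = []
    for _ in range(trials):
        curve.append(sum(connected) / S)  # response probability on this trial
        for i in range(S):
            if rng.random() < theta:      # element i is sampled on this trial
                connected[i] = True       # all-or-none connection to A1
    return curve


def mean_curve(n_subjects, S, theta, trials, seed=0):
    """Average the individual curves over a group of simulated learners."""
    rng = random.Random(seed)
    totals = [0.0] * trials
    for _ in range(n_subjects):
        for n, p in enumerate(simulate_subject(S, theta, trials, rng)):
            totals[n] += p
    return [t / n_subjects for t in totals]


def expected_curve(n, theta, p1=0.0):
    """Expected p_n under the model: 1 - (1 - p1)(1 - theta)**(n - 1)."""
    return 1.0 - (1.0 - p1) * (1.0 - theta) ** (n - 1)


if __name__ == "__main__":
    one = simulate_subject(S=20, theta=0.15, trials=15, rng=random.Random(1))
    print("single subject:", [round(p, 2) for p in one])
    avg = mean_curve(n_subjects=500, S=20, theta=0.15, trials=15)
    print("group mean:    ", [round(p, 2) for p in avg])
    print("expected:      ", [round(expected_curve(n, 0.15), 2) for n in range(1, 16)])
```

A single record moves in discrete steps of random size, whereas the group mean tracks the smooth expected curve closely. Making S small exaggerates the jumps in individual records.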

It might seem at a first glance that our model is incomplete, even as 
a limited theory, in that we have not introduced any assumptions 
about extinction, or unlearning. Further analysis reveals, however, 
that the assumptions already made provide a mechanism for unlearn- 
ing. At any time the response which is reinforced receives an increment 
in probability according to Eq. (5). Since the response classes defined
for a given situation are mutually exclusive and exhaustive, their
probabilities must add up to unity at all times; therefore, any increase
in the probability of one response must be accompanied by a compensating
decrease in the probability of other responses. If, for example,
response classes $A_i$, $A_j$, and $A_k$ are defined for a given situation,
and if the probability $p_i$ of response $A_i$ increases by the amount

$$ \Delta p_i = \theta(1 - p_i) $$

on a given trial, then the combined probability of $A_j$ and $A_k$ must decrease
by the amount

$$ \Delta(p_j + p_k) = -\theta(1 - p_i) = -\theta(p_j + p_k) $$

If the two changes are added together, we get

$$ \Delta p_i + \Delta(p_j + p_k) = \theta(1 - p_i - p_j - p_k) = \theta(1 - 1) = 0 $$

as required.

This result may be stated another way in terms of the model. 
Suppose a particular element of the stimulus set S is connected to
response $A_j$; if now this element becomes connected to a different
response $A_i$ on some learning trial, it is at the same time disconnected
from $A_j$. In more familiar psychological terms we could say that given
an association, or conditioning, principle and a probability model for 
response variables, a principle of unlearning by interference follows as 
a theorem. 
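The bookkeeping behind unlearning by interference can be shown with a few lines of arithmetic. The sketch below applies the expected one-trial change to three response classes: the reinforced class gains $\theta(1 - p)$, and, as an assumption consistent with equally likely sampling of elements, each of the other classes loses the fraction $\theta$ of its current probability. The class labels and starting values are illustrative only.

```python
def reinforce(probs: dict, reinforced: str, theta: float) -> dict:
    """Expected probabilities after one trial on which `reinforced` is reinforced.

    The reinforced class gains theta * (1 - p); every other class keeps
    the fraction (1 - theta) of its probability, since a fraction theta
    of its elements is expected to be sampled and reconnected.
    """
    return {
        name: (p + theta * (1.0 - p)) if name == reinforced else p * (1.0 - theta)
        for name, p in probs.items()
    }


if __name__ == "__main__":
    before = {"A_i": 0.2, "A_j": 0.5, "A_k": 0.3}  # illustrative starting values
    after = reinforce(before, "A_i", theta=0.25)
    gain = after["A_i"] - before["A_i"]
    loss = (after["A_j"] + after["A_k"]) - (before["A_j"] + before["A_k"])
    print(after)
    print("gain for A_i:", gain, "change for the others:", loss)
    print("total probability:", sum(after.values()))
```

The gain for the reinforced class and the combined loss for the others cancel, so the probabilities continue to sum to unity without any separate extinction postulate.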

With a provisional model in hand, we are naturally eager to try 
some applications. One technicality must be taken care of first how- 
ever. Our assumptions concerning the “hows” of acquisition are com- 
pletely summarized in Eq. (5), but the latter cannot be used directly 
to describe empirical acquisition curves. Equation (5) is a difference 
equation, a mathematical form which may be destined to play much 
the same role in behavioral theories that differential equations have in 
physics. Using this mode of expression, we can state basic laws and 
assumptions in general terms, without reference to the particular 
conditions peculiar to individual experiments. Then starting with the 
general law in the form of a difference equation, and taking due ac- 
count of the experimental conditions characterizing a given situation, 
we can derive expressions which relate observables (e.g., response rate 
and time, latency and number of reinforcements) by explicit formulas 
suitable for comparison with empirical curves. If the rate of change in 
response probability were given by the differential equation 


$$ \frac{dp}{dn} = \theta(1 - p) $$


we could integrate the latter and obtain a formula expressing p as 4 
growth function of n. Happily, it turns out that we can carry out an 
analogous mathematical operation on difference equations such as 
our Eq. (5). The operation is called finite integration, or more gen- 
erally, “‘solving” the difference equation. The derivation is not difficult 
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[for details, cf. 13, 37, 60, 82] and yields the formula 
p=1-(-p)0 = O (6) 


where pn» represents probability of response A; on the nth acquisition 
trial.? Just as we did before in the case of Eq. 3, we can substitute from 
Eq. (1) or (2) into Eq. (6) and obtain learning functions in terms of 
response rate or mean latency.’ The fact that these functions are in- 


2 For our present purposes it really does not matter how we arrive at Eq. (6),
provided we can prove that it is correct. We had better run through the proof
briefly in order to leave no gaps in the route of anyone who is trying to follow
the argument closely. We start by considering the hypothesis that Eq. (6) is
correct. If so, response probability after the next trial should be given by

$$p_{n+1} = 1 - (1 - p_1)(1 - \theta)^{n}$$

and by subtraction we should get the change in probability resulting from a single
acquisition trial

$$\begin{aligned}
\Delta p_n &= p_{n+1} - p_n \\
&= 1 - (1 - p_1)(1 - \theta)^{n} - 1 + (1 - p_1)(1 - \theta)^{n-1} \\
&= (1 - p_1)\left[(1 - \theta)^{n-1} - (1 - \theta)^{n}\right] \\
&= (1 - p_1)(1 - \theta)^{n-1}(1 - 1 + \theta) \\
&= \theta(1 - p_1)(1 - \theta)^{n-1}
\end{aligned}$$

Referring back to Eq. (6) we see that

$$(1 - p_1)(1 - \theta)^{n-1} = 1 - p_n$$

and, therefore (by substitution into the last line above)

$$\Delta p_n = \theta(1 - p_n)$$

which is precisely what we should have if Eq. (6) is the solution of Eq. (5).

3 By further mathematical operations we can if desired change the independent
variable from trials to time. To obtain a new difference equation representing the
expected change in response probability during a unit time interval, we multiply the
right side of Eq. (5) by the momentary response probability p, yielding

$$\Delta p = \theta p(1 - p) \tag{7}$$

This mathematical manipulation amounts to assuming that p is changed at each
reinforcement in accordance with Eq. (5) but remains constant during intervals
between successive responses. Equation (7) is not so easily solved as Eq. (5), but
it can be shown [cf. 12, 19 for details] that when θ is small an approximate
solution is given by

$$p_t = \frac{1}{1 + \dfrac{1 - p_1}{p_1}\, e^{-Bt}} \tag{8}$$

where p_t is response probability at time t, e is the base of natural logarithms, and
B = θ/h (as before, h represents the minimum time required for execution of the
response whose probability is p). Substituting r for p by means of Eq. (1), we now can
rewrite Eq. (8) in terms of rate of responding, and the cumulative form of this rate
curve, illustrated in Fig. 2, should describe cumulative acquisition curves in operant
conditioning situations.
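
The claim that the logistic form of Eq. (8) approximates Eq. (7) when θ is small can be explored numerically. The sketch below rests on two assumptions that are not spelled out in the footnote: each application of Eq. (7) is taken to correspond to an elapsed time of h, and t is measured from the moment at which response probability equals p_1. The function names and the numerical values are illustrative only.

```python
import math

def iterate_eq7(p1, theta, n_steps):
    """Apply Eq. (7), delta p = theta * p * (1 - p), once per unit time interval."""
    p = [p1]
    for _ in range(n_steps):
        p.append(p[-1] + theta * p[-1] * (1.0 - p[-1]))
    return p

def logistic_eq8(p1, theta, h, t):
    """Eq. (8) with B = theta / h (t = 0 is assumed to be the starting point)."""
    B = theta / h
    return 1.0 / (1.0 + ((1.0 - p1) / p1) * math.exp(-B * t))

def max_gap(p1, theta, h=1.0, n_steps=200):
    """Largest absolute difference between the stepwise values and Eq. (8)."""
    steps = iterate_eq7(p1, theta, n_steps)
    return max(abs(p - logistic_eq8(p1, theta, h, k * h))
               for k, p in enumerate(steps))

gap_small_theta = max_gap(p1=0.05, theta=0.05)   # illustrative values
gap_large_theta = max_gap(p1=0.05, theta=0.5)
print(gap_small_theta, gap_large_theta)
```

Under these assumptions the discrepancy shrinks as θ is made smaller, which is the sense in which Eq. (8) is described as an approximate solution.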
deed appropriate in form to describe empirically obtained curves is
illustrated in Fig. 5; curves derived from Eq. (6) are fitted to curves of
median running time against number of reinforcements from a runway
experiment, mean bar-press duration and mean response rate
against number of reinforcements from bar-pressing experiments, and
number of errors against reinforced trials from a T-maze experiment.⁴
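
The relation between the difference equation, Eq. (5), and its solution, Eq. (6), can be checked numerically as well as by proof. In the sketch below the function names and the values of p_1 and θ are illustrative choices, not quantities taken from any experiment.

```python
def iterate_eq5(p1, theta, n_trials):
    """Apply Eq. (5), delta p_n = theta * (1 - p_n), trial by trial."""
    p = [p1]
    for _ in range(n_trials - 1):
        p.append(p[-1] + theta * (1.0 - p[-1]))
    return p

def closed_form_eq6(p1, theta, n):
    """Eq. (6): p_n = 1 - (1 - p_1)(1 - theta)^(n - 1)."""
    return 1.0 - (1.0 - p1) * (1.0 - theta) ** (n - 1)

p1, theta = 0.1, 0.2   # illustrative values only
stepwise = iterate_eq5(p1, theta, 20)
closed = [closed_form_eq6(p1, theta, n) for n in range(1, 21)]

# The two routes should agree up to floating-point rounding.
max_gap = max(abs(a - b) for a, b in zip(stepwise, closed))
print(max_gap)
```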


[Figure 5: four panels of fitted acquisition curves. Horizontal axes: number of
reinforcements; 100-trial block; trial; trial.]

Fig. 5. Four kinds of simple acquisition data described by functions derived from
statistical learning theory: curves A, B, C, and D, respectively, represent rate of
bar pressing vs. number of reinforcements, mean bar-press duration per 100-trial
block, mean cumulative errors vs. trials in T-maze learning, and median running
time vs. trials in a runway experiment.


Let us review our progress briefly. We started out to formulate a 
model which would provide quantitative descriptions of operant 
conditioning curves and at the same time account for deviations from 
the typical curve forms. In some respects we now stand fairly well. 
From our provisional model it has been possible to derive acquisition 
functions, not only for the bar-pressing experiment, but for a variety 

4 I am indebted to Dr. Solomon Weinstock for the running-time data. The
bar-pressing data come from unpublished studies by the writer, and the T-maze data from
a study by Estes and Lauer [36].
of elementary learning situations. Further, by virtue of the assumption
that learning occurs on an all-or-none basis relative to the sample of 
stimuli effective on any trial, the model is able to provide both for 
sudden and for gradual learning, both for variability among individual 
protocols and for orderly group trends. When we take a mean curve 
from a group of animals in a maze experiment or cumulative responses 
over time in a bar-pressing experiment, the trial variation averages out, 
and we obtain the typical smooth acquisition curve. We have shown 
how the forms of the average curves can be deduced from the model; 
by further mathematical development, we can also derive functions 
for the standard deviations around these curves and even, in some 
cases, descriptions of the complete distribution of response measures 
on any trial [for details see 13, chap. 14; 32]. 
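
The way all-or-none conditioning of sampled elements can yield both abrupt individual protocols and a smooth group curve can be illustrated by a small simulation. The sketch below is only one possible reading of the model and rests on assumptions introduced for the illustration: each stimulus element is sampled independently with probability θ on every trial, every sampled element becomes conditioned on that trial, response probability equals the proportion of conditioned elements, and no element is conditioned at the outset (p_1 = 0). All names and values are hypothetical.

```python
import random

def simulate_subject(n_elements, theta, n_trials, rng):
    """Return the proportion of conditioned elements at the start of each trial."""
    conditioned = [False] * n_elements        # assumes p_1 = 0
    curve = []
    for _ in range(n_trials):
        curve.append(sum(conditioned) / n_elements)
        for i in range(n_elements):
            if rng.random() < theta:          # element i is sampled on this trial
                conditioned[i] = True         # all-or-none conditioning
    return curve

def mean_curve(n_subjects, n_elements, theta, n_trials, seed=0):
    """Average the individual curves over a simulated group."""
    rng = random.Random(seed)
    curves = [simulate_subject(n_elements, theta, n_trials, rng)
              for _ in range(n_subjects)]
    return [sum(c[t] for c in curves) / n_subjects for t in range(n_trials)]

theta, n_trials = 0.2, 15                      # illustrative values
single = simulate_subject(1, theta, n_trials, random.Random(1))
group = mean_curve(2000, 1, theta, n_trials)
predicted = [1.0 - (1.0 - theta) ** (n - 1) for n in range(1, n_trials + 1)]
worst = max(abs(g - q) for g, q in zip(group, predicted))
print(single)
print(worst)
```

With a single element the individual protocol can only take the values 0 and 1, so any learning it shows is sudden, while the group mean should lie close to Eq. (6) with p_1 = 0. Raising `n_elements` makes the individual curves themselves more gradual.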

Appropriately enough, we conclude our reconstruction of the 
formative period of statistical learning theory just as we started—im- 
mersed in the idiosyncrasies of acquisition data. We may have done a 
little more than come a full circle, however, for we return with a small 
but functional theoretical model under our arm, and we find with 
some pleasure that the model can handle a larger collection of facts 
than the very limited set we started with. Also, it will soon become 
clear that en route we have turned up some new and perplexing prob- 
lems which cannot be handled by the 1950 model and which start 
us off on another cycle. 


Reinforcement and Contiguity 


Before proceeding with further adventures, we should try to clear
up some questions the reader may have as to how the 1950 model
stands relative to the explanatory concepts which appear in other
contemporary learning theories. Do we have a contiguity theory or a
reinforcement theory?

In my 1950 paper [19] I attempted to formulate my assumptions 
about the conditions of elementary learning in terms of Guthrie’s con- 
tiguity theory as I understood it. A response was assumed to become 
conditioned to all stimuli affecting the organism at the time of its 
evocation. The term reinforcement was regarded simply as a convenient 
label for procedures which serve to evoke responses in the presence of 
new stimuli and to protect newly formed associations by removing the
stimuli before interfering responses can occur. Presumably one could
derive from the contiguity assumptions an account of the fact that
some events or procedures are effective reinforcers and others are not.
I still find these ideas attractive in a programmatic sense, and in Part
III of this article I shall try to summarize the progress to date in
working out the program. 
To anticipate a bit, I might remark that early in the development
of statistical learning theory, it became clear that research could not 
wait upon an adequate definition of reinforcement in terms of more 
primitive concepts. In order to make empirical applications of the 
developing theory, it was expedient to take advantage of the fact that 
there are a number of commonly studied learning situations for which 
all experimenters can agree that certain operations are reinforcing, 
i.e., lead to acquisition when appropriately correlated with response 
occurrences. Thus it is natural enough that our quantitative theory 
yields statements, not about what reinforcement is, but about how it 
operates. All theorems (laws) about learning derivable from the theory 
take the form of “if . . . then” statements; e.g., “if a given operation 
yields an increment in response probability satisfying Eq. (5), then 
repeated reinforcement of a response via this operation will generate 
a mean learning curve of the form described by Eq. (6).” Later on we 
shall see that deductions of this type sometimes yield predictions of 
genuinely new empirical phenomena. To the extent that such predic- 
tions are both intrinsically interesting and empirically confirmed, the 
theory is accomplishing precisely what we ask of a theory and no one 
should complain if we continue to operate on the same descriptive 
level indefinitely. As the theory is applied to a variety of experimental 
situations primarily to refine and extend our account of how learning 
proceeds under various schedules, conditions, and contingencies of 
reinforcement, we incidentally accumulate considerable information 
as to the kinds of operations which serve as reinforcers. 

I am convinced that given a sufficiently austere temperament one 
could develop a fruitful descriptive theory without ever straying into 
speculative bypaths. The proposition is largely academic though, for 
where are we to find the theorist who never finds himself wishing, 
albeit guiltily, for a deeper understanding of some empirical phenomenon
than can be provided by the most detailed quantitative description?
The student of learning is especially beset by temptations. The
fact that a wide variety of superficially diverse operations turn out to
have quantitatively equivalent effects on learning demands some kind
of explanation, and almost every learning theorist has tried his hand at
answering this call. In my own case and that of my associates, the drive for
explanation has taken the form of continuing attempts to derive the
independently established quantitative laws of reinforcement from
more primitive assumptions concerning contiguity, interference, and
stimulus sampling.

For certain areas of application, especially classical conditioning 
and the simpler types of verbal learning (e.g., paired-associate learn- 
ing), the contiguity principle seems to prescribe the conditions of 
reinforcement more directly than does any alternative theory. In
more complex situations involving rewards, punishments, and the like, 
contiguity interpretations become more speculative and really cannot 
be defended against alternative theories on grounds of parsimony. 
Whether the interpretations are worth while comes down to a question 
of whether they have fruitful consequences in the theoretical and 
experimental analysis of new empirical situations. As we follow the 
more recent development of statistical learning theory in these pages, 
we shall see a number of instances in which the contiguity principle 
has been useful in mediating extensions and applications of the theory 
to new problems and situations. I believe it is the function of contiguity 
theory as a tool of analysis that is responsible for its continued viability. 

In brief, our answer to the question “reinforcement or contiguity?” 
is simply, “both.” Whether we can have our cake and eat it and still
not grow fat is a much more difficult question. Some materials which 
may be of use in formulating an answer to it will be provided in the 
following sections. 


GROWTH OF STATISTICAL LEARNING THEORY SINCE 1950 


The year 1950 marks a turning point in our narrative. In the pre- 
ceding section we could speak of “the theory” or “the model” with no 
real ambiguity. We had only to deal with a single clearly defined
mathematical model and its range of application was limited to an almost
as clearly defined class of elementary acquisition experiments. In 1957, 
the problem of communication is much more formidable. During the 
intervening years, many people have had a hand in the enterprise. 
Many generalizations and extensions of the original theory have been 
attempted—some proving obviously fruitful, some as obviously nega- 
tive, some indeterminate. Giving a neatly organized account of these 
developments is made difficult by the fact that they have not been 
directed by any master plan. Rather, we have been led along by a 
series of problems. Some of these are primarily empirical problems,
some primarily theoretical; the principal feature that they have in
common is that none is ever solved without first giving rise to at least
two more. For expository purposes it will be convenient to organize the 
following account around two main groups of problems: (1) those 
arising from attempts to provide direct experimental tests of theoretical 
assumptions and (2) attempts to generalize the original theory by 
removing various simplifying assumptions and to extend it to new 
empirical areas. 

Tests of theoretical assumptions. Invariance of parameters. As I
have illustrated in preceding sections, the experimental application
of a quantitative theory involves a number of steps, starting with the
abstract and formal assumptions of the theory and terminating with 
derivations of predicted functional relationships among observables. 
The last phrase must be qualified, however: the dependent and inde- 
pendent variables in the derived functions are operationally definable 
in terms of observables, but the same is not always true of the parameters.
Consider, for example, Eq. (6) above. As written, the dependent
variable p_n is purely theoretical; but since we have assumed a probability
model for response variables, we know that if Eq. (6) is to be
applied, say, to a T-maze experiment, the theoretical variable p_n can
be replaced by an observable f_n, the expected proportion of occurrences
of a given response by a group of similar organisms on the nth
acquisition trial. With this identification agreed upon, we are in a
position to provide numerical values for all of the terms in Eq. (6)
except the parameter θ.

Now we must face up to the question, postponed earlier, of precisely 
how we are to evaluate θ in any particular experiment. We know that
in terms of our conceptual model, θ represents rate of learning in terms
of the average proportion of stimulus elements sampled per trial from 
the stimulus set, or population, representing a given stimulating situa- 
tion. The conceptual definition is not of much help, however, for we 
have no means of identifying individual stimulus elements and deter- 
mining when they are sampled. From the definition we should expect 
that, other things equal, the value of θ should depend on the observable
variability in the stimulating situation, but it must also depend in
part on characteristics peculiar to the individual organism. One could 
hardly avoid assuming, for example, that a visual defect would influ- 
ence the organism’s sampling of its visual environment, or that grada- 
tions in auditory acuity would be reflected in rate of sampling of 
auditory cues. 

These considerations lead to a methodological restriction which I 
believe will hold for all behavioral theories. Any theory which predicts 
individual behavior in quantitative detail will contain parameters 
whose values depend at least in part on characteristics peculiar to the 
individual organism, and in general, the values of these parameters 
can be determined only from behavioral data. In principle, one must 
concede that ultimately the values of some behavioral parameters may 
turn out to be predictable from physiological or biophysical theories, 
but there appears to be no danger of arriving at this stage in the foreseeable
future. To return to our illustrative situation, it becomes clear
that the value of θ for any particular organism or group of organisms
must be obtained by statistical estimation, i.e., curve fitting or some 
equivalent procedure. 
All estimation procedures amount roughly to this: given a set of
empirical data, e.g., any of the empirical curves plotted in Fig. 5, 
which should be describable by a given theoretical function, one selects 
some criterion of goodness of fit, e.g., least squares or maximum likeli- 
hood, then by appropriate statistical procedures selects the parameter 
values which yield the best agreement between the theoretical function
and the data. (In Fig. 5, the value θ = .19 yields the best fit of
Eq. (6) to the empirical curve C according to a least squares criterion.)
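
A least squares estimate of θ of the kind just described can be sketched in a few lines. The data below are synthetic and noise-free, generated from Eq. (6) itself purely for illustration; they are not the data of Fig. 5. The function names, the starting probability of .5, and the simple grid search are all assumptions of the sketch and stand in for whatever fitting procedure an investigator might actually use.

```python
def eq6(n, p1, theta):
    """Eq. (6): predicted response probability on trial n."""
    return 1.0 - (1.0 - p1) * (1.0 - theta) ** (n - 1)

def sse(theta, p1, observed):
    """Sum of squared deviations between the data and Eq. (6)."""
    return sum((y - eq6(n, p1, theta)) ** 2
               for n, y in enumerate(observed, start=1))

def fit_theta(observed, p1, grid_size=999):
    """Grid search over theta = 0.001, 0.002, ..., 0.999 for the smallest SSE."""
    candidates = [(k + 1) / (grid_size + 1) for k in range(grid_size)]
    return min(candidates, key=lambda th: sse(th, p1, observed))

# Hypothetical proportions for 20 trials, generated with theta = 0.19.
synthetic = [eq6(n, 0.5, 0.19) for n in range(1, 21)]
theta_hat = fit_theta(synthetic, p1=0.5)
print(theta_hat)
```

With real data the minimum would not fall exactly on the generating value, and a maximum likelihood criterion could be substituted for the squared-error function without changing the structure of the search.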

The necessity for evaluating parameters from the data to be de- 
scribed raises a question of circularity in the theory. In what sense, a 
critic may ask, are we making theoretical predictions about learning if 
application of the theory always includes this step of parameter 
estimation? The point is well taken, and deserves detailed considera- 
tion. In order to answer the question fully, it is necessary to distinguish 
three levels of prediction, the first involving only extrapolation; the 
second, overdetermination; and the third, situational invariance. 

The first form of prediction is the most that can be accomplished 
when we fit a function such as Eq. (6) to a single set of empirical data. 
If all the data are used in evaluating the parameter θ, then we really
cannot be said to be predicting the behavior in any sense. All that can 
be claimed is that the application provides a test of the theory to the 
extent that it gives the theory a chance to be wrong; it might turn out 
to be impossible to describe the data with the given function by any 
choice of parameter values. (Try fitting the acquisition curves of Fig. 5 
with y = 0”@*' or y = a + bn.) Therefore, if a reasonable description 
of the data is achieved, one must concede that the result provides some 
mild support for the theory (“mild” because the result will be of little
help in choosing between this theory and alternative theories that may 
come under consideration). If only part of the data is used in estimating
the parameter, then a genuine prediction is possible. If, for example,
in an experimental application of Eq. (6), the value of θ were
determined by using only data from the first few trials of a learning
series, then the curve for the remainder of the series could be considered
as testing a theoretical prediction. This is a rather low-level
and unglamorous form of prediction, but for all that it should not 
be taken too lightly; we shall see some examples later on in which 
predictions of this sort are so markedly superior to what could be accomplished
without a theory that they represent no small achievement.
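
The extrapolative form of prediction can be sketched as follows: θ is estimated from the first few trials only, and the remainder of the series is then compared with the curve generated from that estimate. The data are made up for the illustration (Eq. (6) with a small alternating perturbation standing in for sampling error), and the names, the number of early trials, and the grid search are assumptions of the sketch.

```python
def eq6(n, p1, theta):
    """Eq. (6): predicted response probability on trial n."""
    return 1.0 - (1.0 - p1) * (1.0 - theta) ** (n - 1)

def fit_theta(trials, observed, p1, grid_size=999):
    """Least squares estimate of theta by grid search over (0, 1)."""
    candidates = [(k + 1) / (grid_size + 1) for k in range(grid_size)]
    def sse(theta):
        return sum((y - eq6(n, p1, theta)) ** 2
                   for n, y in zip(trials, observed))
    return min(candidates, key=sse)

p1 = 0.5
trials = list(range(1, 21))
# Hypothetical data: Eq. (6) with theta = 0.15 plus a small alternating error.
observed = [eq6(n, p1, 0.15) + 0.01 * (-1) ** n for n in trials]

# Estimate theta from trials 1-5 only, then predict trials 6-20.
theta_early = fit_theta(trials[:5], observed[:5], p1)
held_out_error = max(abs(observed[n - 1] - eq6(n, p1, theta_early))
                     for n in trials[5:])
print(theta_early, held_out_error)
```

The held-out error is the quantity of interest here, since the later trials played no part in choosing the parameter value.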

When a quantitative theory is in the process of development, the 
first tests of any newly derived functions are usually of the simple 
curve-fitting type. Once this first hurdle has been passed, however, we 
are always eager to move on to higher-level forms of prediction which 
involve an element of overdetermination. We speak of a parameter as
“overdetermined” if its value can be empirically determined in two or
more independent ways. We have already found (cf. Formulation of a
probabilistic response model) that the parameter h of our response
model and the parameter θ of our acquisition model are overdetermined.
Indeed the structure of our theory is such that not only these
but any additional parameters which may be introduced in the course 
of further developments and extensions will similarly be overdeter- 
mined in the sense of entering into numerous predicted functional 
relationships among operationally defined variables. 

The determination of a parameter by two or more independent 
procedures has always been regarded as one of the critical tests of 
physical theories, and there seems to be no reason why the same will 
not be true of behavioral theories. Tests of this sort have been among 
the primary objectives of several research studies involving applica- 
tions of the statistical model for acquisition. In some cases [11, 27], the 
results have been clearly confirmatory; more often [1, 8, 37, 89], the 
results have included a mixture of successes and failures, thus leading 
either to reconsideration of the theoretical assumptions or to further 
analysis of the experimental situation, or both. For illustrative purposes
we might consider briefly an instance of the latter sort which has
arisen in connection with certain human learning experiments. 

When acquisition functions for the simple two-alternative verbal 
conditioning experiment are derived from the present theory (see [37]
for details), the parameter θ turns up both in the equation for the
mean acquisition curve, Eq. (9) below, and in the formulas for the 
conditional probabilities of a given response on trials following the two 
types of reinforcing events. In several experiments [1, 37] estimates of 
θ obtained from mean acquisition curves and estimates obtained from
conditional relative frequencies have failed to agree. These disparities 
gave rise first of all to mild chagrin on the part of the theorist, then to 
more constructive reactions of two kinds. Noting that all of the experi- 
ments in question had been conducted with temporally massed trials, 
Estes and Straughan [37] hypothesized that some extraneous factor 
associated with massed trials might be responsible for sequential 
effects in the data. Such effects have, for example, been found in 
psychophysical studies [15, 90]. If so, then more adequate trial spacing 
might tend to minimize the extraneous factors and thus eliminate the 
disparity. The hypothesis has not been decisively tested, but the evidence
so far available [27, 89] favors the conclusion that the disparity
is reduced by more adequate trial spacing. 

The question also arises whether the theory might not be extended 
to take account of the sequential effects observed with massed trials.
An attempt to handle one possible source of sequential effects has been
reported by Straughan [88]. Straughan considered the possibility
that there might be a certain amount of lag, or inertia, in the stimulus 
sampling process, so that a stimulus element which has been sampled 
on a given trial would be more likely to be sampled on the following 
trial than an element which has not. Introducing this “lag” assumption
into the acquisition model, Straughan showed that predictions
concerning forms and asymptotes of acquisition curves are unaffected 
but that sequential effects of the kind observed in massed-trial experi- 
ments are predictable from the revised model. 

By exploiting the notion of overdetermination, it is possible to 
conduct exceedingly stringent tests of a model while working entirely 
within a single experimental situation. Yet even if these tests are suc- 
cessfully passed, our theoretical ambitions are not satisfied. Next we 
begin to hope that predictions may be possible on a still higher level of 
generality, that involving situational invariance. That is, we would like 
to evaluate the parameters of the model in one situation and then use 
the obtained values to generate completely a priori predictions of 
behavior in new situations. This is always a difficult step; in order to 
achieve it, one needs not only an adequate model but also a sufficiently 
detailed understanding of the variables involved in a number of dif- 
ferent experimental situations. Failing the latter, the best model will 
have little chance of succeeding at cross-situational prediction. 

The first attempt to achieve the third level of prediction with the 
present model was carried out on a very modest scale. In preliminary 
investigations it had been found that functions derived from the model 
provided satisfactory descriptions of bar-pressing acquisition curves, 
both for individual animals and for groups. The acquisition function 
contained the two free parameters which have been previously mentioned,
the sampling parameter θ and the minimum response time h,
and it was a clear implication of the theory that other things equal, the 
values of these parameters should be independent of the particular 
response classification. Therefore, once these parameters were evalu- 
ated for a group of animals during acquisition of one bar-pressing 
habit, it should be possible to predict in advance the detailed course of 
acquisition of a second bar-pressing habit. Using an apparatus which
contained two bars in different locations, the indicated experiment
was carried out with clearly confirmatory results [20]. This instance
of successful prediction, even on a very limited scale, was the first
concrete evidence that we might be on the path toward a theory of
some generality.

A second type of experiment which had been sufficiently studied to 
provide a promising test situation was T-maze learning in the rat. If
the model gives an adequate representation of T-maze learning, then
not only should an appropriate function derived from the model,
namely, Eq. (6), describe acquisition of a particular maze response, 
e.g., turning left, but once the parameter θ has been evaluated, the
model should provide a priori predictions of the course of learning 
during a series of reversals. However, judging from the extant litera- 
ture on simple reversal learning [e.g., 50, 71, 77, 85] prospects of suc- 
cessful prediction were anything but bright. All previously reported 
studies agree that rate of learning improves during successive reversals; 
if so, then the value of θ, which determines the slope of the learning
curve, could not remain constant over a series of reversals, and the 
theoretical predictions would be bound to fail. 

Critical study of the literature revealed one possible loophole; all 
the preceding studies had been conducted with temporally massed 
trials, and we already had some evidence from verbal conditioning 
experiments that some of the simplifying assumptions of the model 
were not well satisfied under these conditions. Consequently, D. W. 
Lauer and the writer planned and carried out a study of successive 
reversals in the T-maze situation with a design which permitted a 
comparison of learning rates under widely spaced trials with learning 
rates under closely massed trials [36]. Since our spaced condition involved
single daily trials, it took over four months to run through five
series. The results, however, were enlightening. The usual progressive
improvement in learning rate was observed in our massed-trial groups, 
but not in our spaced-trial group. For the latter, the value of the slope 
parameter remained so clearly constant over successive reversals 
that values could be estimated from the data of earlier series and used 
to predict the learning curves of later series.

Attempts at series-to-series prediction in simple two-choice experi- 
ments with human subjects have run into similar problems. When 
trials are temporally massed, rate of learning characteristically in- 
creases over successive series [37], but as spacing is increased, this 
effect is reduced and the slope parameter θ becomes more nearly
recoverable from one series to the next [27, 33, 89]. 

Effects of reinforcement. The achievement of our acquisition theory 
that has aroused the widest interest, extending to statisticians, game 
theorists, and economists, as well as psychologists, is the successful 
prediction of asymptotic response probabilities under a variety of 
random reinforcement schedules and in both animal and human cases.
These predictions are generated by the acquisition model sketched 
above, with occasional guidance from contiguity principles. The type 
of reasoning involved can be illustrated nicely in terms of the well- 
known T-maze experiment of Brunswik [9], with which is associated 
the origin of the now popular term probability learning. In Brunswik’s 
experiment, rats were run under random reinforcement with a correction
procedure. Under 75 per cent reinforcement, for example, food
was available on one side of the T with probability .75 and on the 
other side with probability .25. Since in this procedure the animal is 
permitted to correct errors, each trial terminates with a reinforced run 
to one side of the maze; and according to contiguity-interference 
principles, the response which occurs last on a trial should become 
connected to all cues sampled on the trial. Therefore, our acquisition 
function, Eq. (5), should describe the change in probability of the 
terminal response on each trial. Knowing the position of the food on 
each trial, we are in a position to make repeated application of Eq. (5) 
and thus predict the course of learning over a series of trials. This trial 
by trial computation would be laborious, however, and we prefer to 
take a shortcut. Given that on any trial one of the two alternative 
responses is reinforced with probability π and the other with probability
1 − π, it can be proved by mathematical induction that p_n, the expected
probability of the first response after n trials, is given by the simple
formula


$$p_n = \pi - (\pi - p_1)(1 - \theta)^{n-1} \tag{9}$$


Since the value of θ is between 0 and 1, the term (1 − θ)^(n−1) must approach
zero for large n, and we see at a glance that asymptotically the
expected probability of a given response should be equal to the proba- 
bility of reinforcement. 
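
The shortcut can be compared with the laborious route it replaces. The sketch below averages the one-trial change of Eq. (5) over the two reinforcing events and repeats the computation trial by trial; the result is then set beside the closed form of Eq. (9). It assumes that reinforcement of the alternative response lowers the probability of the first response by θp, which is Eq. (5) applied to the alternative response. Names and numerical values are illustrative.

```python
def expected_curve(p1, theta, pi, n_trials):
    """Average Eq. (5) over the two reinforcing events, trial by trial."""
    p = [p1]
    for _ in range(n_trials - 1):
        up = p[-1] + theta * (1.0 - p[-1])   # first response reinforced
        down = p[-1] - theta * p[-1]         # alternative response reinforced
        p.append(pi * up + (1.0 - pi) * down)
    return p

def eq9(p1, theta, pi, n):
    """Eq. (9): p_n = pi - (pi - p_1)(1 - theta)^(n - 1)."""
    return pi - (pi - p1) * (1.0 - theta) ** (n - 1)

p1, theta, pi = 0.5, 0.1, 0.75               # illustrative values
stepwise = expected_curve(p1, theta, pi, 200)
closed = [eq9(p1, theta, pi, n) for n in range(1, 201)]

max_gap = max(abs(a - b) for a, b in zip(stepwise, closed))
asymptote_gap = abs(stepwise[-1] - pi)
print(max_gap, asymptote_gap)
```

Both differences should be negligible: the first because the two computations are algebraically the same, the second because the term in (1 − θ) has died out by the end of the series.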

The results of Brunswik’s experiment were not obviously incom- 
patible with our prediction, but his series of trials was so abbreviated 
that one could not be confident his animals had reached asymptotes. In 
order to provide a more decisive test, D. W. Lauer and the writer have 
more recently conducted an experiment which essentially replicated 
Brunswik’s π = .75 condition except that our animals were run one
trial per day over a much longer series. This study [65] yielded mean
asymptotic response probabilities (estimated from relative frequency 
of a given response over the last 16 days of a 56-day series) in close 
agreement with the theoretical predictions from Eq. (9). 

In a very similar manner the model can be applied to the type of 
verbal conditioning experiment originated by Humphreys [58]. In this 
situation the experimenter assigns fixed probabilities, π and 1 − π, to a
pair of alternative events; these events might, for example, be occurrences
of one or the other of a pair of “reinforcing” lights, saying of
“right” or “wrong” by the experimenter, appearance of heads or tails 
on the toss of a biased coin. On each trial the subject predicts which 
event will occur and then observes the actual outcome. On the assumption
that occurrence of a given event reinforces the response of predicting
that event, one again arrives at Eq. (9) as the predicted learning
function—for derivation see [37]. 

The interesting prediction, implied by Eq. (9), that mean probabil- 
ity of predicting an event should be asymptotically equal to the actual 
probability of the event did not have to wait long for experimental 
test. During the same year in which Eq. (9) was first derived, two 
independent studies by Grant, Hake, and Hornseth [43] and by Jarvik 
[59] appeared, both confirming the predicted asymptotic relationship. 
To check further on the generality of the result, J. H. Straughan and 
the writer proceeded during the following year to study both original 
acquisition of a guessing response and reacquisition following a shift 
in probability of reinforcement; we obtained the predicted asymptotic 
relationship in both cases, and also demonstrated that Eq. (9) 
describes not only group curves but learning curves of individual 
subjects [37]. Equation (9) has been generalized and applied to guess- 
ing experiments involving more than two alternatives by Neimark [76] 
and Detambel [18] with uniformly confirmatory results.* 

The last two studies represent a step forward, for the most con- 
vincing tests of a theory are always those which involve new experi- 
mental arrangements. Going from two to three alternatives in the 
guessing situation may seem too minor a generalization to qualify 
as an instance of “predicting new facts,” but further study of the model 
suggests the possibility of dealing with some rather more radical 
innovations. Initially we thought only in terms of constant probabilities 
of reinforcement simply because experiments on partial reinforcement 
had traditionally been limited to this case. There is nothing in our 
theoretical assumptions, however, to restrict us to situations where 
probability of reinforcement is constant over trials. The assumptions 
specify how response probability changes on any given trial; therefore 
if π in Eq. (9) were replaced by some function π_n giving probability
of reinforcement as a function of trials, it should be a purely mathe- 
matical problem to deduce the predicted course of learning. The 
mathematical problem has indeed been solved for a wide variety of 
cases [30]. The gist of the theoretical result is that response probability 
“tracks” probability of reinforcement much as one tracks a moving 
target; if $\pi_n$ is any reasonably orderly function, then asymptotically 
the curve of $p_n$ is described by a function of similar form. It can be 
predicted, for example, that if probability of reinforcement varies 
linearly with trials (i.e., $\pi_n$ is a straight line) then the curve of response 
probability will approach a straight line with the same slope. M. Johns 
and the writer have recently tested this prediction in a modified verbal 
conditioning situation, with results typified by the empirical curve 
shown in Fig. 6. 

* Results of a study, “Probability learning with two and three choices,” reported 
by R. A. Gardner at the 1957 meetings of the Midwestern Psychological Association 
suggest that under some, as yet incompletely specifiable, conditions, asymptotic 
response probabilities for the three-choice case tend to overshoot the predicted values. 
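
The tracking property can be checked with a few lines of arithmetic. The sketch below is an illustration only and is not the solution given in [30]. It assumes the mean recursion p_{n+1} = (1 - θ)p_n + θπ_n, obtained by averaging the reinforcement and nonreinforcement operators with weights π_n and 1 - π_n; the schedule, θ, and starting value are hypothetical.

```python
def track(pi_schedule, theta, p1):
    """Iterate the assumed mean recursion p_{n+1} = (1 - theta) * p_n + theta * pi_n."""
    p = [p1]
    for pi_n in pi_schedule[:-1]:
        p.append((1 - theta) * p[-1] + theta * pi_n)
    return p

theta = 0.2
slope = 0.005
# Hypothetical schedule: probability of reinforcement rises linearly from .20 to .80
pi_schedule = [0.2 + slope * n for n in range(121)]
p = track(pi_schedule, theta, p1=0.5)

late_slope = p[-1] - p[-2]       # slope of the response curve on the last trials
lag = pi_schedule[-1] - p[-1]    # distance by which response probability trails pi_n
print(round(late_slope, 4), round(lag, 4))
```

On late trials the response curve has the same slope as the schedule and trails it by a constant amount, slope/θ under this recursion, so a larger θ means closer tracking.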

Another natural application of the random reinforcement model 
arises if we permit the probability of reinforcement to depend on the 
response made by the subject. In conditioning terminology, this 
amounts to going over from the classical to the instrumental paradigm. 
Suppose, for example, that in a verbal conditioning situation the sub- 
ject’s task is to predict which of a pair of alternative events $E_1$ and $E_2$ 
will occur on each trial. In the type of experiment previously discussed, 
which we will call the noncontingent case, the actual probability of 
$E_1$ is $\pi$ and the probability of $E_2$ is $1 - \pi$ regardless of the subject’s 
response. In the type of experiment now to be considered, the con- 
tingent case, the events $E_1$ and $E_2$ have one set of probabilities, say 
$\pi_{11}$ and $1 - \pi_{11}$, if the subject’s response on the trial is a prediction of 
$E_1$, but a different set of probabilities, $\pi_{21}$ and $1 - \pi_{21}$, if the subject 
predicts $E_2$. When the model is applied to this case [23, 30], the pre- 
dicted learning curve has the same form as that of Eq. (9) but has the 
asymptote 

$$p_\infty = \frac{\pi_{21}}{1 - \pi_{11} + \pi_{21}} \tag{10}$$
where $p_\infty$ represents the probability, after a large number of trials, of 
the response corresponding to reinforcing event $E_1$. As in the noncon- 
tingent case, the model for the contingent case can be generalized to 
any number of alternatives [30]. In all instances, noncontingent and 
contingent, two or more than two alternatives, it turns out that ex- 
pected asymptotic response probability depends only on the probabili- 
ties of reinforcement, and thus can be predicted in advance of an ex- 
periment. Predictions of this sort for contingent cases with two and 
three alternatives have been confirmed experimentally by A. Bro 
and the writer in a camouflaged⁶ guessing experiment. Experiments 
by Detambel [16, 18] and Neimark [76] could also be taken as yield- 
ing confirmatory evidence, but they involve a procedural complication 
which will be discussed below. 
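
The asymptote of Eq. (10) can be verified by iterating the mean learning curve. This is a sketch under the same linear-operator assumption as before, not the derivation in [23, 30]: on each trial the probability that $E_1$ occurs is $p_n\pi_{11} + (1 - p_n)\pi_{21}$, and the mean response probability moves toward that value at rate θ. The numerical values and function names are hypothetical.

```python
def contingent_asymptote(pi11, pi21):
    """Eq. (10): asymptotic probability of the response corresponding to E1."""
    return pi21 / (1 - pi11 + pi21)

def contingent_mean_curve(pi11, pi21, theta, p1, n_trials):
    """Mean learning curve when the probability of E1 depends on the response made."""
    p = [p1]
    for _ in range(n_trials - 1):
        prob_e1 = p[-1] * pi11 + (1 - p[-1]) * pi21   # chance that E1 occurs this trial
        p.append((1 - theta) * p[-1] + theta * prob_e1)
    return p

curve = contingent_mean_curve(pi11=0.6, pi21=0.2, theta=0.1, p1=0.5, n_trials=300)
asymptote = contingent_asymptote(pi11=0.6, pi21=0.2)
print(round(curve[-1], 4), round(asymptote, 4))
```

Setting both probabilities to the same value π reduces Eq. (10) to π, the noncontingent matching result, which is a quick consistency check on the formula.
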

Still another variation on classical random reinforcement proce- 
dures was suggested by the analogy between some of the guessing 
experiments successfully treated by our theory and simple game 
situations. To illustrate the analogy, suppose two individuals, whom 
we will label as E and S, play at matching pennies. If E resolves always 
to turn up heads with probability $\pi$ and tails with probability $1 - \pi$, 
then he can apply Eq. (9) to predict changes in S’s probability of guess- 
ing “heads.” But following this strategy, E will not make any money; 
if he chooses $\pi = .5$ he can expect to break even; otherwise S will be 
able to match him more than half the time. In order to improve his 
chances, E may complicate his strategy, perhaps by making his 
selection of heads or tails on any trial depend on S’s guess on one or 
more preceding trials; for example, E might decide to choose heads 
with probability $\pi_1 = .25$ if S has guessed heads on the preceding trial 


derived [30] which applies to all similar types of reinforcement con- 
tingency. If in a two-alternative learning situation the probability 
of reinforcing event $E_1$ on trial $n$ is equal to $\pi_{11}$ or $\pi_{21}$ according as 
response $A_1$ or $A_2$, respectively, has occurred on trial $n - v$, then 


⁶ The situation was presented to the subject as one involving psychophysical judg- 
ments; the reinforcing events were signals purportedly indicating correct and incor- 
rect choices but actually occurring randomly with probabilities $\pi_{ij}$ and $1 - \pi_{ij}$. A 
report of the results is in preparation. 
$\pi_{21} = 1.0$, in a guessing situation, progressively slower rates of learn- 
ing were obtained as we went from v = 0 to v = 1 and then to v = 2, 
but the terminal relative frequency of $A_1$ responses was close to the 
predicted value 

$$p_\infty = \frac{1}{1 - \pi_{11} + 1}$$

in all three cases (.75, .80, and .74, respectively, for three independent 
groups of subjects). 

Applications of the theory to predict simultaneously the behavior 
of both players in simple two-person games have been worked out and 
tested by the writer [31] and by Atkinson and Suppes [4]. Judging 
from the results of these exploratory studies, it seems quite a promising 
possibility that continued cultivation of the model for acquisition 
under determinate reinforcement will yield accounts of learning in a 
variety of situations which arise in the study of games and economic 
behavior. 

Effects of nonreinforcement. The studies of Detambel and Neimark 
raise an issue which we have delicately skirted so far—namely, what 
to do about nonreinforcement. The T-maze experiment with a cor- 
rection procedure and all the human learning experiments we have 
treated up to this point have an important characteristic in common: 
we can specify without ambiguity precisely what response is reinforced 
on each trial. There are many common learning situations which lack 
this characteristic, e.g., operant conditioning in the Skinner box, maze 
learning with a noncorrection procedure, human learning experi- 
ments which involve nonreward or incomplete information on some 
trials. For convenience in reference, let us refer to these two types of 
experimental arrangement as determinate reinforcement and indeterminate 
reinforcement, respectively. 

From the standpoint of contiguity theory, this classification should 
be expected to be of critical importance. In determinate situations, 
each trial terminates with the occurrence of one or another of a set of 
mutually exclusive reinforcing events ($E_1, E_2, \ldots, E_r$), and each of 
these events, whenever it occurs, evokes a response belonging to the 
corresponding experimentally defined response class in the set 
($A_1, A_2, \ldots, A_r$). In a guessing experiment, for example, the occurrence of 
event $E_j$ is assumed to evoke from the subject a reaction compatible 
with (belonging to the same class as) $A_j$, the response of predicting $E_j$. 
Similarly, if the left goal box of a T maze is open and contains food 
whereas the door of the right goal box is locked, we can be confident 
that the last locomotor response on the trial will be going left and 
approaching the food. Whether or not one is intuitively comfortable 
with our assumptions about the conditions of reinforcement in deter- 
minate situations, the fact remains that they lead to correct predictions. 

By contrast, consider, say, the T maze with a noncorrective pro- 
cedure. Now, on “incorrect” runs, the animal is confined for a given 
interval in the empty goal box. What effect will an unreinforced run 
have on probability of going to the given side? A natural first thought 
is that nonreinforcement of a right turn should reduce its probability 
($\Delta p_R = -\theta p_R$) and therefore increase the probability of a left turn 
[$\Delta p_L = \theta(1 - p_L)$] just as in the correction procedure. If so, then Eq. 
(9) should describe T-maze learning under a noncorrection procedure 
in which food is placed on one side with probability $\pi$ and on the other 
with probability $1 - \pi$. The prediction, however, turns out to be con- 
trary to fact. Several studies⁷ have shown that with a noncorrective 
procedure almost all animals learn to go 100 per cent of the time to 
the more frequently reinforced side. These results have instigated an 
active line of theoretical and experimental investigation devoted to 
ascertaining the precise effects of an unreinforced maze trial. A rather 
oversimplified interpretation of contiguity theory might lead one to 
expect that an unreinforced T-maze trial should yield reinforcement 
of the incorrect turn, since that was the last response made at the 
choice point on the given trial; this assumption would, however, lead 
to even worse predictions than the preceding one. What is obviously 
left out of account is the fact that after entering an empty goal box 
the animal may attempt to retrace while still under the influence of 
some of the stimuli that were effective during the maze run. If we 
assume that such attempts at retracing do occur, and more specifically 
that the probability of the retracing response is equal to the momentary 
probability of the “correct” response, then one can derive the interest- 
ing prediction [19] that, on the average, response probabilities will 
remain unchanged on such trials. Introducing the assumption into our 
model for partial reinforcement we find that the observed results are 
now predicted quite accurately. 
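
The consequence of the retracing assumption can be seen in a small simulation. This is a sketch under stated assumptions, not the derivation in [19]: a rewarded run is treated as a reinforced trial, p_{n+1} = (1 - θ)p_n + θ for the rewarded side, and an unrewarded run leaves response probability unchanged, p_{n+1} = p_n. The values π = .75 and θ = .1, the seed, and the function name are illustrative.

```python
import random

def run_animal(pi, theta, p_start, n_trials, rng):
    """One simulated animal in a T maze run with a noncorrection procedure.

    p is the probability of turning toward side 1, which holds food with
    probability pi on each trial; otherwise the food is on side 2.
    """
    p = p_start
    for _ in range(n_trials):
        turned_to_1 = rng.random() < p
        food_on_1 = rng.random() < pi
        if turned_to_1 and food_on_1:
            p = (1 - theta) * p + theta      # rewarded turn to side 1
        elif not turned_to_1 and not food_on_1:
            p = (1 - theta) * p              # rewarded turn to side 2
        # otherwise the run was unrewarded and p is left unchanged
    return p

rng = random.Random(1)
finals = [run_animal(pi=0.75, theta=0.1, p_start=0.5, n_trials=1000, rng=rng)
          for _ in range(200)]
share_near_one = sum(p > 0.95 for p in finals) / len(finals)
mean_final = sum(finals) / len(finals)
print(share_near_one, round(mean_final, 3))
```

With these rules the expected change on any trial is θp(1 - p)(2π - 1), which is positive whenever π exceeds .5 and vanishes only at p = 0 or p = 1. Nearly every simulated animal therefore ends by choosing the more frequently rewarded side almost exclusively, instead of matching at .75 as Eq. (9) would require.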

⁷ These include studies, all unpublished, by J. C. Stanley [cited in 13, chap. 13], 
D. R. Meyer (reported at the Midwestern Psychological Association meetings in 
1954), R. Hickson and T. Carterette (reported at the Midwestern Psychological Asso- 
ciation meetings in 1955), and S. Weinstock and A. J. North (personal communication). 

* Now the equation 

$$p_{n+1} = (1 - \theta)p_n + \theta$$

is to be applied on trials when a response is reinforced and the equation 

$$p_{n+1} = p_n$$

on trials when it is not. 

Still our treatment of the noncorrection case is not entirely satis- 
fying. To handle acquisition under a partial reinforcement schedule 
we must assume that an unreinforced trial yields no mean change in 
response probability. But this assumption has the embarrassing conse- 
quence of leading to the prediction that extinction will never occur 
even if reinforcement is permanently discontinued. I cannot claim, at 
the time of writing, to have escaped entirely from this predicament, 
but a series of as yet unpublished experiments has suggested a way out. 
Studying the effects of blocks of forced reinforced and forced unrein- 
forced maze runs, I have found that in the naive animal an unrein- 
forced trial produces no apparent change in response probabilities, but 
that after a series of reinforced runs to a given side of a T maze, the 
introduction of nonreinforcement yields a decrement in probability of 
response to that side describable by the difference equation 
$p_{n+1} = (1 - \theta)p_n$. Apparently under partial reinforcement schedules, the 
effect of nonreinforcement varies between these extremes; the quanti- 
tative description of this variation remains a problem for further 
research. 

In human learning experiments, the introduction of unreinforced 
trials raises problems analogous in many respects to those brought out 
by the T-maze example. Nonreinforcement arises in the Thorndikian 
situation if reward or knowledge of results is withheld, in the guessing 
experiment if on some trials none of the events $E_j$ occurs. As in the 
animal case, contiguity principles are of no help in predicting the 
effect of an unreinforced trial unless we can specify just what nonrein- 
forcement leads the subject to do. Certainly this will depend on the 
nature of the task and the instructions, so no one specification is likely 
to hold in general. By considering particular experimental procedures, 
we may, however, be able to define classes of procedures which yield 
similar outcomes. One class for which we might expect to find con- 
siderable homogeneity comprises situations in which either the nature 
of the instructions or the nature of the task leads the subject to interpret 
a blank trial (i.e., one on which none of the experimentally defined 
reinforcing events occurs) as indicating neither correctness nor incor- 
rectness of his response. In such situations, contiguity assumptions 
lead to the prediction that the effect of a blank trial on response 
probability will be described, on the average, by 

$$p_{n+1} = p_n$$


Incorporating this hypothesis into our acquisition model, we can 
derive learning curves for a modified guessing experiment in which the 
subject’s task is to guess which of a set of reinforcing events 
($E_1, E_2, \ldots, E_r$) will occur but in which the probabilities of these events do 
not add up to unity. The predicted curve has the same form as that of 
Eq. (9), but the new asymptote is 

$$\bar{p}_j = \frac{\pi_j}{\pi_1 + \pi_2 + \cdots + \pi_r}$$

where $\bar{p}_j$ is the mean asymptotic probability of predicting event $E_j$, 
and $1 - (\pi_1 + \pi_2 + \cdots + \pi_r)$ is, of course, the probability of a 
blank trial. Derivations of the learning function for this case have 
been reported by Neimark [76], Estes [24], and Bush and Mosteller 
[13, chap. 13], and experimental results which appear generally con- 
firmatory have been reported by Neimark [76], and by Anderson [1]. 
Very recently, a more stringent test of the “neutral” hypothesis has 
been provided by Atkinson [2], who showed that under a specially 
contrived reinforcement schedule differential outcomes are predictable 
from three different hypotheses about the effect of a blank trial. The 
hypotheses are 




(A) The “neutral” hypothesis: on the average a blank trial leaves all response 
probabilities unchanged. 

(B) The response which occurs on a blank trial is conditioned to all cues 
sampled on the trial. 

(C) All the alternative responses are equally likely to be conditioned to the 
cues sampled on the blank trial. 


Atkinson’s experimental results appeared incompatible with B or C, 
but compatible with A. 
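
The asymptote implied by the “neutral” hypothesis (A) is easy to compute and to check against an iterated mean curve. The sketch below is illustrative and is not the derivation given in [76], [24], or [13]. It assumes that when event j occurs, response j moves to (1 - θ)p + θ and every other response to (1 - θ)p, and that a blank trial changes nothing. The event probabilities, θ, and function names are hypothetical.

```python
def neutral_asymptotes(event_probs):
    """Asymptotic response probabilities when blank trials leave p unchanged.

    event_probs[j] is the probability that reinforcing event j occurs on a
    trial; the values may sum to less than 1, the remainder being the
    probability of a blank trial.
    """
    total = sum(event_probs)
    return [pi_j / total for pi_j in event_probs]

def neutral_mean_curve(event_probs, theta, p_start, n_trials):
    """Mean response probabilities trial by trial under the neutral hypothesis."""
    total = sum(event_probs)
    p = list(p_start)
    history = [p]
    for _ in range(n_trials - 1):
        # Averaging over event j, any other event, and a blank trial gives
        # p_j <- (1 - theta * total) * p_j + theta * pi_j
        p = [(1 - theta * total) * p_j + theta * pi_j
             for p_j, pi_j in zip(p, event_probs)]
        history.append(p)
    return history

event_probs = [0.3, 0.1, 0.2]            # blank trials occur with probability .4
asymptotes = neutral_asymptotes(event_probs)
history = neutral_mean_curve(event_probs, theta=0.1,
                             p_start=[1 / 3, 1 / 3, 1 / 3], n_trials=400)
print([round(a, 3) for a in asymptotes], [round(x, 3) for x in history[-1]])
```

Blank trials slow the approach to the asymptote, since the effective rate is θ times the probability of a non-blank trial, but they do not move the asymptote away from the relative frequencies of the events.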

In a second important class of experiments, the instructions to the 
subject state or imply that omission of reinforcement indicates a wrong 
response. An experiment of this type reported by Detambel [16] 
utilized two response keys and a single “reinforcing” light; the sub- 
ject’s task was to attempt on each trial “to choose the key that will 
turn on the light.” In one of Detambel’s groups, the light appeared 
following responses to key $A_1$ with probability .5 and following re- 
sponses to key $A_2$ with probability zero. Assuming that under these 
conditions nonreinforcement of $A_2$ will have the same effect on the 
response probabilities as reinforcement of $A_1$, and vice versa, we can 
enter the values $\pi_{11} = .5$ and $\pi_{21} = 1$ in Eq. (10) and obtain for the 
predicted asymptotic probability of response $A_1$, 

$$p_\infty = \frac{1}{1 - .5 + 1} = .67$$

which proves to be in good agreement with Detambel’s data.⁹ 


⁹ Data for a variety of reinforcement schedules in this experimental situation have 
been reported by Brand, Woods, and Sakoda [7]. 
When this last type of experiment involves more than two alterna- 
tive response classes, a new theoretical problem arises which has 
instigated considerable investigation but has not been adequately 
solved. Imagine an experiment identical to that of Detambel described 
above, except that three response keys are available to the subject. 
Now, as before, a response will be expected to receive an increment in 
probability $p_{j,n+1} = (1 - \theta)p_{j,n} + \theta$ when it is followed by the rein- 
forcing light and a decrement $p_{j,n+1} = (1 - \theta)p_{j,n}$ when it is not. In 
either case the change in $p_{j,n}$ must be compensated by a change in one 
or both of the other response probabilities. If, e.g., $A_1$ occurs and the 
reinforcing light does not appear, we must have 

$$p_{1,n+1} = (1 - \theta)p_{1,n}$$

and 

$$p_{2,n+1} + p_{3,n+1} = (1 - \theta)(p_{2,n} + p_{3,n}) + \theta$$

in order to keep the probabilities adding up to unity. But how is the 
increment to be distributed between classes $A_2$ and $A_3$? The mathe- 
matical theory will be satisfied by any mode of distribution which 
conserves total probability. One might expect that the increment 
would be divided equally among the remaining response classes, but 
Neimark has shown that predictions based on this assumption deviate 
systematically from observed terminal mean response probabilities 
[76]. Several investigators have suggested that some sort of weighting 
scheme should be employed, so that the increment will be most likely 
to go to the response which has been most frequently reinforced. 
A purely ad hoc weighting system is not particularly satisfying, however, 
and no one has yet succeeded in deriving an adequate weighting 
system from more general theoretical assumptions. 

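
The conservation constraint and the freedom it leaves can be made concrete. The sketch below is one possible reading and is not Neimark's procedure: every response probability is scaled by (1 - θ), and the increment θ is shared among the responses that did not occur according to a set of shares summing to one. Equal shares correspond to the equal-division assumption; the unequal shares in the example are a purely hypothetical weighting.

```python
def nonreinforced_update(p, chosen, theta, shares=None):
    """Update response probabilities after `chosen` occurs without reinforcement.

    p      -- current response probabilities (summing to 1)
    chosen -- index of the response that occurred and was not reinforced
    shares -- hypothetical division of the increment theta among the other
              responses (values summing to 1); None means equal division
    """
    others = [j for j in range(len(p)) if j != chosen]
    if shares is None:
        shares = {j: 1.0 / len(others) for j in others}
    new_p = [(1 - theta) * p_j for p_j in p]     # every class is scaled by (1 - theta)
    for j in others:
        new_p[j] += theta * shares[j]            # the increment theta is shared out
    return new_p

p = [0.5, 0.3, 0.2]
equal = nonreinforced_update(p, chosen=0, theta=0.2)
weighted = nonreinforced_update(p, chosen=0, theta=0.2, shares={1: 0.75, 2: 0.25})
print([round(x, 3) for x in equal], [round(x, 3) for x in weighted])
```

Both calls satisfy the two equations given above for the three-key case, and both keep total probability at unity; they differ only in how the increment is split, which is exactly the point the mathematical theory leaves open.
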
How can we summarize the present status of the theory relative to 
interpretations of nonreinforcement? For situations involving only two 
response classes, satisfactory interpretations can be provided for two 
extreme cases. If instructions or preliminary training ensure that non- 
reinforcement of a given response either produces no change in re- 
sponse probability or else produces a decrement equivalent to that 
resulting from reinforcement of a competing response, then accurate 
predictions of curve forms and asymptotes are generated by the theory. 
Experimental conditions can be specified which appear to satisfy 
the assumptions associated with these two cases. For experiments in 
which the effect of nonreinforcement is not well determined by instruc- 
tions or preliminary training, obtained learning curves appear always 
to fall between those corresponding to the “neutral” and “decre- 
mental” cases [see, e.g., 13, chap. 13; 42, 76]. The model for the 
“neutral” case has been successfully extended to include situations 
involving more than two response classes, but a comparable extension 
has not been achieved for the “decremental” case. 

The series of problems generated by our attempts to handle non- 
reinforcement has been instructive in a number of respects. In it we 
have found numerous reminders that the role of a theory, at any level 
of analysis, is to process information, not to substitute for empirical 
investigations. When, for a given situation, we are able to specify the 
response which is reinforced by each distinguishable outcome of an 
experimental trial, our mathematical theory generates accurate 
quantitative predictions as to the course and asymptote of learning, 
but when our specification is incomplete, the theory remains silent. 
Similarly, when we are able to specify the response that will be evoked 
by an experimental event, the contiguity principle is helpful in indicat- 
ing what reinforcing effect the event will have; but when we lack this 
specification, the contiguity principle cannot be applied. Always a 
theory first asks us what we know, then tells us what we can predict. 

We also have seen illustrations of how theoretically significant 
classifications evolve from the interaction of theory and experiment. 
Even the most familiar empirical landscape may be quite altered in 
appearance when viewed through the medium of a new theory. New 
distinctions are forced upon us; old lines of demarcation sometimes 
disappear, sometimes take on new significance. Time after time we 
find that with a modification of some assumption, or merely on recom- 
bination of assumptions, the theory delivers a new set of theorems 
(experimental laws). In applying the theory we are led, then, to classify 
together sets of situations which are described by the same sets of laws. 
Thus we find that experiments of the maze type run with correction 
procedures and human learning experiments involving complete 
knowledge of results (full information as to which response is correct on 
each trial) are described by the same quantitative laws; therefore we 
group these together under the label determinate reinforcement. Experi- 
ments which involve nonreinforcement or incomplete knowledge of 
results are not all describable by a single set of laws, however, and must 
be further analyzed. 

Independence of learning rates and asymptotes. The unsettled question of 
what to do about nonreinforcement has not been the only roadblock on 
our way to an adequate account of acquisition under partial reinforce- 
ment. Another cluster of vexing problems has been associated with 
two parameters, $\pi$ and $\theta$ in Eq. (9), which should behave independ- 
ently but which persistently refuse to do so. According to our assump- 
tions, the sampling parameter $\theta$ depends only on stimulus variables 
(plus characteristics of the organism) and, therefore, should be unaf- 
fected by variation in the probability of reinforcement. In both animal 
and human cases, however, it is found that the rate of learning, and 
therefore the value of $\theta$, varies directly with the difference between 
initial response probability and probability of reinforcement in any 
series [37, 65, 76, 89]. The hypothesis has been suggested [37] that this 
relation may be, in a sense, an artifact of massed trial procedures. 
In the derivation of Eq. (9) it was assumed that a single population of 
stimulus elements is sampled independently on each trial. If, however, 
trials occur in close temporal succession, then the stimulus complex 
affecting the subject on any one trial may include traces of the stimula- 
tion associated with the responses and reinforcing events of one or 
more preceding trials. Then $\theta$ might well depend on $\pi$, for the pattern 
of trace stimulation would be more variable for $\pi$ near .5 than for $\pi$ 
near 1 or 0. However, the relationship should break down if the in- 
tertrial interval is increased or if other measures are utilized to reduce 
the carry-over of trace stimulation from one trial to the next. Experi- 
mental tests of this hypothesis have yielded curiously mixed outcomes. 
In experiments with human subjects, increasing the effective trial 
spacing has usually resulted in decreasing the dependence of $\theta$ on $\pi$ 
and improving the fit of the model [27, 33, 89]. In the T-maze experi- 
ments, however, the same does not appear to be true; acquisition is 
faster ($\theta$ is greater) with uniform reinforcement than with partial 
reinforcement even when maze trials are spaced at 24-hr inter- 
vals [36, 65]. 

It would be convenient if we could settle each problem before 
moving on to the next. However, nature is not always so obliging as to 
permit this, and we cannot allow our general theoretical advance to be 
held up indefinitely by a few isolated pillboxes. While some of our 
forces remain pinned down by refractory problems associated with 
nonreinforcement and parameter invariance, we continue to make 
progress on other fronts. 

Stimulus variability. We have seen that the statistical model assumes 
learning to be basically a sampling process. On each trial the organism 
samples the population of stimulus elements defined by the experi- 
mental situation and only the elements sampled on the trial influence 
response probability or undergo changes of state as a consequence of 
reinforcement. These assumptions are reflected in such acquisition 
functions as Eqs. (6), (7), and (9) where $\theta$, defined in the model as the 
mean sampling probability of stimulus elements, appears as a slope 
parameter. The fact that various derived functions containing $\theta$ as a 
parameter have turned out to provide good descriptions of empirical 
acquisition curves may be taken as supporting the theory in a general 
way, but it is not critical with respect to any particular aspect of the 
theory. In any one acquisition series, the stimulating situation is 
fixed; therefore, no matter how well we succeed in fitting a theoretical 
function to an empirical curve, we obtain no information as to how the 
value of $\theta$ depends on stimulus properties. To obtain such information, 
we must ascertain whether $\theta$ behaves appropriately when stimulus 
conditions are systematically varied. 

There are no standard methods for controlling the variability of a 
stimulating situation,¹⁰ so it has been necessary to improvise methods 
for the desired test experiments. Burke, Estes, and Hellyer [11] intro- 
duced gradations in stimulus variability by assigning varying sampling 
probabilities to the components of a complex stimulus. Subjects in a 
two-alternative verbal conditioning situation were presented with a 
pattern of lights on a display panel as the ready signal at the beginning 
of each trial. In the first experiment, the probabilities of reinforcement 
were independent of the signal pattern, but the individual panel lights 
were assigned sampling probabilities of 1/3, 2/3, and 1 for three groups 
of subjects. Therefore, stimulus elements associated with any one light 
should have $\theta$ values in the ratio 1/3:2/3:1 for the three groups. 
Results of the experiment confirmed the qualitative predictions that 
the rate of learning should be directly related to stimulus sampling 
probability while the asymptote of learning should be independent of 
it, and also agreed well with quantitative predictions derived from the 
theory. 
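
The two predictions, rate depending on sampling probability and asymptote independent of it, can be read directly off the learning function. The sketch below is an illustration, not the analysis reported in [11]. It assumes the closed form usually written for Eq. (9), p_n = π - (π - p_1)(1 - θ)^(n-1), and takes θ proportional to the sampling probability of the lights; the values of π, p_1, and the base θ are hypothetical.

```python
def learning_curve(pi, theta, p1, n_trials):
    """Assumed closed form of Eq. (9): p_n = pi - (pi - p1) * (1 - theta)**(n - 1)."""
    return [pi - (pi - p1) * (1 - theta) ** (n - 1) for n in range(1, n_trials + 1)]

def trials_to_reach(curve, asymptote, tolerance=0.01):
    """First trial number on which the curve is within `tolerance` of the asymptote."""
    for n, p in enumerate(curve, start=1):
        if abs(asymptote - p) <= tolerance:
            return n
    return None

PI, P1, BASE_THETA = 0.8, 0.5, 0.15       # hypothetical values
sampling_probs = (1 / 3, 2 / 3, 1.0)      # the three groups described above
results = {}
for s in sampling_probs:
    theta = BASE_THETA * s                # theta values in the ratio 1/3 : 2/3 : 1
    curve = learning_curve(PI, theta, P1, n_trials=400)
    results[s] = (trials_to_reach(curve, PI), curve[-1])

for s in sampling_probs:
    print(round(s, 2), results[s][0], round(results[s][1], 3))
```

The group with the lowest sampling probability needs the most trials to come within .01 of the asymptote, while all three groups end at the same terminal level.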

A follow-up experiment [33] used a discriminative situation; the 
procedure differed from that just described only in that the subject’s 
task was to discriminate between two sets of signal lights, and the 
sampling probabilities for individual lights were 1/2 and 1 for “vari- 
able” and “constant” stimulus groups. Again the principal trends in 
the data were in agreement with theoretical predictions. As usual, 
however, the correspondence between theory and data, although 
promising, was not perfect. The principal disparity in this case was 
that for the discrimination group, the predicted effects of stimulus 
variability on learning rate were smaller than expected, and in fact 
not statistically reliable, during original acquisition; only during a 
reversal series were the effects significant and of the expected order of 
magnitude. The same disparity has appeared in further, unpublished, 
studies, and therefore probably cannot be written off to sampling 
error. It has been suggested [33] that the effects of experimentally 
controlled stimuli, e.g., the panel lights, may be attenuated on early 
trials by the presence of uncontrolled stimulation, primarily verbal, 
associated with the giving of instructions and initial adjustment to the 
experimental situation on the part of the subject; this hypothesis should 
not be difficult to check experimentally. 

¹⁰ Several investigators, e.g., Voeks [91], Wolfle [94], have reported effects of 
stimulus variability on learning rate which appear to be in qualitative agreement with 
theoretical expectation; the techniques used for manipulating stimulus variability in 
those studies do not, however, lend themselves to quantitative tests of the present 
theory. 

Transfer and stimulus compounding. All successful applications of 
statistical learning theory provide indirect support for the assumption 
that probability of a response is equal to the proportion of stimulus 
elements in the trial sample that are connected to the response. So 
long as the stimulating situation remains fixed, however, no evidence 
is obtainable that could be considered critical with respect to this 
assumption. As we shall see in a later section, other investigators, 
starting with quite different assumptions, sometimes arrive at learning 
functions identical in form to functions derived from our assumptions. 
To obtain more direct tests, we must determine response probability 
in the presence of two or more stimulus situations, then predict the 
probability in a new situation which comprises portions of the other 
situations in a new combination. An experiment of this type by Fink 
and Patton [39] has yielded qualitatively confirmatory findings, but 
for a more rigorous test we must turn to Schoeffler’s carefully con- 
trolled quantitative study [83]. 

As the first stage in his study, Schoeffler trained human subjects to 
discriminate between two randomly selected, nonoverlapping sets of 
signal lights, $S_1$ and $S_2$, on a display panel. After the subjects had at- 
tained a criterion of 100 per cent correct responding (i.e., one response 
$A_1$ reached a probability of approximately unity in the presence of $S_1$ 
and a second response $A_2$, a probability of approximately unity in the 
presence of $S_2$) they were tested on various new combinations, e.g., 
all of $S_1$ plus all of $S_2$, half of $S_1$ plus one fourth of $S_2$, all of $S_1$ plus a new 
set $S_3$ which had not been presented during discrimination training. 
Determinations of response probability on a sample of 180 subjects 
yielded close agreement between predicted and obtained values. In the 
cases just cited, for example, the predicted probabilities of the $A_1$ 
response were .5, .67, and .75, respectively; the observed probabilities 
were .54, .63, and .73. Comparable accuracy of prediction has been 
obtained in tests of compounding following the “probabilistic dis- 
crimination” training described in a later section, and, by Peterson 
[78], in tests of compounding following verbal association training. 
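
The three predicted values can be reproduced from the proportion rule stated at the beginning of this section. The sketch below is an illustration and not Schoeffler's own computation. It assumes that the training sets contain equal numbers of lights, that every $S_1$ light is connected to $A_1$ and every $S_2$ light to $A_2$ after training, and that the lights of an untrained set are divided evenly between the two responses. The set size and the function name are hypothetical.

```python
from fractions import Fraction

def compound_probability(components):
    """Predicted probability of A1 for a test compound.

    components -- list of (number_of_elements, proportion_connected_to_A1)
    The prediction is the proportion of all elements in the compound that
    are connected to A1.
    """
    total = sum(n for n, _ in components)
    connected = sum(n * share for n, share in components)
    return connected / total

N = 12                                   # hypothetical number of lights per set
TRAINED_A1 = Fraction(1)                 # S1 lights: all connected to A1
TRAINED_A2 = Fraction(0)                 # S2 lights: none connected to A1
UNTRAINED = Fraction(1, 2)               # assumed even split for new lights

all_s1_all_s2 = compound_probability([(N, TRAINED_A1), (N, TRAINED_A2)])
half_s1_quarter_s2 = compound_probability(
    [(Fraction(N, 2), TRAINED_A1), (Fraction(N, 4), TRAINED_A2)])
all_s1_new_s3 = compound_probability([(N, TRAINED_A1), (N, UNTRAINED)])
print(all_s1_all_s2, half_s1_quarter_s2, all_s1_new_s3)
```

Under these assumptions the three compounds give 1/2, 2/3, and 3/4, the predicted values .5, .67, and .75 quoted above. The result does not depend on the particular set size chosen, only on the sets being equal.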

Generalizations of the original statistical model. Behavioral 
theories have always to deal with extremely complex empirical situa- 
tions. Consequently, in order to get started on the formulation of a 
new theory, it is always necessary to resort to ruthless simplification 
and idealization of the empirical system. All but a very few variables 
are ignored completely, and those that remain are permitted only 
the simplest properties. Once the enterprise has gained some momen- 
tum and the initial maximally simplified model has survived some 
preliminary empirical tests, it is time to begin asking how the theory 
would be modified if some of the simplifying assumptions were relaxed. 
At a minimum, the ensuing analyses yield improved perspective as 
to the direction and magnitude of errors of approximation involved in 
working with the original model. In some cases the results are much 
more far reaching and lead to important generalizations of the theory. 

Variation in stimulus sampling probabilities. During the initial formula- 
tion of the stimulus sampling model it was expedient to assume that all 
stimulus elements in a given population have equal probabilities of 
being sampled on any learning trial. Utilizing this simplifying assump- 
tion, we arrived at a set of simple laws, e.g., those represented by Eqs. 
(4) and (5) above, which have been the basis for a variety of empirical 
applications. The first occasion for seriously questioning this assump- 
tion arose in the fall of 1951 when C. J. Burke and I were invited to 
contribute a paper on set-theoretical models to a symposium on learn- 
ing theory at the meetings of the Institute of Mathematical Statistics. 
In order to give a systematic account of the class of models represented 
by the original statistical model, we undertook to investigate the conse- 
quences of removing the assumption of equal sampling probabilities. 
The result of this investigation [cf. 32] was a more general model which 
included the original one as a special case (the “equal © case”). 

The general model (“unequal θ case”) is not very convenient to
work with for practical purposes, but study of its properties has led to
some interesting conclusions. One of these is the finding that under all
conditions of reinforcement so far investigated, predicted asymptotes of
learning are independent of variation in stimulus-sampling probabilities.
If we compare the acquisition curve derived for any particular
situation from the general model with the corresponding equal θ case,
we find that in each instance the curves not only go to the same asymptotes
but have similar forms. Further, even when there is considerable
dispersion of θ values, the curves derived on the equal θ assumption
provide adequate approximations to the exact curves for most purposes
of experimental application. A finding of somewhat different
character is that when there is dispersion of θ values in a stimulus
population, the predicted slope of a curve of relearning following
extinction will be steeper than that of the original learning curve.
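
As a rough illustration of the asymptote result, the following Python sketch compares an equal θ population with an unequal θ population when every trial is reinforced. It rests on assumptions made only for this sketch: all elements start unconditioned, a sampled element becomes conditioned on a reinforced trial, and response probability is taken as the θ-weighted proportion of conditioned elements. The function name and the θ values are hypothetical.

```python
def acquisition_curve(thetas, n_trials):
    """Expected response probability on trials 0..n_trials.

    Assumptions of this sketch: every trial is reinforced, element i is
    sampled with probability thetas[i] and is conditioned when sampled,
    and response probability is the theta-weighted share of conditioned
    elements.
    """
    total_weight = sum(thetas)
    curve = []
    for n in range(n_trials + 1):
        # Probability that element i has been conditioned by trial n.
        conditioned = [1.0 - (1.0 - t) ** n for t in thetas]
        p = sum(t * f for t, f in zip(thetas, conditioned)) / total_weight
        curve.append(p)
    return curve


# Equal theta case versus a dispersed set with the same mean (hypothetical values).
equal_case = acquisition_curve([0.2, 0.2, 0.2, 0.2], 200)
unequal_case = acquisition_curve([0.05, 0.15, 0.25, 0.35], 200)

for trial in (0, 1, 5, 20, 200):
    print(trial, round(equal_case[trial], 3), round(unequal_case[trial], 3))
```

Under these assumptions both curves start at zero and approach unity; they differ only in the course they take on the early trials.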

Stimulus fluctuation model for time-dependent processes. From what we 
know of the structure and function of organisms it seems clear that the 
process of stimulation must involve at least two sources of variability, 
one with a locus primarily in the environment and one with a locus 
primarily in the organism. Effective stimulation occurs when energy 
changes in the environment give rise to activity in a receptor system.
Any variation in the energy input from the environment must be 
expected to produce variation in effective stimulation. For some pur- 
poses it is convenient to assume that stimulating conditions can be held 
perfectly constant throughout an experimental period and can be 
exactly reproduced from one period to the next. Even if this assump- 
tion were satisfied, variability in stimulation would still be produced 
both by variations in the gross orientation of the subject relative to 
sources of stimulation and by variation in physiological properties of 
receptor systems. The subject-determined variability was taken into 
account in the original statistical model by the assumption that the 
subject draws random trial samples of stimulus elements from a fixed 
population. It was immediately apparent, however, that the theo- 
retical restriction to a fixed stimulus population was paralleled by a 
restriction on empirical applicability of the model to situations in 
which such time-dependent processes as forgetting and spontaneous 
recovery could be assumed negligible. 

These considerations led me to investigate the consequences of per- 
mitting a stimulus population to vary in composition during the 
course of learning; this variation would, of course, be the conceptual
counterpart of variation in environmental conditions. The results of 
the ensuing mathematical analyses have been incorporated into a 
more general model which includes population constancy as a special 
case and which also permits treatment of a variety of distributional 
phenomena which were beyond the scope of the original formulation.

The extended theory assumes that the stimulus events which cor- 
respond to stimulus elements in the set model depend upon numerous 
independently variable aspects or components of the environmental 
situation. The dependency is such that a given stimulus element is 
available for sampling during a given experimental period only if a 
particular set of conditions is present in the environmental situation.
Presumably systematic variation in environmental conditions can be
ruled out by appropriate experimental controls, but there must always
be some residual random variability in physical properties of the
situation. Intuitively, one would expect random fluctuation in environmental
conditions to be readily observable in most learning experiments
but perhaps detectable only by instrumental measurement in
the best controlled situations. In any event, no sweeping commitment
as to magnitude of these effects is necessary. We simply assume that
the sources of stimulation in any experimental situation undergo constant
random fluctuation and proceed to derive the testable consequences
of this assumption.

The mathematical properties of the fluctuation process (cf. [25]
for details) are such that each stimulus element has some fixed
probability of being available for sampling during any given period. 
The important consequence of stimulus fluctuation for learning theory 
is that stimulus elements which are available for sampling during 
a learning period may be replaced by other, previously unavailable, 
elements prior to a subsequent test period; thus response probabilities 


[Figure 7 panel labels: End first acquisition; End rest interval; End second acquisition; End rest interval]

Fig. 7. Representation of retention losses during acquisition
in terms of stimulus element model. During the
rest interval following each acquisition period, conditioned
and unconditioned elements (closed and open
dots, respectively) are randomly interchanged between
the available and unavailable portions of the stimulus
population (S and S′, respectively) until mean densities
in the two portions are equalized. Proportion of
closed dots in the upper half of each circle represents
momentary response probability.


may change over a period of time during which there has been no 
opportunity for learning. 

Consider, for example, the hypothetical conditioning situation 
schematized in Fig. 7. The total set of stimulus elements potentially 
available in the situation is partitioned into the subset S of elements
available for sampling by the subject during any one period and the
subset S′ of elements unavailable for sampling during the period. Closed
and open circles represent elements that are or are not, respectively,
connected to the conditioned response. Initially, let us assume, all
elements are unconditioned. At the end of the acquisition period, all
elements in S have become conditioned and probability of the conditioned
response has reached unity. During a rest period, however,
the random fluctuation process leads to an interchange of elements
between S and S′. As this interchange continues, the proportion of
conditioned elements in S decreases until the proportions in S and S′
reach equality. Consequently, if subjects are returned to the experimental
situation and tested after various rest intervals, we will find that
probability of the conditioned response decreases from 1.0 to .5 along
a negatively accelerated curve as a function of the rest interval. If a 
second period of conditioning is given after a rest interval, the total 
number of conditioned elements will be increased over the previous 
Fig. 8. Proportion of correct responses per trial for first four
trials of four successive days in a T-maze study with continuous
reinforcement. [Vertical axis: Proportion]


total (from 4 to 6 in the example) and therefore response probability 
will exhibit a smaller decline (from 1.0 to .75 in the example) during 
a rest period following the second acquisition period. Over a series of 
acquisition periods, the amount of retention loss (“forgetting”) between
periods will continue to decline until eventually all elements are
conditioned and the interchange of elements between periods leads
to no further change in response probability. In other words, amount
of retention increases with the number of acquisition periods. In Fig. 8
are shown empirical curves for successive periods of acquisition of a
maze response [for experimental details cf. 36]. The intraperiod and
interperiod changes in response proportions demonstrate nicely the
effects anticipated from the schema of Fig. 7.
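
The arithmetic of the Fig. 7 example can be sketched in Python as follows. The sketch assumes four available and four unavailable elements (a population size inferred from the quoted values of 4, 6, 1.0, .5, and .75), that each acquisition period conditions every available element, and that each rest interval is long enough for densities in the two portions to equalize. The function name is hypothetical.

```python
def retention_after_rests(n_available, n_unavailable, n_periods):
    """Return (total conditioned, response probability after rest) per period.

    Assumes each acquisition period conditions all available elements and
    each rest interval fully equalizes the density of conditioned elements
    between the available and unavailable portions.
    """
    total = n_available + n_unavailable
    cond_unavailable = 0.0  # expected conditioned elements outside the available set
    results = []
    for _ in range(n_periods):
        # Acquisition: every available element ends the period conditioned.
        total_conditioned = float(n_available) + cond_unavailable
        # Rest: interchange continues until both portions share one density.
        density = total_conditioned / total
        cond_unavailable = density * n_unavailable
        results.append((total_conditioned, density))
    return results


for period, (conditioned, p_after_rest) in enumerate(retention_after_rests(4, 4, 4), 1):
    print(period, conditioned, p_after_rest)
```

With these assumed values the post-rest probabilities rise from .5 to .75 and onward toward unity, so the loss between periods shrinks as acquisition periods accumulate.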

From the assumption of constant fluctuation, taken together with
the original statistical model, it has been possible to derive a set of
quantitative laws describing curves of spontaneous recovery and forgetting,
and to account for numerous phenomena relating to distribution
of practice [25, 26]. Also, the extended theory provides for a
distinction between the two aspects of learning represented by reaction
potential and habit strength in Hull’s system [55] or reflex strength
and reflex reserve in Skinner’s [85]. In the statistical theory, momentary
response strength is defined in terms of the set of elements that is
available for sampling during a given period, habit strength in terms
of the total population of elements that may become available at any
time during an experiment. Resistance to extinction may continue to
increase even after momentary response probability has reached an
asymptote if the additional learning trials are spaced out sufficiently so
that there is opportunity for interchange of previously available and
previously unavailable elements.

Quantitative predictions generated by the fluctuation model have
been tested in several experimental studies. A number of predicted
characteristics of spontaneous recovery and regression (retention loss)
in simple instrumental learning situations have been demonstrated by
Lauer and Estes [36, 65]. The most elaborate quantitative application
of the model is Homme’s study of spontaneous recovery in a bar-pressing
situation [54]. Homme obtained evidence confirming predicted
relations between the number and distribution of reinforcements of a
response and properties of curves for subsequent extinction and recovery.
He also showed that in order to provide a detailed quantitative
description of the bar-pressing data it is necessary to modify the
simplifying assumption of equal fluctuation probabilities for all stimuli
in a situation; Homme’s modification consists in assuming that
some of the stimulus elements in a population are constantly available
while the remainder undergo random fluctuation. Another elaboration
of the original model has been proposed by McConnell [69], who
derived regression and recovery curves applicable to situations


concerning phenomena of retroactive and proactive interference. This
type of application is illustrated for a hypothetical retroaction experiment
in Fig. 9. The experiment is assumed to be one in which the
stimulus situation remains the same and only reinforcement contingencies
change from original to interpolated learning. Solid circles in
the diagram represent stimulus elements connected to the response
reinforced during original learning; open circles, elements connected
to the response reinforced during interpolated learning. The
sets S and S′ are defined as in Fig. 7. From the same assumptions discussed
in relation to the example of Fig. 7, it clearly follows that
probability of the originally reinforced response on the test period will
be inversely related to amount of interpolated learning and directly
related to duration of the rest interval between interpolated learning
and the test period. A preliminary test of these predictions has been
provided by Frankmann’s recent study of retroactive inhibition in a
T-maze situation [40]. The study was exploratory in nature, utilizing
only two values of each independent variable, and the results, although
generally in accord with expectation, are hardly conclusive.
It does seem clear, however, that quantitative predictions concerning


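[Figure 9 panel label: End original learning]

Fig. 9. Representation of retroactive inhibition. Closed and open dots,
respectively, represent stimulus elements connected to the response
reinforced during original learning and to the response reinforced
during interpolated learning.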


Statistical learning theory has been, at the mathematical level, a story
of progressively increasing abstraction. The initial formulations were
strongly molded by the characteristics of particular empirical relationships,
and formal assumptions were inextricably intermingled with
substantive hypotheses. In order to clarify the structure of the theory
and pave the way for extension and generalization, it was necessary
to disentangle assumptions as to how learning proceeds from assumptions
concerning necessary and sufficient conditions of learning. This
had been largely accomplished by the time Burke and I reported
a generalization of the set-theoretical model for stimulus variables
[32].

Once the mathematical structure of a theory has been clarified, a
natural next step is to study the class of mathematical models or systems
which it represents. Such analysis often brings out close interrelationships
among models that have initially taken diverse forms
owing to peculiarities of empirical origins or predilections of particular
investigators. In the case of statistical learning theory, it has been
found that a number of superficially very different models can best be
regarded as instances of the class of stochastic (probabilistic) processes
known as Markov processes. This finding has had a number of useful
outcomes. It has facilitated formal comparison among different particular
models, e.g., those developed by Bush and Mosteller and those
by Estes and Burke [29]. It has brought new mathematical
methods to bear on problems which arise in working with the models.
And researches which are in progress at the time of writing suggest
that this line of investigation may yield a modification of present
statistical models which will improve and extend the treatment of
discrimination learning.

Extension to new empirical areas. Complementing the internal
forces that lead to generalizations of a model are external ones arising
from attempts to handle new empirical problems. Space will not permit
us to discuss all of the researches in progress which are attempting
to extend the empirical scope of statistical learning theory. It will
suffice, perhaps, for the purpose of sketching the present front line
of theoretical advance, if we merely indicate briefly the nature of some
of these lines of combined experimental and theoretical investigation.

Elementary discrimination learning. It should be possible to handle the
simplest forms of discrimination learning by a rather obvious extension
of our acquisition model. The only complication is that we must define
a set of stimulus elements to correspond to each of the stimulus
situations to be discriminated. Communality, or similarity, between
situations will be represented by common elements. At one extreme,
a discrimination problem involving two totally different situations will
be represented by two non-overlapping stimulus sets, say S₁ and S₂. If we
considered only trials on which S₁ was presented, we would predict
an acquisition curve of the form of Eq. (6), and if we considered only
trials on which S₂ was presented, we would predict a similar acquisition
curve for the other response. Therefore, a curve for probability
of correct responding against trials would have the same form as
both of these and would go to unity as an asymptote. If, however, the
two sets had some elements in common, then at any given time some of
these common elements would be connected to A₁ and some to A₂
(50 per cent to each under the conditions we are considering if S₁ and
S₂ are presented equally often), and asymptotic probability of correct
responding would be less than unity.
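
The effect of common elements on the asymptote can be sketched in Python. The sketch assumes equal sampling probabilities for all elements, equally frequent presentation of the two situations, elements unique to a set fully conditioned to the correct response at asymptote, and common elements divided 50 per cent to each response as stated above. The function name and element labels are hypothetical.

```python
def asymptotic_correct(set_1, set_2):
    """Asymptotic probability of a correct response, averaged over both situations.

    Assumes equal sampling probabilities, equally frequent presentation,
    unique elements connected to the correct response, and common elements
    split evenly between the two responses.
    """
    common = set_1 & set_2

    def p_correct(stimulus_set):
        unique = len(stimulus_set - common)
        # Unique elements contribute 1.0 each, common elements 0.5 each.
        return (unique * 1.0 + len(common) * 0.5) / len(stimulus_set)

    return 0.5 * (p_correct(set_1) + p_correct(set_2))


print(asymptotic_correct({1, 2, 3, 4}, {5, 6, 7, 8}))  # disjoint sets
print(asymptotic_correct({1, 2, 3, 4}, {3, 4, 5, 6}))  # half the elements shared
```

On these assumptions the asymptote falls below unity by half the proportion of common elements, which is the quantity the conventional discrimination experiments discussed later put to the test.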

Unfortunately, most conventional discrimination experiments are
too complicated to provide suitable test situations for our theory. In
experimental situations of the Lashley type, for example, duration
of exposure to the discriminative stimulus is uncontrolled, and may
vary systematically during the course of learning. Data on discrimination
in classical conditioning situations [53, chap. 8] and simple operant
situations [44] appear qualitatively in accord with theoretical
expectations, but there have been no parametric studies suitable for
testing quantitative predictions.

In the Indiana laboratory, we have carried out some quantitative
tests using the same apparatus described above in connection with
studies of stimulus variability. The light-guessing situation is modified
by defining for each subject two sets of panel lights, S₁ and S₂, which
are to be discriminated. The two sets may be disjoint or they may have
some lights in common, and individual lights can be assigned any
desired probabilities of appearing in a trial sample. On the positive
side, studies with this situation have found the predicted form for
the curve of discrimination learning [33, 83] and have confirmed
predictions as to the effect of introducing partial reinforcement [33].
Also, the model has suggested a new type of discrimination experiment
in which the two stimulus situations to be discriminated are the same
except for the sampling probabilities of the components. In the situation
described above, a “probabilistic discrimination” learning experiment
will arise if the two sets of lights, S₁ and S₂, are identical, but for
trials on which response A₁ is reinforced the lights have one set of
sampling probabilities and for trials on which A₂ is reinforced they
have a different set of sampling probabilities. For example, suppose
each set included the same two lights, L₁ and L₂; we could establish
a probabilistic discrimination by assigning L₁ and L₂ probabilities of
.9 and .1, respectively, of appearing on trials when A₁ is reinforced
and .1 and .9, respectively, on trials when A₂ is reinforced. The gist of
the theoretical prediction for this type of situation is that the probability
of response A₁ in the presence of a given stimulus component, say
L₁, should change exponentially over a series of trials and should tend
to an asymptote equal to the conditional probability that A₁ will be
reinforced when L₁ is present. Confirmatory findings have been obtained
in a recent experiment [34].
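
The predicted asymptote for a probabilistic discrimination can be sketched in Python with Bayes' rule. The numerical values are hypothetical and chosen only for illustration: a light that appears on 90 per cent of trials when the first response is reinforced and on 10 per cent of trials when the second is reinforced, with the two kinds of trial equally frequent. The exponential form of the approach, the rate value, and the function names are likewise assumptions of the sketch.

```python
def reinforcement_given_cue(p_cue_given_a1, p_cue_given_a2, p_a1=0.5):
    """Conditional probability that A1 is reinforced when the cue is present."""
    joint_a1 = p_cue_given_a1 * p_a1
    joint_a2 = p_cue_given_a2 * (1.0 - p_a1)
    return joint_a1 / (joint_a1 + joint_a2)


def response_curve(asymptote, theta, p0, n_trials):
    """Exponential approach: p_n = asymptote - (asymptote - p0) * (1 - theta) ** n."""
    return [asymptote - (asymptote - p0) * (1.0 - theta) ** n
            for n in range(n_trials + 1)]


# Hypothetical values: cue present on 90% of A1 trials and 10% of A2 trials.
asymptote = reinforcement_given_cue(0.9, 0.1)
curve = response_curve(asymptote, theta=0.2, p0=0.5, n_trials=50)

print(round(asymptote, 3))
print([round(p, 3) for p in curve[:5]])
```

If the two kinds of trial were not equally frequent, the `p_a1` argument would shift the conditional probability and hence the predicted asymptote.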

On the negative side, the prediction that in conventional discrimi- 
nation experiments the asymptotic probability of incorrect responding 
should be directly related to the proportion of common elements 
appears to fail in many situations. Even under relatively simple and 
well-controlled conditions, we find both human and animal subjects 
attaining virtually perfect discriminations between stimulus complexes 
or patterns which under any reasonable interpretation must be 
assumed to have many components or properties in common [see, e.g., 
17, 63, 78, 80]. What strategy to adopt in the face of these disparities 
is not easy to decide. The stimulus element model has proved so useful 
in accounting for acquisition phenomena and in dealing with stimulus- 
compounding and time-dependent processes, that we can hardly 
think of giving it up entirely. One course of action that has been 
explored by several recent investigators is to supplement the statistical 
acquisition model with some higher-order process. Burke, in an un- 
published study, and Wyckoff [95] have proposed models incorporat- 
ing the assumption that, in a discrimination situation, the organism 
learns “observing responses” which permit it to attend to the relevant 
cues in a situation (those on which differential reinforcement is based) 
and ignore the common cues; then discrimination learning relative to 
the relevant cues proceeds as in the simpler acquisition model. Restle 
[80, 81] has proposed a similar two-process theory which assumes that 
the organism simultaneously “adapts to” the irrelevant cues (common 
elements) in a situation and undergoes conditioning to the relevant 
cues. A somewhat different possibility, now under investigation by 
the writer, is to develop a more general sampling model which allows 
for learning with respect both to stimulus patterns and to stimulus 
components as special cases. 

Classical conditioning and habituatory decrement. It may seem odd that 
these situations, traditionally considered the simplest in the area of 
learning, have not been treated in detail by statistical learning theory. 
The reason for this apparent oversight is related to the fact that the 
simplicity of experiments on conditioning and habituation is paral- 
leled by a refinement in level of measurement. Duration of stimuli and 
time intervals between conditioned and reinforcing stimuli are ac- 
curately measured in conditioning experiments and are known to be 
important independent variables. Present statistical models can 
handle classical conditioning experiments at the same relatively gross 
level of analysis which characterizes treatments of maze learning or 
learning of verbal associations, but as yet they do not handle relationships
involving durations and magnitudes of stimuli or interstimulus
intervals. Much effort has been expended on attempts to remedy 
this deficiency, but no adequate solution is yet at hand or even clearly 
foreseeable. The basic difficulty is that available data are inadequate to 
guide the theorist at a number of critical choice points. An adequate 
statistical model will have to include assumptions concerning the 
detailed course of growth and decay of stimulus samples. Unfortu- 
nately, there are several different sets of assumptions for which reason- 
able cases can be made, and available quantitative data on temporal 
relationships in conditioning are insufficient to decide among the 
alternatives. Thus theoretical progress in this area will have to go 
hand in hand with parametric experimentation. A start toward 
handling the variable of conditioned stimulus intensity in condition- 
ing has been reported by Barnes [5], and an extensive research program 
concerned with temporal and intensive variables in conditioning has 
been initiated by D. W. Lauer and his associates in the Indiana 
laboratory. 

Parameters of reinforcement. It seems reasonable to expect that such 
parameters as magnitude and delay of reinforcing stimuli should prove 
amenable to treatment by statistical models. Again, however, progress 
is slowed by a paucity of quantitative data suitable to guide extension 
of present theories. Investigations of amount of reinforcement in ani- 
mal learning have accomplished more toward testing explanatory 
hypotheses and turning up new empirical phenomena than toward 
determining quantitative relationships. 

According to a contiguity interpretation, magnitude of a reinforc- 
ing stimulus should be related to characteristics of learning only in so 
far as it influences the relative frequencies of responses compatible or 
incompatible with the reinforced response. Consider, for example, a 
runway situation with food as the reinforcing agent. Other things 
equal, increasing the quantity of food per reinforcement will normally 
have at least the following two consequences for behavior in the goal 
box. Firstly, a larger quantity of food can be seen or smelled at a 
greater distance and thus will be more likely to evoke an approach 
response while the animal is still exposed to cues from the portion of 
the runway path anterior to the goal box. Conditioning of approach 
responses to cues associated with progressively earlier portions of 
the runway will in turn lead to the reductions in running time and 
latency which are taken as indices of learning. Secondly, a larger 
quantity of food will require longer to consume and thus will delay the 
occurrence of interfering responses, e.g., exploratory responses or
attempts to escape from the goal box, which might otherwise become 
conditioned to runway cues and lead to increases in running time. 
There have been several attempts to test this interpretation. A
study by the writer [21] demonstrated that the usual differential 
effects of large as compared to small magnitudes of reinforcement upon 
a runway response are eliminated if similar reactions to the goal box 
are established by appropriate pretraining for the groups of animals 
which are to receive large vs. small amounts of reinforcing agent on 
runway trials. Following up this investigation, Hellyer [51] reasoned 
that if, when distance cues are controlled, increasing the amount of
reinforcing agent is effective primarily because it serves to delay the 
occurrence of competing responses in the goal box, then similar effects 
should be obtained by alternative procedures which operate in the 
same way. In an ingeniously designed runway experiment, Hellyer 
manipulated rate of delivery of water independently of amount. His 
intuitively surprising finding was that the effects of slowing the rate of 
delivery of a given amount of water were qualitatively similar to those 
of increasing the amount given; he also found that a statistical control 
for duration of ingestive behavior reduced the effect of both variables 
to insignificance. 

The contiguity interpretation of reinforcement does not, so far at 
least, stand on its own feet as a determinate and testable theory. The 
principal difficulty is that the intuitive notion of varying degrees of 
compatibility between different pairs of response classes has not been 
spelled out in sufficient detail. One cannot, in general, state in advance 
of an experiment for any particular pair of stimuli whether the second 
will evoke a response which interferes with the response to the first or 
one which facilitates it. 

The contiguity interpretation does, however, suggest the form of a 
testable descriptive theory. If the magnitude of a reinforcing stimulus 
determines the probability that the stimulus will evoke a response 
compatible with the reinforced response, then we can associate with 
any reinforcing stimulus a parameter c which, for any given set of 
experimental conditions, takes on the same mathematical role as the 
parameter π in Eq. (9) above.¹¹ Thus we can predict that the asymptote
of response probability will be directly related to magnitude of
reinforcement whereas relative rate of learning will be independent of
this parameter.¹² For purposes of more detailed quantitative prediction,
the value of c characterizing a given reinforcing procedure for a
given population of organisms must be evaluated from experimental
data. As in the case of the rate parameter θ, circularity is avoided by
virtue of the fact that the parameter c appears in numerous independent
predictive relationships all derivable from the same model.

¹¹ On any single trial, c and 1 − c represent the probabilities that the reinforced
response will become connected to all stimulus elements in the trial sample [then
Δp = θ(1 − p)] or will lose all its connections to elements in the trial sample (then
Δp = −θp), respectively.

¹² Data which appear to confirm these predictions have been reported by L. H.
Hughes, Jr., in a paper, Saccharine reinforcement in a T maze, J. comp. physiol.
Psychol., 1957, 50, 431-435.
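
The two predictions about the parameter c can be checked with a short Python sketch. It averages the two single-trial outcomes described in the footnote, an increase of θ(1 − p) with probability c and a decrease of θp with probability 1 − c, which gives an expected change of θ(c − p) per trial. The values of θ and c and the function names are hypothetical.

```python
def expected_curve(c, theta, p0, n_trials):
    """Expected response probability under the two single-trial outcomes.

    With probability c the change is theta * (1 - p); with probability
    1 - c it is -theta * p. The expected change is theta * (c - p).
    """
    curve = [p0]
    for _ in range(n_trials):
        p = curve[-1]
        curve.append(p + theta * (c - p))
    return curve


def relative_progress(curve, c):
    """Fraction of the initial distance to the asymptote still remaining."""
    start_gap = curve[0] - c
    return [(p - c) / start_gap for p in curve]


theta = 0.1  # hypothetical rate parameter
small = expected_curve(c=0.6, theta=theta, p0=0.0, n_trials=100)
large = expected_curve(c=0.9, theta=theta, p0=0.0, n_trials=100)

print(round(small[-1], 3), round(large[-1], 3))
print(round(relative_progress(small, 0.6)[10], 4),
      round(relative_progress(large, 0.9)[10], 4))
```

In this sketch the asymptote equals c, while the fraction of the remaining distance closed on each trial is θ whatever the value of c, which is the sense in which relative rate of learning is independent of magnitude of reinforcement.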

When relationships among variables in a problem area are not 
thoroughly understood, useful parametric data cannot always be had 
for the asking. A number of years ago I initiated a study with the mod- 
est purpose of determining the functional relationship between rate of 
bar pressing and concentration of sucrose solution used as reinforcing 
agent for the range of concentrations below those covered in Guttman’s 
[48] experiment. Substantial quantities of data were collected, but 
the relationship at issue has yet to be determined. The first unexpected 
finding was that most animals could not be conditioned with a rein- 
forcement of 4 per cent sucrose solution, yet after training with a more 
concentrated solution, the same animals would maintain maximal 
rates of responding, apparently indefinitely, on the 4 per cent solution. 
The second, and for my original purpose still more inconvenient com- 
plication, was that even for well-trained animals, no orderly rate func- 
tion could be obtained at low concentrations. For any given animal, 
near maximal response rates could be maintained by concentrations 
down to a critical region, near 4 per cent sucrose in most cases, but at 
lower concentrations, extinction occurred; the originally anticipated 
result of low but stable response rates at low concentrations simply did 
not materialize. Prospects of accounting for this complex set of findings 
in terms of statistical learning theory appear promising [22], but 
a comprehensive and detailed treatment of magnitude of reward in the 
free-responding situation is certainly not at hand. 

In many human learning experiments, magnitude of reinforcement 
can be interpreted in terms of the amount of information given con- 
cerning the correctness or incorrectness of the subject’s responses. 
From the standpoint of contiguity theory, the basic difficulty in han- 
dling this type of experiment is that we ordinarily have no way of 
specifying in advance how a subject will respond to an ambiguous or 
uninformative outcome. A recent approach to this problem has em- 
ployed a combination of methods, drawing upon learning theory, 
information theory, and psychophysics. The first step was to arrange 
an experimental situation in which stimuli of varying degrees of 
ambiguity could be presented, and in which the ambiguity could be 
scaled by a psychophysical procedure and related to the scale of 
response probabilities. In this situation the subject is presented on each 
trial with a pair of geometrical figures which differ in form; the sub- 
ject’s task is, first, to guess at the beginning of each trial whether the 
larger figure will appear on the left or on the right, and second, 
when the stimuli are presented, to judge which is the larger. Now if by
appropriate preliminary experimentation we obtain a set of stimulus
pairs which evokes, on the average, a proportion π of right judgments
(i.e., judgments that the right-hand figure is larger in area), we can
enter this value of π in Eq. (9) and predict the curve of learning in
terms of probability of right responses (guesses). As in the simple
Humphreys-type guessing situation, the parameter π should determine
the asymptotic probability of the guessing response.

Some evidence concerning the adequacy of this approach is 
presented in Table 1. The data were obtained from four groups¹³ of


Table 1

                 Asymptotic response probability
Group     N      Theoretical      Observed
  1      10          .85             .87
  2      10          .73             .66
  3      10          .36             .39
  4      10          .14             .11


subjects run for 216 trials in the situation described above (unpublished 
study by M. Johns and the writer). The observed values of asymptotic 
response probability are mean proportions of “right” guesses over 
the last 72 trials of the series; the theoretical values (estimates of π) are
mean proportions of “right” judgments. These results are encouraging 
enough to suggest that other human learning situations in which rein- 
forcement is identified with knowledge of results may yield to similar 
analyses. For such situations, it appears that reinforcing value of a 
stimulus is inversely related to the ambiguity, or uncertainty, asso- 
ciated with the stimulus, and that by virtue of this relationship, the 
reinforcing value can be quantitatively scaled by an independent set of
operations. 

Partial reinforcement and resistance to extinction. The almost ubiquitous
finding that resistance to extinction is greater after partial than after
continuous reinforcement has been a challenge to all varieties of
reinforcement theory, and the present theory is no exception. Some
research indirectly related to this problem has been discussed above in
connection with our attempts to account for the vagaries of
nonreinforcement in T-maze learning. In an extensive series of studies,
Weinstock¹⁴ [92, 93], Lauer [64, 66], and the writer have found that
the decremental effect of nonreinforcement varies systematically as a
function of number and distribution of reinforcements and
nonreinforcements, and Weinstock [92] has investigated possible
modifications in present statistical models which might account for the
observed relationships.

13 For groups 1 and 4, the two figures of every pair differed slightly in area. For
groups 2 and 3, the two figures of each pair were actually equal in area, but one figure
of each pair was judged larger with significantly greater than 50 per cent frequency by
our population of subjects.

At a qualitative level, all these researches provide indirect evi- 
dence at least mildly favorable to a contiguity-interference interpreta- 
tion. According to the present theory, the mechanism for extinction of 
an instrumental response is conceived as follows. The function of non- 
reinforcement is to establish a stimulating situation in which compet- 
ing responses have high probabilities. When the competing responses 
are evoked in the presence of stimulus elements which are connected to 
the formerly reinforced response, conditioning occurs in accordance 
with the contiguity principle, and the competing responses gain con- 
nections at the expense of the formerly reinforced one. During a series 
of reinforced trials, some one response, e.g., approaching a food tray, 
becomes well conditioned to the reinforcing situation, e.g., the goal 
box of a maze. Then when reinforcement is first omitted, cues from 
the changed situation and cues arising from the disrupted response 
sequence evoke competing responses with high probability.¹⁵
Consequently, extinction tends to be rapid following a series of continuously
reinforced trials. Under a partial reinforcement schedule, the situa- 
tion is somewhat different. Competing responses which become condi- 
tioned following a nonreinforcement early in the series will tend to 
occur on later reinforced trials; therefore, response-produced stimuli 
associated with them will become conditioned cues for the reinforced 
response and will tend to maintain the latter during a subsequent 
extinction series. 

An advantageous feature of the interference interpretation is that,
unlike other hypotheses that have been proposed, it accounts for the
relationship between partial reinforcement and resistance to extinction
under conditions of widely spaced trials. When trials are temporally
massed, the “trace hypothesis” proposed by Hull [57] enters the
picture. According to my interpretation of Hull’s hypothesis, some of the
stimulus elements associated with nonreinforcement on a given trial of
a partial series may, if the intertrial interval is not too great, appear in
the sample of elements available at the beginning of a subsequent
reinforced trial. Consequently, cues associated with nonreinforcement
may become conditioned cues for the reinforced response and tend to
maintain it during a following extinction series. In a recent paper on
discrimination learning [10], C. J. Burke and I have shown that a
quantitative account of this type of trace-conditioning can be derived
from the assumptions of statistical learning theory.

14 A valuable byproduct of Weinstock’s experiments is a quantity of data on
distribution of running times; Bush and Mosteller [13, chap. 14] have reported some
analyses of these data in terms of a response model substantially the same as the one
discussed in an earlier section of this paper.

15 As in the interpretation of reinforcement discussed above, the initial application
of contiguity-interference principles to any experiment involving extinction suffers
ambiguity because there are no general rules for prescribing just what responses will
“compete” with a given response. However, once definite assumptions have been made
concerning the conditions of response competition in a given situation in order to
predict or explain some particular experimental result, the theory becomes
determinate for that situation and can be expected to yield additional testable predictions.

FIG. 10. Schematic representation of temporal relations
among stimulus and response events in an escape learning
experiment.


Escape and avoidance learning. In Fig. 10 we have schematized the two
kinds of trials that may occur in a simple escape learning experiment.
Response class $A_1$ represents all actions that result in termination of
the unconditioned (to-be-escaped) stimulus and class $A_2$ all actions that
do not. Each trial begins with presentation of the conditioned stimulus
and, either simultaneously or after a short interval, the unconditioned
stimulus. In the first trial shown in Fig. 10, response $A_1$ occurs
immediately and terminates both stimuli. In the second trial, one or more
responses belonging to class $A_2$ occur first, then $A_1$ occurs and
terminates both stimuli. According to the statistical model, the
populations of stimulus elements associated with the conditioned and
unconditioned stimulus sources are available for sampling whenever
the sources are operative. On trials of the first type, only the $A_1$
response is evoked contiguously with the conditioned stimulus sample
and therefore should become conditioned to it. On trials of the second
type, the $A_1$ response is the last to occur in the presence of the
conditioned stimulus sample and therefore should remain connected to all
elements in the sample at the end of the trial.¹⁶ Thus on trials of both
types, the terminal response should receive the increment in
probability specified by Eq. (5).

Data suitable to test a number of predictions from this theory of
escape learning have been obtained by Straughan [88]. In Straughan’s
study, human subjects learned to terminate a loud tone by selecting
the correct one of two motor responses. For some of Straughan’s
subjects, reinforcement was partial rather than continuous, the $A_1$ and
$A_2$ responses being reinforced by termination of the tone with
probabilities .7 and .3, respectively. Utilizing Eq. (9), we can predict that
terminal probability of the $A_1$ response should be 1.0 and .7 in the
continuously and partially reinforced groups, respectively. Also, by
means of a derivation that is straightforward but too lengthy to review
here, we can predict that terminal latencies of the $A_1$ and $A_2$
responses will be equal for each group (although not necessarily the same
for different reinforcement schedules). Observed $A_1$ proportions over
the last 20 trials of an 80-trial series for Straughan’s continuously and
partially reinforced groups ($N = 64$ for each group) were .999 and .714,
respectively. Mean latencies of the $A_1$ and $A_2$ responses over the same
trial block for the partially reinforced group were 1.40 and 1.45 sec;
no comparison of terminal latencies could be made for the
continuously reinforced group, of course, since the $A_2$ response had
virtually ceased to occur.
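
The asymptote argument can be checked roughly by simulation. The sketch below rests on stated assumptions: the response that terminates the tone is the one conditioned on each trial, so that $A_1$ is the terminal response with probability $\pi$ whatever the subject chooses first, and response probability moves a fraction $\theta$ toward the terminal response. The rate $\theta = .1$, the starting probability, the seed, and the function name are illustrative and do not come from Straughan’s study; only the group size, series length, and scoring block follow the text.

```python
import random


def simulate_group(pi, theta=0.1, p1=0.5, n_trials=80, n_subjects=64, seed=0):
    """Proportion of A1 choices over the last 20 trials, pooled over subjects."""
    rng = random.Random(seed)
    a1_count = 0
    for _ in range(n_subjects):
        p = p1  # current probability of choosing A1
        for trial in range(1, n_trials + 1):
            chose_a1 = rng.random() < p
            if trial > n_trials - 20:
                a1_count += chose_a1
            # Assumption: A1 ends the tone with probability pi, otherwise A2 does,
            # and the response that ends the tone is the one conditioned.
            a1_terminal = rng.random() < pi
            target = 1.0 if a1_terminal else 0.0
            p += theta * (target - p)  # linear-operator increment
    return a1_count / (n_subjects * 20)


if __name__ == "__main__":
    print("continuous reinforcement:", simulate_group(1.0))
    print("partial reinforcement:   ", simulate_group(0.7))
```

Under these assumptions the simulated terminal proportions settle near 1.0 and .7, the values the theory predicts for the two groups.
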

The effective scope of statistical learning theory in this general class
of human learning experiments has been further extended by Brody [8]
who introduced additional complications by including both avoidance
and escape contingencies. Brody’s subjects were permitted to choose
between two mutually exclusive responses on any trial. A given
response served to prevent the occurrence of a to-be-avoided stimulus
with a predetermined probability, say $\pi_a$. On trials when this stimulus
occurred, the same response served to terminate it with a second
predetermined probability, say $\pi_e$. Utilizing for nonavoidance trials
the same analysis discussed above in relation to escape learning trials,
and assuming that on avoidance trials the successful avoidance
response becomes connected to all stimulus elements in the trial sample,
Brody derived functions which provided good fits to his empirical
acquisition curves. As can be seen in Table 2, Brody also succeeded in
predicting the asymptotic probabilities of partially reinforced
avoidance responses with accuracy comparable to that achieved by previous
investigators for simple verbal conditioning experiments. The
theoretical probabilities in Table 2 are computed from the formula

16 It is possible that on extended trials of the second type, more than one sample
would be drawn from the conditioned stimulus population. Since only the last sample
would become conditioned to the $A_1$ response, the increment in response probability
would be less than on a trial of the first type. The consequences of this modified
assumption will not be pursued here, but it should be mentioned that the predictions
concerning asymptotic probabilities, discussed below, would not be changed.

TABLE 2

                  Reinforcement          Asymptotic response
                  probabilities          probabilities
Group     N       $\pi_a$    $\pi_e$     Theoretical     Observed
  I      32        .66        1.0           1.0            .92
  II     32        .66        .66           .79            .72
  III    32        .66        .34           .50            .53
  IV     32        .90        .30        [illegible]       .77


A subsidiary aspect of Brody’s study had to do with temporal
relations between termination of the conditioned stimulus and
occurrence of the avoidance or escape response. Each trial began with
presentation of the conditioned stimulus, or “signal,” and the
unconditioned stimulus followed only if the predesignated avoidance
response failed to occur within a critical interval of approximately 2 sec.
For half the subjects, the signal terminated at the end of the critical
interval regardless of the subject’s response (noncontingent condition).
For the remaining subjects, conditions were arranged so that the signal
terminated with the avoidance response, if the response occurred
within the critical interval, and otherwise terminated with the
unconditioned stimulus when the escape response occurred (contingent
condition). In terms of contiguity theory, optimal conditions for
acquisition prevailed for the contingent group, since the successful
avoidance or escape response was always the last response to occur in
the presence of a trial sample from the conditioned stimulus
population. For the noncontingent group, however, the last response to occur
in the presence of a conditioned stimulus sample, particularly on early
trials, would sometimes be an incorrect response; and for these trials
no increment in probability of the correct response could be predicted.
Brody’s data demonstrated that, as anticipated, the contingent
procedure significantly facilitated acquisition of the avoidance response.
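
The theoretical column of Table 2 can be checked numerically. The sketch below assumes a closed form, $p_\infty = \pi_a \pi_e / [\pi_a \pi_e + (1 - \pi_a)(1 - \pi_e)]$, obtained by supposing that the two responses have complementary avoidance and escape probabilities and that the last response on each trial is the one conditioned. This expression is a reconstruction offered for illustration, not a formula quoted from Brody, and it is compared here only with the theoretical entries for groups I to III.

```python
def asymptotic_probability(pi_a, pi_e):
    """Assumed asymptote of the partially reinforced avoidance response."""
    hit = pi_a * pi_e                      # the response is last and successful
    miss = (1.0 - pi_a) * (1.0 - pi_e)     # the alternative response is last
    return hit / (hit + miss)


# (group, pi_a, pi_e, theoretical value as printed in Table 2)
TABLE_2_ROWS = [
    ("I", 0.66, 1.0, 1.0),
    ("II", 0.66, 0.66, 0.79),
    ("III", 0.66, 0.34, 0.50),
]

if __name__ == "__main__":
    for group, pi_a, pi_e, printed in TABLE_2_ROWS:
        value = asymptotic_probability(pi_a, pi_e)
        print(f"group {group}: computed {value:.2f}, printed {printed:.2f}")
```

Agreement with three entries does not establish that this is the formula actually used; it shows only that a contiguity analysis of the kind described in the text is sufficient to generate values of this size.
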

Drive and conditioning. Throughout the preceding sections of this
report, no explicit attention has been given the variables and
relationships subsumed under the concept of “drive” in the theories of Hull
[55, 56] and Spence [87]. The omission is attributable largely to the
fact that most of the experimental situations to which statistical learn- 
ing theory has been applied do not involve changes in motivating 
conditions as independent variables. One should not conclude, how- 
ever, that these conditions have been ignored, or even regarded as 
unimportant. 

In my first paper on statistical learning theory [19], I assumed that 
“drive-inducing” operations, e.g., food deprivation, electric shock, 
affect either the makeup of the stimulus population characterizing a 
given situation or the rate at which the population is sampled by the 
organism. This assumption has been routinely taken into account in 
the designing of experiments to test other features of the theory. 
Especially in animal experiments, deprivational conditions, if they are 
not undergoing systematic variation as part of an experimental design, 
must be held constant from trial to trial during learning. In order to 
satisfy this requirement, I and my associates have come to rely in- 
creasingly upon 24-hr trial spacing in maze and runway experiments. 

When occasion arises to represent deprivational variables formally 
in the present theory, it is assumed that an operation such as food dep- 
rivation controls the availability of a specific population of stimuli, 
presumably largely visceral in origin. Degree of deprivation influ- 
ences the probability that any cue (“stimulus element”) in this popula- 
tion will be sampled on any single trial. Shifts in drive level (as 
operationally defined) produce changes in the relative contribution of 
the drive population to the stimulus sample affecting the organism on 
any trial. If, owing to previous training, the drive stimuli are pre- 
ponderantly connected to the response that is being recorded, a sud- 
den increase in drive level will yield a sudden increase in probability 
of that response; if the drive stimuli are preponderantly connected to 
competing responses, a sudden increase in drive level will yield a 
decrease in probability of the recorded response. Populations of stimuli 
associated with two different operationally defined drives, say hunger 
and thirst, may have elements in common. The degree of communality 
will determine the extent to which effects of reinforcement or extinc- 
tion under one of the drives will generalize to the other and will also 
determine the limiting accuracy of discrimination attainable between 
the two drive conditions. The parameters representing sizes of drive-
stimulus populations, associated $\theta$ values, and degrees of communality
between populations associated with different operationally defined 
drives must be evaluated from behavioral data for any given situation, 
but then can be used, just as any parameters of the model, to generate 
predictions about behavior under new combinations of conditions. 
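
The argument about shifts in drive level can be put in numerical form. The sketch below assumes that response probability is approximated by the expected share of sampled elements that are connected to the recorded response, with a situational population and a drive population sampled independently. The population sizes, sampling probabilities, and connection proportions are hypothetical values chosen only to show the direction of the effect.

```python
def response_probability(n_sit, theta_sit, f_sit, n_drive, theta_drive, f_drive):
    """Expected share of the stimulus sample connected to the recorded response.

    n_*     : number of elements in the situational / drive population
    theta_* : sampling probability of each element in that population
    f_*     : proportion of that population connected to the recorded response
    (Ratio of expected counts, used as an approximation to the expected ratio.)
    """
    expected_sit = n_sit * theta_sit
    expected_drive = n_drive * theta_drive
    connected = expected_sit * f_sit + expected_drive * f_drive
    return connected / (expected_sit + expected_drive)


def drive_shift(f_drive, low=0.1, high=0.6):
    """Response probability before and after raising the drive sampling probability."""
    before = response_probability(50, 0.3, 0.5, 40, low, f_drive)
    after = response_probability(50, 0.3, 0.5, 40, high, f_drive)
    return before, after


if __name__ == "__main__":
    print("drive cues mostly connected to the response:", drive_shift(0.9))
    print("drive cues mostly connected to competitors: ", drive_shift(0.1))
```

With drive stimuli preponderantly connected to the recorded response, raising the drive sampling probability raises response probability; with the connections reversed, the same shift lowers it.
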

Detailed assumptions concerning the populations of drive stimuli
involved in problems of generalization between hunger and thirst 
drives in an operant conditioning situation have been formulated and 
tested at a qualitative level by Bishop [6]. Parametric studies are 
needed to determine functional relationships between properties of 
drive-stimulus populations and such observable independent variables 
as deprivation time in standard experimental situations. 

Verbal learning. From the standpoint of the present theory, paired- 
associate learning is regarded as an instance of multiple discrimination 
learning. Therefore we should expect experimental laws for paired- 
associate learning to be derivable from the statistical model developed 
in connection with simple acquisition and discrimination experiments. 
In a typical paired-associate study, the subject is given repeated pres- 
entations of a list of stimuli, usually printed words or nonsense sylla- 
bles, and is required to associate a particular response word with each 
item of the list. The stimulus items are presented singly, in varying 
orders from one showing of the list to the next, and reinforcement con- 
sists in exhibiting the correct response word after the subject’s response 
on each trial. To represent such an experiment theoretically, we define 
a collection of stimulus sets, $S_1, S_2, \ldots, S_K$, one for each item in
the stimulus list, and corresponding response classes, $A_1, A_2, \ldots, A_K$.
Response $A_k$ is reinforced on a given trial if and only if stimulus $S_k$ is
presented. Probability of a correct response to stimulus $S_k$ on the
$n$th showing of the list should, then, be given by the same function as
Eq. (6) above, namely,

From this relation, one can readily compute theoretical curves for
mean proportion of correct responses against trials, families of curves
for the probability of any designated number of correct responses per
trial, and numerous other testable functional relationships. For purely
illustrative purposes, I have plotted two of the theoretical curves
derivable for learning of a 10-item list in Fig. 11, together with
empirical data representing 108 protocols from a group of relatively
slow-learning subjects.¹⁷ Systematic tests of the analysis sketched here
are in progress in the Indiana laboratory, but none of this research
has yet been published.
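
The two kinds of curve plotted in Fig. 11 can be sketched as follows, assuming that Eq. (6) has the form $p_n = 1 - (1 - p_1)(1 - \theta)^{n-1}$ and that the 10 items of a list are learned independently with a common $\theta$, so that the number correct on a trial is binomially distributed. Both assumptions, and the starting value $p_1 = 0$, are illustrative; only $\theta = .13$ and the list length are taken from the text and the figure caption.

```python
from math import comb


def p_correct(n, theta=0.13, p1=0.0):
    """Assumed probability of a correct response on the nth showing of the list."""
    return 1.0 - (1.0 - p1) * (1.0 - theta) ** (n - 1)


def prob_exactly_k(n, k, items=10, theta=0.13, p1=0.0):
    """Binomial probability of exactly k correct responses on showing n."""
    p = p_correct(n, theta, p1)
    return comb(items, k) * p ** k * (1.0 - p) ** (items - k)


if __name__ == "__main__":
    for n in range(1, 11):
        print(f"showing {n:2d}: mean proportion correct {p_correct(n):.3f}, "
              f"P(exactly two correct) {prob_exactly_k(n, 2):.3f}")
```

Because the second curve uses no parameter beyond the $\theta$ fitted to the first, it constitutes a prediction rather than a fit, which is the point of the lower panel.
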

Serial verbal learning involves more difficult theoretical problems,
partly because there are so many quantitative relationships to be
accounted for. Atkinson [3] has constructed a statistical model which
describes a number of serial position effects. Probably the most
valuable immediate outcome of Atkinson’s theoretical investigation is its
empirical byproducts. In order to facilitate applications of his
theoretical functions, Atkinson conducted a series of carefully designed
experiments which have clarified a number of relationships between
the serial position curve and independent variables. The model itself,
still in a preliminary form, involves several new assumptions which
require independent tests before it can be properly evaluated.

17 I am indebted to Dr. Benton J. Underwood for these paired-associate data.

FIG. 11. Theoretical and empirical curves of paired-associate
learning. Upper panel: proportion of correct responses per trial
for the set of 108 protocols. Lower panel: proportion of
protocols exhibiting exactly two correct responses on each trial. The
$\theta$ value of .13 determined from the upper curve was used to
compute a priori predictions for the lower curve.


Critically needed research. The items mentioned in this section will 
represent a very small sample from a very large population. Our best 
experimental efforts seem only to whet the insatiable demand of a 
growing quantitative theory for more and better data. The simplest 
problems turn into complex ones when we try to progress from qualita- 
tive to quantitative description and from description to prediction. 
We should like to place a standing order for more good parametric 
experiments in all areas of learning; but who will fill the order? 
Research designed to refine our measurements and descriptions of 
functional relationships ordinarily seems too much like drudgery to
attract many investigators. It begins to take on glamour, however, 
when there are quantitative theories to be tested. Under the instiga- 
tion of contemporary theoretical developments, the output of needed 
types of parametric research is increasing, but I can see no immediate 
danger of a surplus. 

Some of the most critical areas for the immediate needs of statistical 
learning theory have become apparent in the course of our review. In 
many instances, curiously enough, these involve, not new research 
problems, but classical ones in areas that the nonspecialist looks upon 
as “all worked out.” To review just one example, time relations in 
classical conditioning and habituatory decrement require clarifica- 
tion before much progress toward an adequate quantitative theory can 
be anticipated. There have been numerous studies of the CS-US inter- 
val in eyelid conditioning, but none that suffices to determine whether 
the asymptotes or the slope constants of conditioning curves, or both, 
are affected by variation of the interval; an empirical decision on this 
point is needed to guide formulation of a stimulus trace model within 
statistical learning theory. There have been a few studies of US 
duration, but none that have partialed out the interval from comple- 
tion of the UR to termination of the US; the role of this independent 
variable must be understood before we can even tentatively appraise 
the adequacy of an interference account of response decrement in 
conditioning situations. 

In general, research on learning in classical situations has been all 
but dominated by “critical ratio,” and more recently “analysis of 
variance” research. Much is known about methods of producing 
significant group differences in standard learning situations, but little 
about the limits within which learning can be modified by manipula- 
tion of standard independent variables. For all the studies of distribu- 
tion of practice in conditioning or in simple verbal learning, we do not 
know even roughly within what extremes learning rate can be varied 
by manipulation of intertrial intervals. Only after decades of research 
on eyelid conditioning have there begun to appear studies [e.g., 91] 
showing how rapidly conditioning can be achieved when radical 
measures are taken to reduce stimulus variability from trial to trial. 
Countless similar examples could be adduced. Quantitative learning 
theories would pay their way if they did nothing but lead to increased 
emphasis on exploring the full range of effects of independent variables 
and on determining forms of functional relationships between 
variables. 

Another critically needed shift in conventional research strategies 
has been urged most eloquently by Harlow [50], although for somewhat
different reasons from those I shall emphasize. In the vast
majority of studies of learning, the yield is a single set of data for the 
groups of organisms investigated. This one limitation in itself makes 
the data of most experiments almost useless for purposes of testing a 
new quantitative theory. As we have demonstrated in the case of 
numerous specific problems in the preceding pages, the testing of a 
quantitative theory normally involves at least two distinct steps: 
evaluating the parameters of the model from one set of data generated 
by a given subject or group of subjects, and then making predictions 
concerning new sets of data obtainable from the same subjects. So 
simple an experiment as repeated acquisition and extinction of a 
classical conditioned response with well-spaced trials would be of 
more value to statistical learning theory at the present time than any 
number of significant statistics demonstrating effects of ingeniously 
contrived new variations on conventional conditioning procedures. 
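
The two-step procedure can be sketched as follows, assuming an acquisition curve of the form $p_n = 1 - (1 - p_1)(1 - \theta)^{n-1}$: $\theta$ is estimated by least squares from one series of proportions and is then used, without further adjustment, to predict a second series from the same subjects. The series in the example are synthetic numbers generated from the assumed curve, not experimental results, and the grid search is merely a convenient estimator.

```python
def curve(theta, n_trials, p1=0.0):
    """Assumed acquisition curve for trials 1..n_trials."""
    return [1.0 - (1.0 - p1) * (1.0 - theta) ** (n - 1)
            for n in range(1, n_trials + 1)]


def estimate_theta(observed, p1=0.0, steps=1000):
    """Step 1: grid-search least-squares estimate of theta from one data set."""
    best_theta, best_error = None, float("inf")
    for i in range(1, steps):
        theta = i / steps
        predicted = curve(theta, len(observed), p1)
        error = sum((o - p) ** 2 for o, p in zip(observed, predicted))
        if error < best_error:
            best_theta, best_error = theta, error
    return best_theta


def prediction_error(theta, new_observed, p1=0.0):
    """Step 2: mean squared error of predictions for a new data set."""
    predicted = curve(theta, len(new_observed), p1)
    return sum((o - p) ** 2 for o, p in zip(new_observed, predicted)) / len(new_observed)


if __name__ == "__main__":
    first_series = curve(0.2, 10)    # synthetic first acquisition series
    second_series = curve(0.2, 15)   # synthetic reacquisition series
    theta_hat = estimate_theta(first_series)
    print(theta_hat, prediction_error(theta_hat, second_series))
```

The design choice that matters is the separation of the two steps: the second series contributes nothing to the estimate, so any agreement is a test of the model rather than a consequence of fitting.
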

Finally, it should be mentioned that the learning theorist shares 
with his fellows in numerous related disciplines a need for truly com- 
parative studies of learning. By this I mean, not explorations of be- 
havioral oddities in little-known bypaths of the animal world, but 
rather systematic investigations in which experiments of standard 
types are conducted “in parallel” at a number of phylogenetic levels. 
All the published research pertaining to statistical learning theory has 
been conducted with vertebrates, and indeed almost all with either 
rats or men. I have tacitly assumed that all general assumptions and 
laws of the theory can be applied at least throughout the range of the 
vertebrates with only numerical values of parameters changing as we 
go from one particular animal form to the next. At present, however, 
this assumption amounts to little more than an article of faith. Avail- 
able comparative data concerning the quantitative effects of the major 
independent variables treated in this report are insufficient even to 
spell out in detail what we mean when we speak of “the same law” 
applying to different animal forms. 


Part III. Systematic and Comparative Analysis 


FORMAL STRUCTURE OF STATISTICAL LEARNING THEORY 


It will be apparent from the preceding account of the development 
of statistical learning theory that we shall have trouble in assembling 
an exhaustive yet logically arranged and neatly rounded summariza- 
tion of “the theory” or “the system.” The crux of the difficulty is that 
we cannot give any sharply defined referent for “the theory” or “the
system.” If, like many of the earlier psychological systematists, we had
started with a comprehensive program for theory construction, we 
could simply present the master plan and indicate how far it had been 
filled in. But we have no program. What we now call the statistical, 
or probabilistic, approach to learning theory began, not with a blue- 
print, but merely with a handful of promising (or more accurately, 
tantalizing) experimental findings and an interest in seeing what 
progress could be made toward a rigorous quantitative theory. If we 
had started out to account for a definite set of facts, we might hope to 
be approaching a definite and clearly delimited theoretical model 
representing those facts. But many of the facts that are now being 
described and predicted by statistical learning theory have been dis- 
covered during the period of its development, frequently as a direct 
result of applications and tests of the developing theory. Conse- 
quently, the outlines of “the system” are as uneven, and often hazy, 
as the lines of experimental advance in the study of learning. 

In the preceding sections of this report, statistical learning theory 
has designated an evolving collection of concepts, formal methods, and 
empirical laws organized by these concepts and methods. Now that our 
attention turns to structural considerations, our usage of the term 
theory shifts to the more static, conventional sense, i.e., a cross-sec- 
tional description at a given point in time, emphasizing the formal 
concepts and deductive machinery, and ignoring the messy complica- 
tions that characterize any interaction of formal and empirical sys- 
tems. This type of description, taken alone, is misleading, for it in- 
volves the same kinds of simplification, idealization, and abstraction 
that inevitably occur when we go from empirical phenomena to 
conceptual representations of them. It is only in the perspective 
provided by the more naturalistic account given in Part II that we 
can even attempt the summary and systematic analysis needed for 
purposes of a comparative survey of contemporary systems and 
theories. With the reservations indicated above, I shall now attempt 
to outline the formal model emerging from some ten years of interaction
between the statistical approach to learning theory and associated
experimental research.

By the term “model” I shall denote specifically the formal system 
associated with a scientific theory, i.e., the mathematical symbols and 
the assumed interrelationships among them. When a model has been 
interpreted in terms of a system of empirical events, it becomes a 
theory. Thus a theory may be conceived as a mapping of an empirical 
system onto a formal system, as illustrated for statistical learning 
theory by the following schema. Within the empirical system, two 
principal types of entities must be distinguished: events and properties 
or aspects of events. The rules of correspondence similarly fall into two
categories: coordinating definitions, which set elements of the formal
system in correspondence with classes of events¹⁸ in the empirical
system; and interpretive rules, which specify how mathematical 
functions, variables, and parameters defined within the formal system 
may be evaluated from data representing observable, and usually 
measurable, properties or aspects of empirical events. 


Formal system                  S       E                  A

Rules of correspondence            ↕                      ↕

Empirical system           Physically              Operationally
                           defined classes of      defined classes of
                           environmental           observable
                           events                  behaviors


In this section of Part III, I shall first summarize the formal terms 
(referred to in the Project A outline as “systematic concepts”) of 
statistical learning theory and the assumed relations among these 
terms, then proceed to consider empirical interpretations and deriva- 
tions of theorems which, upon interpretation and experimental 
confirmation, become the empirical laws of the theory. 


Itemization of Systematic Concepts 


It will be convenient to divide our itemization of theoretical in- 
gredients into two main parts. The first part, summarized in Table 3, 
will comprise the primitive set-theoretical constructs which are basic 
to all of our formal theorizing. The second part will comprise the 
theoretical variables which appear either as systematic dependent or 
independent variables or as parameters in the assumptions (axioms) 
of the theory and, therefore, in all descriptive statements derived from 
them. This second list, presented in Table 4, is by no means complete, 
but it includes all the theoretical variables that have appeared promi- 
nently in published presentations or applications of the theory up to 
the time of writing. These constructs and variables have all been 
introduced in Part II, so we can proceed directly to questions of their 
definition and formal status. 


Mode of Definition of Systematic Concepts 


The constructs listed in Tables 3 and 4 have the status of primitive
notions within statistical learning theories and are strictly definable
only in mathematical terms. Our technique of formalization is to
represent stimuli, responses, and reinforcing events¹⁹ by finite
mathematical sets and then to generate the systematic dependent and
independent variables and parameters of models for learning experiments
by defining functions on these sets.

18 More accurately, coordinating definitions establish correspondences between
terms of the formal system and terms of the data-language [cf. 35, pp. 320-322].

The sets and their elements serve as abstract conceptual devices 
which are useful in deriving quantitative laws but which never appear 
in the final product. It will make no sense to ask where a stimulus 
element is located, or even whether it “intervenes” between observed 


TABLE 3. SET-THEORETICAL CONSTRUCTS OF STATISTICAL LEARNING THEORIES 


Construct Characterization 
Aj (j = 1,2, ... 7) Setofr mutually exclusive and exhaustive response classes. 
sie Set of “stimulus elements” representing all stimulus events 


(all stimulation) that can affect the organismf in any 
situation at any time. 


Sx Subset of S* representing all stimulus events that can occur 
in a particular experimental situation at any time. 
Ay Subset of Sẹ representing all stimulus events that can occur 


in a particular experimental situation during a particular 
experimental period. 

S = S, —S Subset of Sẹ representing stimulus events which sometimes 
occur in a given experimental situation but are unavail- 
able during a particular experimental period. 

nh = 1, 2.4 5%) Subsets of Sẹ (or of S, depending on the assumption made in 
a particular application), one corresponding to each 
experimenter-defined stimulating situation (pattern of 
stimulation) that may occur in a given experiment. 


STs Sie iy Si Shy Subsets of elements in S*, Sy, S, or Sh respectively, that are 
connected to response Aj. 
`, (k = 1,2, ...,r) Set of reinforcing events, one corresponding to each response 
class. 


+ It is to be understood that all the stimulus sets are associated with some indi- 
vidual organism in any psychological application. 


stimuli and responses, for stimulus sets and their elements belong to 
the formal, not the empirical, system. Properties or aspects of these 
mathematical entities are, however, assumed to represent properties or 
aspects of observable behavioral events. Thus ©, the sampling proba- 
bility of a stimulus element, or N, the number of elements in a stimulus 
set, does appear in the descriptive statements derivable within the 


19 Presumably “reinforcing events” can ultimately be reduced to stimulus-re- 
sponse terms, and one of the chief purposes of contiguity-interference theory is to 
effectuate this reduction. However, applications of our mathematical theory require 
only that one be able to locate experimenter-defined events whose effects on response 
probabilities conform to the assumptions of the theory. 
theory, and rules must be given specifying how values of these
variables can be determined from empirical data.

For purposes of comparing the present theory with others, it might
be helpful to indicate how the theoretical variables of Table 4 fit into
the conventional intervening-variable paradigm. For simplicity, only
a few of the most important variables are included in the schema
shown on page 450. In this schema it is assumed that the “empirical”
variables are strictly definable in terms of the data-language
without reference to the theory. The lines between different pairs of
variables signify either interpretive rules or assumed functional
relationships, as appropriate.


TABLE 4. THEORETICAL VARIABLES

Construct                        Characterization

$n$                              Number of trials.

$r$                              Number of response classes.

$K$                              Number of experimenter-defined stimulus
                                 situations.

$h_j$                            Time required to execute response $A_j$.

$p_{hj,n}$                       Probability of $A_j$ on trial n in presence
                                 of stimulus situation corresponding to
                                 $S_h$. If there is only one stimulus
                                 situation, the subscript h is dropped; if
                                 there are only two response classes, the
                                 subscript j is dropped and $p_n$ is taken
                                 to refer to $A_1$; if reference to trial
                                 number is unnecessary, the n is omitted
                                 (yielding such variants as $p_{j,n}$,
                                 $p_n$, $p$). The argument n may be replaced
                                 by t when appropriate.

$F_{hij,n}$                      Probability that the ith stimulus element
                                 in set $S_h$ is connected to $A_j$ on trial
                                 n. As above, unnecessary subscripts and
                                 arguments are dropped, yielding
                                 $F_{ij,n}$, $F_{i,n}$, $F_i$, etc.

$\theta_{hi}$                    Sampling probability of the ith element in
                                 $S_h$. The subscript h is dropped if only
                                 one set is defined for a given experiment,
                                 yielding $\theta_i$.

$j$                              Probability that any given element in $S$
                                 becomes unavailable (enters $S'$) during
                                 an interval $\Delta t$.

$j'$                             Probability that any given element in $S'$
                                 becomes available (enters $S$) during an
                                 interval $\Delta t$.

$\pi_{hjk,n}$                    Probability of $E_k$ given that $A_j$
                                 occurs in the presence of $S_h$ on trial
                                 n. The most common variants are
                                 $\pi_{jk}$ (constant probability of $E_k$
                                 following $A_j$ in a single stimulus
                                 situation), and $\pi$ (constant
                                 probability of $E_1$ in a single-stimulus,
                                 two-alternative experiment).

$N^*, N_*, N, N', N_h, N_{hj}$   Number of elements in $S^*$, $S_*$, $S$,
                                 $S'$, $S_h$, or $S_{hj}$, respectively.

The conceptual status of such theoretical variables as N and p in 
the present theory is formally analogous to that of such terms as D 
(drive strength) and $_SE_R$ (reaction potential) in Hull’s system [55]. In
each case, simplicity of theoretical structure is gained by stating as- 
sumptions about learning in terms of theoretical variables rather than 
directly in terms of empirical dependent and independent variables. 
It should be pointed out that although constructs of the kind listed
in Table 4 will fill the roles of mathematical dependent and independent
variables in all functions derived within the theory, no one construct
can be rigidly classified in either category. In an acquisition
function, for example, p might be the dependent variable, n the
independent variable, and $\theta$ a parameter. But in a function predicting
responses to extinction as a function of stimulus variability, n might
appear as the dependent variable, $\theta$ as the independent variable, and
p as a parameter.


[Schema: theoretical variables linking empirical independent variables
(number of unconditioned stimulus presentations, intensity of conditioned
stimulus) to empirical dependent variables (relative frequency of a
response, latency of a response).]


Major Interrelations among Constructs 

In this section we shall bring together the assumptions, or more
[...] the constructs itemized in Tables 3 and 4, and some theorems
which are of wide applicability in empirical interpretations of the
model. Derivations of the theorems are available in published sources
and will not be repeated here. For convenience of reference, the
assumptions will be grouped under [...]

[...] interpreted as a sample space representing the possible behavioral
outcomes of a trial. The element $A_j$ then represents the event
“response $A_j$ occurs.” The probability of $A_j$ is defined as $p_j$; i.e.,

$$P(A_j) = p_j \qquad (j = 1, 2, \ldots, r)$$

The mutually exclusive and exhaustive character of the response
classes is expressed by the following assumptions:

1. $\sum_{j=1}^{r} p_j = 1$

2. $P(A_1 \cup A_2) = p_1 + p_2$

and similarly for any combination of response classes. Or, verbally
expressed, the probabilities of the response classes sum to unity, and
the probability that the response which occurs on a trial will belong to
a specified subset of response classes is equal to the sum of the
probabilities associated with those response classes. Using only these
assumptions together with the classical calculus of probabilities [see
Feller, 38, for details], many important theorems are derivable, e.g.:

a. The expected number of occurrences of $A_j$ in n trials is equal
to

$$p_{j,1} + p_{j,2} + \cdots + p_{j,n}$$

where $p_{j,\nu}$ represents probability of $A_j$ on trial $\nu$. If
$p_j$ is constant over the series, this sum reduces to $np_j$.

b. The probability that $A_j$ occurs for the first time on the nth
trial of a series is equal to

$$(1 - p_{j,1})(1 - p_{j,2}) \cdots (1 - p_{j,n-1})\,p_{j,n}$$

If $p_j$ is constant over the series, this product reduces to
$(1 - p_j)^{n-1} p_j$.

These theorems are of frequent application in deducing relations
between response probabilities and empirical dependent variables.
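
The two response-model theorems can be checked numerically. The sketch below is only an illustration: the function names and the sample probabilities are hypothetical, and the code assumes the reading of theorems a and b as a sum of trial probabilities and a product of "miss" probabilities times a final "hit" probability.

```python
from math import prod


def expected_occurrences(p_by_trial):
    """Theorem a: expected count of A_j is the sum of its trial probabilities."""
    return sum(p_by_trial)


def first_occurrence_probability(p_by_trial):
    """Theorem b: probability that A_j first occurs on the last listed trial."""
    *earlier, last = p_by_trial
    # A_j must fail on every earlier trial and then occur on the last one.
    return prod(1.0 - p for p in earlier) * last


# Hypothetical example: constant p_j = 0.25 over n = 4 trials.
constant = [0.25] * 4
print(expected_occurrences(constant))          # n * p_j
print(first_occurrence_probability(constant))  # (1 - p_j)**(n - 1) * p_j
```

With a constant probability the two functions reduce to the special cases stated in the theorems, which is a convenient sanity check before using trial-varying probabilities.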

Stimulus sampling model. For simplicity of exposition we shall
consider only assumptions referring to a single stimulus set $S$ (and
where appropriate, its complement $S'$) in the remainder of this section.

The following two assumptions are basic to derivations involving
stimulus sets:

1. The probability that the ith element in $S$ is sampled on any
trial equals $\theta_i$. Except when specified otherwise, $\theta_i$ is
assumed to be a parameter which has a fixed value associated with any
given combination of stimulus and response sets.

2. The probability that the ith and jth elements of $S$ are both
sampled on any given trial equals $\theta_i\theta_j$, and similarly for
any combination of elements.

Important relations derivable from these assumptions are:

a. The mean stimulus sample size on any trial, designated $\bar{s}$,
equals the sum of the $\theta$ values over all elements of $S$, i.e.,

$$\bar{s} = \sum_{i=1}^{N} \theta_i$$

If all of the $\theta_i$ are equal, this expression reduces to

$$\bar{s} = N\theta$$

b. The probability that the ith element of $S$ is sampled at least
once during the first n trials of a series equals $1 - (1 - \theta_i)^n$.

These theorems are frequently useful in the derivation of learning
functions [see, e.g., 11, 32].
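
A minimal sketch of the two sampling relations follows. The function names and numerical values are hypothetical; the code assumes that each element is sampled independently with its own probability, as assumption 2 implies.

```python
def mean_sample_size(thetas):
    """Relation a: mean sample size is the sum of the sampling probabilities."""
    return sum(thetas)


def prob_sampled_at_least_once(theta_i, n):
    """Relation b: 1 minus the probability of being missed on all n trials."""
    return 1.0 - (1.0 - theta_i) ** n


# Hypothetical set of N = 10 elements with equal theta = 0.2.
thetas = [0.2] * 10
print(mean_sample_size(thetas))             # equals N * theta when thetas are equal
print(prob_sampled_at_least_once(0.5, 3))   # 1 - 0.5**3
```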
Stimulus fluctuation model. There are two principal assumptions:

1. The set $S_*$, representing all potential stimulation associated with
a given experimental situation, is partitioned at any time into
two subsets, $S$ and $S'$, representing stimulus elements that are
and are not, respectively, available for sampling.

2. There is constant, random interchange of elements between
$S$ and $S'$. During any unit time interval $\Delta t$ there is
probability $j$ that any given element in $S$ goes into $S'$ and
probability $j'$ that any given element in $S'$ goes into $S$.

From these assumptions one can derive [25, 26] theorems basic to the
statistical theory of spontaneous recovery, forgetting, and other
distributional phenomena. If we let $f(t)$ represent the probability
that any given element of $S_*$ is in $S$ at time t following an
experimental trial (or period), then

a. $$f(t) = J - [J - f(0)]\,a^t$$

where $J = j'/(j + j')$ and $a = 1 - j - j'$.

b. When t becomes sufficiently large, the mean numbers of
elements in $S$ and $S'$ stabilize at values which satisfy the
relation

$$\frac{N}{N + N'} = J$$

As a corollary, we note that if trials are given at widely spaced
intervals, the sampling probability, on any trial, of the ith element of
$S_*$ is equal to $J\theta_i$. Strictly speaking, fluctuation should be
taken into account in all applications of the model, but in deriving
learning functions for experiments which take place in a single
relatively short experimental period, it is usually expedient to assume
that the amount of fluctuation during the period is small and to treat
$S'$ as fixed.
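
The fluctuation theorem can be illustrated by comparing a step-by-step iteration of assumption 2 with a closed form. The sketch below assumes the closed form f(t) = J - [J - f(0)]a^t with J = j'/(j + j') and a = 1 - j - j', and treats t as a whole number of unit intervals; the function names and parameter values are hypothetical.

```python
def f_closed_form(t, f0, j, j_prime):
    """Assumed closed form: f(t) = J - (J - f(0)) * a**t."""
    J = j_prime / (j + j_prime)
    a = 1.0 - j - j_prime
    return J - (J - f0) * a ** t


def f_by_iteration(t, f0, j, j_prime):
    """Apply assumption 2 one unit interval at a time."""
    f = f0
    for _ in range(t):
        # An element stays in S with probability 1 - j,
        # or enters S from S' with probability j'.
        f = f * (1.0 - j) + (1.0 - f) * j_prime
    return f


# Hypothetical values: every element starts in S (f(0) = 1).
for t in (0, 1, 5, 50):
    print(t, f_closed_form(t, 1.0, 0.10, 0.05), f_by_iteration(t, 1.0, 0.10, 0.05))
```

For large t both computations approach J, the stable proportion of elements available for sampling.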

Stimulus-response relations. Stimulus and response sets are
connected by the assumptions:

1. For any combination of stimulus and response sets, each element
of the stimulus set bears a conditional relation to (is connected
to) exactly one element of the response set. More formally, we
may say that at any time the response set generates a partition
of $S$ into subsets $S_1, S_2, \ldots, S_r$, one subset corresponding to
each of the response classes $A_1, A_2, \ldots, A_r$. It is implied by
the concept of a partition that each element of $S$ is in one and
only one of these subsets.
Finally, the stimulus and response sets are related to response
probability by the following assumption:

2. The value of $p_j$ on any trial is equal to the proportion of
elements in the trial sample that are connected to $A_j$ (i.e., that are
in $S_j$).

Now, defining $F_{ij}$ as the probability that the ith element of $S$ is
connected to $A_j$ (i.e., is in $S_j$), we can state a number of important
relations:

a. If the $\theta$ values are equal for all elements of a given set $S$,
then, on the average,

$$\bar{p}_j = F_j$$

where $F_j$ is the proportion of the elements of $S$ that are in $S_j$.

b. For any set $S$

$$\bar{p}_j \cong \frac{\sum_i \theta_i F_{ij}}{N\bar{\theta}}$$

where $\bar{\theta}$ represents the mean value of $\theta$ and $\cong$
means “tends to equality for large N.”

c. If the $\theta$ values are equal for all elements of two
nonoverlapping stimulus sets $S_a$ and $S_b$ containing $N_a$ and $N_b$
elements, respectively, and the expected probabilities of response $A_j$
in the presence of the two sets separately are $p_{aj}$ and $p_{bj}$,
respectively, then the expected probability of $A_j$ in the presence
of the combined set $S_{a \cup b} = S_a \cup S_b$ is given by

$$\bar{p}_{(a \cup b)j} = \frac{N_a p_{aj} + N_b p_{bj}}{N_a + N_b}$$

Theorems a and b are of frequent application in the derivation of
learning functions, theorem c in predicting results of experiments on
transfer or stimulus compounding.
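
As a rough illustration of theorems b and c, the sketch below computes the expected response probability as a theta-weighted average of connection probabilities, and the compound probability as an element-weighted average of the component probabilities. It assumes that reading of the two formulas; the function names and the numbers are hypothetical.

```python
def expected_probability(thetas, connection_probs):
    """Theorem b (large-N approximation): sum(theta_i * F_ij) / (N * mean theta)."""
    weighted = sum(t * f for t, f in zip(thetas, connection_probs))
    return weighted / sum(thetas)  # N * mean(theta) equals sum(theta)


def compound_probability(n_a, p_a, n_b, p_b):
    """Theorem c: expected probability of A_j for two nonoverlapping sets combined."""
    return (n_a * p_a + n_b * p_b) / (n_a + n_b)


# Hypothetical compounding test: 10 elements with p = 0.8 and 30 with p = 0.2.
print(compound_probability(10, 0.8, 30, 0.2))

# With equal thetas, theorem b reduces to the plain proportion of theorem a.
print(expected_probability([0.3] * 4, [1, 1, 0, 0]))
```

Because the larger set contributes more elements to the combined sample, the compound probability lies closer to the probability associated with the larger set.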

Learning. In general, learning is represented by changes in the 
connections between response classes and stimulus elements. We as- 
sume that learning, so defined, can occur only with respect to elements 
sampled on a trial. 

In the theoretical representation of any learning experiment, the 
term trial refers to an experimental trial as defined for the given type 
of experiment. A trial always begins with the drawing of a sample of
elements from a stimulus set $S$ (corresponding to the experimental
presentation of a CS, discriminative cue, or ready signal to the
subject) followed by the occurrence of one of the responses
$A_1, A_2, \ldots, A_r$ (corresponding to the subject’s response to the
CS, cue, or ready signal). The response may in turn be followed by
occurrence of one of the reinforcing events $E_1, E_2, \ldots, E_r$
(corresponding to such experimental operations as presenting a US,
reward, or knowledge of results). It is assumed that when no
reinforcing event occurs on a trial, no learning occurs, but when some
reinforcing event occurs, all elements sampled become connected to the
reinforced response. In more formal terms:

1. If element i is not sampled on trial n - 1, or if element i is
sampled on trial n - 1 and no reinforcement occurs, then

$$F_{ij,n} = F_{ij,n-1}$$

2. If reinforcing event $E_j$ occurs on trial n - 1, then

$$F_{ij,n} = (1 - \theta_i)F_{ij,n-1} + \theta_i$$

and

$$F_{ik,n} = (1 - \theta_i)F_{ik,n-1} \qquad \text{for all } k \neq j$$

i.e., the probability that the element is connected to $A_j$
increases and the probability that it is connected to any other
response class decreases.

The following theorems are basic to most of the learning functions that
have been utilized in applications of the theory.
If $\theta$ values are equal for all elements of a stimulus set $S$, then:

a. if no reinforcing event occurs on trial n - 1,

$$\bar{p}_{j,n} = \bar{p}_{j,n-1}$$

i.e., expected response probability does not change; and

b. if reinforcing event $E_j$ occurs,

$$\bar{p}_{j,n} = (1 - \theta)\bar{p}_{j,n-1} + \theta$$

and

$$\bar{p}_{k,n} = (1 - \theta)\bar{p}_{k,n-1} \qquad \text{for all } k \neq j$$

i.e., the probability of $A_j$ increases and the probabilities of all
other responses decrease.
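
The learning theorems lend themselves to a short numerical sketch. The code below assumes theorem b in the linear form p_j -> (1 - theta) p_j + theta for the reinforced response and p_k -> (1 - theta) p_k for the others, and assumes, purely for illustration, that the same event E_j occurs on every trial. Function names, the zero-based response index, and the parameter values are hypothetical.

```python
def apply_reinforcement(p, theta, j):
    """Theorem b: expected probabilities after reinforcing event E_j (j is zero-based)."""
    return [(1.0 - theta) * pk + (theta if k == j else 0.0)
            for k, pk in enumerate(p)]


def acquisition_curve(p_start, theta, j, n_trials):
    """Expected probabilities on trials 1..n_trials when E_j occurs on every trial."""
    curve = [list(p_start)]
    for _ in range(n_trials - 1):
        curve.append(apply_reinforcement(curve[-1], theta, j))
    return curve


def closed_form(p1, theta, n):
    """Solution of the recursion for the reinforced response on trial n."""
    return 1.0 - (1.0 - p1) * (1.0 - theta) ** (n - 1)


curve = acquisition_curve([0.2, 0.8], theta=0.1, j=0, n_trials=10)
print(curve[-1][0], closed_form(0.2, 0.1, 10))
```

Iterating the recursion and evaluating the closed form give the same value, and the probabilities of the response classes continue to sum to unity on every trial.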


Itemization of Empirical Dependent and Independent 
Variables 


Dependent variables. In all cases, dependent variables are measures
relating to experimentally defined response classes. For any particular
experiment, the response classification is defined by the recording
procedures. In a runway experiment, for example, the possible
behaviors during any time interval $\Delta t$ may be categorized into
classes $A_1$ and $A_2$, i.e., behaviors which result in breaking of a
photocell beam and behaviors which do not. In a verbal conditioning
experiment, the subject’s behavior on any trial may be categorized into
classes $A_1, A_2, \ldots, A_r$, where $A_j$ includes all behaviors
associated with prediction of the jth reinforcing event. The dependent
variables most frequently utilized in interpretations of the theory are:

1. Number of occurrences of a given response $A_j$ (for brevity we
shall use “response $A_j$” or simply $A_j$ for “an instance of response
class $A_j$”) on a given trial for a group of subjects.

2. Number of occurrences of $A_j$ in a block of trials for a given
subject.

3. Number of (unreinforced) occurrences of $A_j$ to a criterion
of extinction.²⁰

4. Number of trials to a criterion of acquisition or extinction.

5. Rate of occurrence of $A_j$ in a free responding situation.

6. Time before first occurrence of $A_j$ on a given trial.
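
Two of these measures can be computed directly from a trial-by-trial record. The sketch below is hypothetical throughout: it assumes responses are coded as integers (1 for A_1, 2 for A_2), and it assumes a criterion defined as a run of consecutive occurrences of the target response, which is only one of several criteria an experimenter might adopt.

```python
def occurrences_in_block(responses, target, start, stop):
    """Measure 2: count of the target response in trials start..stop-1 (zero-based)."""
    return sum(1 for r in responses[start:stop] if r == target)


def trials_to_criterion(responses, target, run_length):
    """Measure 4: trial number (one-based) on which the criterion run is completed.

    Returns None if the criterion is never met in the record.
    """
    run = 0
    for trial, r in enumerate(responses, start=1):
        run = run + 1 if r == target else 0
        if run == run_length:
            return trial
    return None


# Hypothetical record of seven trials for one subject.
record = [2, 1, 2, 1, 1, 1, 2]
print(occurrences_in_block(record, target=1, start=0, stop=4))
print(trials_to_criterion(record, target=1, run_length=3))
```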

Independent variables. I assume that all empirical independent 
variables (causal variables, antecedent conditions, or determinants of 
behavior) which enter into behavioral laws influence behavior by 
way of stimulation. This does not mean that hormones, genetic deter- 
miners, surgical interventions, and the like are considered unimpor- 
tant, but simply that they are considered in a formal behavioral 
theory only as determiners of parameter values. Since the term 
stimulus and its variants have many common meanings [cf. 35, pp. 284- 
287 for a discussion], I had best indicate my own usage before con- 
tinuing this section. By way of orientation, it should be emphasized 
at the outset that by stimulus and all variants of the term I refer to 
environmental conditions describable in physical terms without refer- 
ence to the behavior of the organism. By stimulating situation or simply 
situation I mean to indicate all sources of stimulation that are men- 
tioned in the experimenter’s description of the experimental
situation,²¹ and by stimulus or cue any independently manipulable part of a


20 Such theoretical terms as response, learning, and extinction do not properly belong
in this section and should be taken merely as shorthand for corresponding
data-language specifications which would be unduly tedious to spell out in detail.

21 This appeal to the experimenter’s description may seem to make us unduly
vulnerable to errors or omissions in conventional categorizations. I believe, however,
that we are merely recognizing the existing state of affairs, not adding any new
complications. As in any research program, we hope to refine our descriptions by successive
approximations. Initially we accept the distinctions and classifications of contemporary
experimental practice; but we expect later to use the developing theory to sharpen
them.
situation. No situation is precisely reproducible, so just as in the case of
response variables, the itemization of stimulus variables involves 
classification. In a conditioning experiment, for example, we designate 
by “the CS” the class of situations an instance of which occurs at the 
beginning of each trial; in the usual discrimination experiment we 
recognize two classes of situation, one correlated with reinforcement 
and one with nonreinforcement of a given response class. With these 
understandings, we can proceed to summarize the three principal 
types of empirical independent variables involved in studies of
learning, transfer, and retention:

1. Number of reinforced or unreinforced trials in a given situation. 
In this context, reinforcement and nonreinforcement refer, of course, 
to experimentally defined classes of events. 

2. Variations in components or properties of a stimulating situa- 
tion. 

3. Time between learning trials in a given situation or time inter- 
val during which the organism is not exposed to a given stimulating 
situation. 


Interpretation of Formal Theory in Terms of Empirical 
Variables 


Rules of correspondence between set-theoretical constructs and 
empirical variables. Strictly speaking, the correspondences between 
set-theoretical constructs and experimentally defined variables always 
have the status of hypotheses; in many common experimental situa- 
tions, however, the correspondences are either so intuitively obvious or 
so well established by successful applications of the theory that we tend 
to lose sight of their hypothetical character. The problems and issues 
involved are somewhat different for constructs associated with stimuli, 
responses, and reinforcing events, so we shall discuss them separately. 

Stimuli. In formulating a set-theoretical representation of a stimu- 
lating situation, we must, initially at least, assume that the experi- 
menter’s description of the situation is sufficiently accurate to indicate 
which sources of variation may be considered negligible and which 
must be taken into account. If we represent the CS or ready signal in a 
simple acquisition experiment by a single set S, we are assuming the 
situation to be sufficiently reproducible from trial to trial so that 
fluctuations in the available stimulation can be considered negligible. 
Usually it is fairly obvious a priori whether the assumption is reason- 
able. An advantage of using a standard experimental situation for 
theoretical tests is that once particular coordinating assumptions 
have been found satisfactory they become conventions for the given 
situations and need not be reconsidered for each new experiment. 
With the proviso that we keep track of our simplifying assumptions
and correct them when necessary in the light of experience, the pro- 
cedures for defining stimulus sets are straightforward. If an experiment 
is assumed to involve only repeated occurrences of a single stimulating 
situation (more precisely, instances of a single class of situations), its 
formal representation requires only a single set S. If an experiment 
involves discrimination between two stimulating situations, these are 
represented by two sets $S_1$ and $S_2$, which may have elements in
common. 

There have been few parametric studies of stimulus compounding, 
so formal representations of such experiments are more definitely 
provisional. Each independently manipulable stimulus in an experi- 
ment is represented by a set of elements, but a combination of two 
stimuli cannot always be represented by the union (sum) of the cor- 
responding sets. If two stimuli belong to different modalities, then 
there is usually good reason to assume that a combination of the two 
can be represented by the union of the sets, but if two stimuli belong 
to the same modality, special problems (e.g., masking) may arise. The 
experiments so far conducted as tests of theoretical predictions on 
compounding [34, 78, 83] have used visual stimuli (signal lights or 
printed words) and the assumption of additivity has worked out satis- 
factorily. In the case of two auditory stimuli, however, we would not 
ordinarily expect combinations to be additive. 

Responses. So long as we are dealing only with behavior in a single 
stimulating situation, the representation of response variables is 
straightforward. For any unambiguous response classification provided 
by the experimenter, we can define a corresponding set of response 
classes. In a classical conditioning experiment, for example, we may 
represent by $A_1$ the class of behaviors which meet the criterion of a CR
during a prescribed interval after onset of a CS and by $A_2$ the class of
behaviors which do not. If two dependent variables are defined for a
given situation, we may require two independent response
classifications. Suppose that running time and frequency of correct choices are
to be recorded in a T-maze experiment. We first define, for all trials,
two mutually exclusive and exhaustive classes, $A_1$ and $A_2$,
corresponding to running and not-running, respectively;²² then for all trials on
which the animal runs the maze, we define a second, orthogonal 


22 In order to derive quantitative predictions about running time, we conceive
the maze path as a series of segments and define $A_1$ as the response of running from
one segment into the next. If there are k such segments along the correct path,
minimum running time will be the time required for k successive $A_1$ responses. Longer
running times will occur if the $A_1$ responses are interspersed with one or more $A_2$
responses.
classification of the runs into, say $A_1$ and $A_2$, those which terminate
in the left and right goal boxes, respectively. Whether or not the two 
dependent variables, running time and frequency of correct choices, 
are correlated is not a matter of definition; it is a matter to be pre- 
dicted by the theory and tested by experiment. 
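
The segment-by-segment treatment of running time sketched in the footnote can be made concrete under some simplifying assumptions that are not in the text: that each response, whether running ($A_1$) or not running ($A_2$), occupies the same time h, and that the probability p of $A_1$ is constant within a trial. The number of responses needed to complete k segments then follows a negative binomial distribution with mean k/p. All names and values below are hypothetical.

```python
import random


def minimum_running_time(k, h):
    """Time for k successive running responses, each taking time h."""
    return k * h


def expected_running_time(k, p, h):
    """Mean time to complete k segments when each interval yields A_1 with probability p."""
    return k * h / p


def simulate_running_time(k, p, h, rng):
    """One simulated run: count intervals until k running responses have occurred."""
    intervals = 0
    completed = 0
    while completed < k:
        intervals += 1
        if rng.random() < p:  # the animal runs into the next segment
            completed += 1
    return intervals * h


rng = random.Random(0)
runs = [simulate_running_time(4, 0.5, 0.25, rng) for _ in range(20000)]
print(minimum_running_time(4, 0.25), expected_running_time(4, 0.5, 0.25))
print(sum(runs) / len(runs))  # should lie close to the expected value
```

As p approaches unity the expected running time approaches the minimum, which is one way the theory can connect response probability to a latency measure.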

Even transfer can be represented, up to a point, without any special 
difficulty. Transfer involves two or more situations, corresponding to 
which are stimulus sets $S_1$, $S_2$, etc. Suppose there are two situations and
that we have defined the response classes $A_1, A_2, \ldots, A_r$ in the first
and $A'_1, A'_2, \ldots, A'_r$ in the second. Now we can speak of transfer
from the first situation to the second if any systematic change in the
probabilities of $A_1, A_2, \ldots, A_r$ which results from a period of
learning in the first situation is accompanied by some systematic change in
the probabilities of $A'_1, A'_2, \ldots, A'_r$. Only if we wish to go further and
treat not only transfer but positive transfer and negative transfer do
we run into complications. When we speak of positive transfer, or
generalization, we mean to imply that the same response which has its
probability increased in $S_1$ during learning will exhibit increased
probability in $S_2$ upon a subsequent test. Thus we have not only to
define response classifications in the two situations, but also to assume
that the classification is the same in the two cases; if we have, say,
reinforced response $A_1$ in $S_1$ and propose to test for generalization by
evaluating the probability of $A_1$ in $S_2$, we are assuming that the
criteria used to identify the class of behaviors corresponding to $A_1$
in the first situation are appropriate for identifying “the same
response” (i.e., the response to which positive transfer will occur) in a
second situation. In a few relatively well-standardized learning
experiments (e.g., eyelid conditioning, simple verbal learning) appropriate
criteria for response classification are established beyond much doubt.
In other cases, e.g., discrimination learning [cf. 28, pp. 6-7; 35, pp.
218-231], response criteria are highly controversial.

A statistical model can offer no special sources of insight into the 
ambiguous cases; what it should provide is more effective means for 
testing hypotheses pertaining to response classification. Assumptions 
that prove fruitful can be incorporated into a broader theory which 
will prescribe the conditions of application of particular mathematical 
models. Up to the present time, little progress in this direction can be 
reported, however. In the early stages of development of statistical 
and probabilistic learning theories (as in the case of all previous 
quantitative theories), it has seemed necessary to restrict attention to 
experimental situations in which ambiguities of response classification 
are minimal. 

Reinforcing events. In order to complete the formal representation 
of a learning experiment, the possible experimentally distinguishable 
outcomes of a trial must be classified according to their effects on
response probabilities. A class of outcomes which produce increments
in probability of response $A_j$ is represented by an element $E_j$ in the set
of “reinforcing events.” As in the case of stimuli and responses, our 
model has nothing to tell us about the nature of reinforcing events. 
It does, however, facilitate the formulation and testing of explanatory 
hypotheses which relate reinforcing events to other classes of events or 
operations. In any test situation, the role of an explanatory principle, 
e.g., the contiguity principle, is to generate a hypothesis specifying 
the empirical events which will reinforce a given response; the role of 
the statistical model is to generate testable predictions as to how learn- 
ing will proceed if the latter hypothesis is correct. Obviously no one 
test can be definitive. Repeated failures of prediction or repeated 
failures of an explanatory principle to generate testable hypotheses 
lead in time to rejection. The more common outcome, a mixture of 
successes and failures, indicates clearly enough that something is 
wrong, but not so clearly what should be done to improve matters. 
The rapid progress of reinforcement theory in recent years has been 
made possible by the fortunate circumstance that, as experimenters, 
we can identify a considerable number of experimental operations 
which serve regularly and reliably as reinforcing events even though, 
as theorists, we may not agree concerning the independently definable 
properties these reinforcers have in common. 

In the research program associated with statistical learning theory,
persistent attempts have been made to effect an independent definition
of reinforcing events in terms of contiguity-interference principles. The
principle of conditioning by contiguity, taken alone, appears simple
and straightforward enough: a response becomes conditioned to all
stimuli temporally contiguous²³ with the stimulus that evokes it. One
might think, then, that a reinforcing event could be defined as any
stimulus which originally evokes the to-be-reinforced response. But
this definition specifies neither sufficient nor necessary conditions for
an increment in probability of the response. It is not sufficient, for
even though the response is evoked in the presence of the appropriate


23 The precise definition of “contiguous” poses difficult problems. Originally,
following the lead of Guthrie [46], I interpreted contiguity to mean strict temporal
simultaneity. However, prolonged and strenuous attempts to develop a theory of
reinforcement based on this interpretation have convinced me that no such theory can
be constructed without the addition of further assumptions. At present, my inclination
is to modify the interpretation of temporal contiguity in the direction of Hull’s stimulus
trace hypothesis [55, 56]. With this modified interpretation, we assume, in effect,
that the response which occurs at any time becomes connected to any stimulus
elements sampled during a short preceding interval (of the order of 0.5 sec). More
rigorous quantification of the “trace” interpretation is desirable but has not yet been worked
out in detail.
stimulus, if a cue for some competing response occurs before the first
stimulus is terminated, the newly formed association will suffer inter- 
ference. And the condition specified by the definition is not necessary, 
for a reinforcing event may consist in the termination rather than the 
onset of stimulation (as, for example, in the case of an escape learning 
experiment). 

Upon reviewing the many attempts to formulate simple, closed 
explanatory principles of reinforcement and the still more numerous 
attempts to test the formulations, I have been led strongly to suspect 
that no hypothesis as simple or unitary as those envisaged by con- 
temporary contiguity or drive-reduction theorists is going to prove 
adequate. It appears, rather, that associative learning depends upon a 
complex of causal factors or conditions. The experimental operation 
we refer to as a reinforcement for a given response in a given situation 
is one which supplies the factor or factors missing from the complex 
needed for acquisition in that situation. 

The two reasonably well-established principles of reinforcement 
derivable from contiguity theory may be summarized as follows. Other 
things equal, the likelihood that an experimental operation will 
reinforce a given response is directly related to (a) the probability that 
stimuli associated with the operation will evoke behaviors that do not 
interfere with the given response and (b) the extent to which stimula- 
tion effective at the time of evocation of the response is terminated 
immediately after occurrence of the response on each trial. These 
principles will enable us to make a priori predictions about the rein- 
forcing effects of events or procedures when, and only when, we are in a 
position to state precisely how the events or procedures will change the 
stimulus situation prevailing when the to-be-reinforced response occurs 
and precisely what behaviors will be evoked by stimuli occurring simul- 
taneously with or following that response. When we do not have the 
necessary information, we cannot make definite predictions, but we 
may still be able to formulate testable hypotheses. Finally, I should like 
to emphasize that I do not expect these two principles alone to pre- 
scribe adequately the necessary and sufficient conditions for reinforce- 
ment, except perhaps in a few of the simplest conditioning situations, 
even when we have enough information to apply the principles 
without ambiguity. 

Experimenter-determined variables. Once we have specified the set- 
theoretical representation of a learning experiment, we are in a posi- 
tion to prescribe the values of at least two and usually three of the 
theoretical variables listed in Table 4. The variable r will be the num- 
ber of response classes and also the number of possible reinforcing 
events; K will be the number of stimulus sets. The variable $\pi_k$,
probability of reinforcing event $E_k$, is usually, but not always,
experimenter-determined. In a given experiment, $E_k$ might, for example,
represent a signal, unequivocally discriminable by the subject,
indicating that response $A_k$ is correct. Then the value of $\pi_k$, probability
that the signal is given on any trial, is prescribed by the experimenter.
In the same type of experiment one might, however, present the
signal indicating correctness of $A_k$ on every trial but make it so faint,
or the “noise” level so high, that there is only probability $\pi_k$ that the
subject will detect the signal. Then $\pi_k$ would be a subject-controlled
variable, and its value would have to be determined either from a
psychophysical experiment or from the data of the learning experiment
itself.
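
As a sketch of how an experimenter-determined $\pi$ enters the learning theorems, consider the single-stimulus, two-alternative case with equal $\theta$ for all elements, and assume for illustration only that $E_1$ occurs with constant probability $\pi$ on each trial and $E_2$ otherwise. Averaging the two branches of the learning theorem for a reinforced and a nonreinforced response then gives the expected probability of $A_1$:

```latex
\begin{aligned}
\bar{p}_n &= \pi\left[(1-\theta)\,\bar{p}_{n-1} + \theta\right]
           + (1-\pi)\,(1-\theta)\,\bar{p}_{n-1} \\
          &= (1-\theta)\,\bar{p}_{n-1} + \theta\pi , \\[4pt]
\bar{p}_n &= \pi - (\pi - \bar{p}_1)\,(1-\theta)^{\,n-1} .
\end{aligned}
```

Under these assumptions the expected response probability approaches $\pi$ as n grows, so the value prescribed by the experimenter fixes the asymptote while $\theta$ governs only the rate of approach.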

The remaining items in Table 4 are all subject-controlled variables 
in the sense that their values must be ascertained from behavioral data. 
In order to complete the linkage between the theory and any particu- 
lar experiment, one must derive experimental laws which contain 
subject-controlled variables as dependent variables or parameters. 
Formally, these laws may be theorems especially derived for a par- 
ticular application of the theory or they may be general theorems with 
values of the experimenter-controlled variables appropriate to the 
particular application. In the next section, we shall illustrate the 
derivation of experimental laws for a number of important applica- 
tions and discuss the final step of estimating parameter values and 
comparing theoretical with observed relationships. 

Derivation of experimental laws. Deriving theorems from a 
mathematical theory may seem at first thought to be an almost 
mechanical process. So far as the actual derivations are concerned, 
this impression is quite correct. It is intended that all theorems, or 
laws, of the theory should be strict logical consequences of the assump- 
tions. Still, in practice the derivation of laws is far from mechanical, 
and in fact, may involve a high degree of art. From any mathematical 
theory an infinite number of theorems can be derived, but only a few of 
these are of any scientific interest. One of the artistic, or intuitive, 
functions of the scientist is to diagnose empirical situations and sense 
occasions on which a theory can fruitfully be applied. A second intui- 
tive function is to foresee clearly enough what a theory will have to 
say about a given empirical problem so that the appropriate mathe- 
matical derivations can be initiated. A third is to make simplifying 
assumptions that are necessary for any particular application of a 
theory without distorting the problem under investigation. 

In the remainder of this section, I shall sketch at a rather non- 
technical level some of the techniques of derivation used to generate 
important classes of experimental laws, and at the same time I shall
try to illustrate at least the third of the intuitive functions mentioned
above. The presentation will necessarily involve more mathematical 
notation than the preceding portions of this article, and the reader 
with a low tolerance for symbols and equations may be well advised 
to skip ahead to the following section, which is concerned with more 
general matters of methodology. 

Relations among response measures. Theorems a and b under the head- 
ing Response model, above, are already in form for direct application 
to any situation in which probability of a given response is assumed 
constant (as might, e.g., be the case at the asymptote of learning). 
Taken together, they overdetermine the variable p_j, and therefore,
provide a consistency check on the theory. The first of these theorems
states that the expected number of occurrences of a response A_j in n
trials is equal to np_j. Since n is known for any experiment, once we
have computed the observed mean number of occurrences of A_j we
can set this empirically determined quantity equal to np_j and solve for
p_j. If, e.g., the mean number of occurrences of response A_j in a block of
50 trials were 35, our estimate of p_j would be .70. Having estimated p_j,
we would be in a position to make a genuine prediction. According to
the second theorem, the probability that A_j occurs for the first time on
the nth trial is equal to (1 − p_j)^(n−1) p_j. Using our estimate of p_j
obtained by application of the first theorem, we could predict the exact
frequency distribution for “first occurrences.” If the estimate p_j = .70
in the numerical illustration were based on 100 blocks of 50 trials
(using either the same subject or a group of subjects with like values of
p_j) we could predict that in .70 × 100 = 70 instances A_j would occur
as the first response in a block; in .30 × .70 × 100 = 21 instances A_j
would occur for the first time on the second trial of the block; and
so on.
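
The two steps just described (estimate p_j from the mean count, then predict the distribution of first occurrences) can be sketched in a few lines of Python. The function names are illustrative assumptions and not part of the theory; the numbers are those of the illustration above.

```python
def estimate_p(mean_occurrences: float, n_trials: int) -> float:
    """First theorem: set the observed mean equal to n * p and solve for p."""
    return mean_occurrences / n_trials


def first_occurrence_counts(p: float, n_blocks: int, max_trial: int) -> list:
    """Second theorem: expected number of blocks in which the response
    first occurs on trial 1, 2, ..., max_trial, i.e. n_blocks * (1 - p)**(n - 1) * p."""
    return [n_blocks * (1 - p) ** (n - 1) * p for n in range(1, max_trial + 1)]


p_hat = estimate_p(35, 50)
counts = first_occurrence_counts(p_hat, n_blocks=100, max_trial=3)
print(p_hat, [round(c, 1) for c in counts])  # 0.7 [70.0, 21.0, 6.3]
```

Comparing these predicted counts with the observed frequency distribution of first occurrences gives the consistency check mentioned in the text.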

The same two theorems provide a basis for deriving laws relating 
to latencies or response time. Consider, for example, a situation in 
which the subject makes successive choices between two responses, A_1
and A_2, that require h_1 and h_2 sec, respectively, to execute; latency is
defined as time from onset of the stimulus to completion of the first A_1
response. The probability of a latency of exactly L = (n − 1)h_2 + h_1
sec is equal to (1 − p_1)^(n−1) p_1, and it can easily be shown [19; 13, pp.
311-313] that the predicted mean latency is


$$L = h_2 \, \frac{1 - p_1}{p_1} + h_1$$


If h_1 = h_2 = h, this relation simplifies to


$$L = \frac{h}{p_1}$$


Time-dependent phenomena. Numerous functions describing changes
in response probability during retention intervals follow directly from
theorem a under Stimulus fluctuation model, above. According to this
theorem, if the component of a stimulating situation corresponding
to any given stimulus element has probability f(0) of being present in
the situation (available for sampling) during a given learning trial,
then it has probability f(t) of being present in the situation on a test
trial after a rest interval t, where


$$f(t) = J - [J - f(0)]\,a^{t}$$


and J and a are parameters with values between zero and unity.

Now consider a learning experiment involving a single stimulating
situation; the latter is represented formally by a set S* with subsets S
and S’ corresponding to the elements of S* which are and are not,
respectively, available for sampling by the subject during any single
learning period. Suppose that at the end of the first acquisition period
all elements of S and no elements of S’ are connected to response A_j.
Then f(0) equals unity and if all elements have equal sampling
probabilities, f(t) will be equal to the mean proportion of elements in S
connected to A_j after time t and therefore to p_j(t), the probability of A_j
in the situation after time t. Making the appropriate substitutions in
the formula of theorem a under Stimulus fluctuation model, we have
the function


$$p_j(t) = f(t) = J + (1 - J)\,a^{t}$$


describing the predicted course of spontaneous regression, or short-
term forgetting.

Now the dependent variable p_j(t) can be evaluated in any
appropriate manner. We might, for example, test either the same
subjects or different groups of subjects after different intervals t and take
the observed proportion of A_j responses at each interval as an estimate
of p_j(t). Then by curve-fitting procedures we could determine, in turn,
the values of the parameters J and a which produce best agreement,
according to an appropriate criterion, between the empirical and
theoretical curves of p_j(t).
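
As a rough sketch of such a curve-fitting procedure, the following Python code searches a grid of (J, a) values for the pair that minimizes squared error against the regression function p_j(t) = J + (1 − J)a^t. Least squares as the "appropriate criterion", the grid search itself, and the synthetic data are all assumptions made for illustration; no empirical values are implied.

```python
def regression_curve(t: float, J: float, a: float) -> float:
    """Predicted spontaneous-regression curve: J + (1 - J) * a**t."""
    return J + (1 - J) * a ** t


def fit_J_a(observations, steps: int = 100):
    """Grid search over 0 < J < 1 and 0 < a < 1 for the least-squares pair.

    observations: iterable of (t, observed proportion of A_j) pairs.
    """
    observations = list(observations)
    best = None
    for i in range(1, steps):
        J = i / steps
        for k in range(1, steps):
            a = k / steps
            sse = sum((regression_curve(t, J, a) - p) ** 2 for t, p in observations)
            if best is None or sse < best[0]:
                best = (sse, J, a)
    return best[1], best[2]


# Synthetic, noise-free data generated from J = .40 and a = .50 (made up).
data = [(t, regression_curve(t, 0.40, 0.50)) for t in (0, 1, 2, 4, 8)]
J_hat, a_hat = fit_J_a(data)
print(J_hat, a_hat)  # 0.4 0.5
```

With real proportions the minimum will not be zero, and a finer grid or a numerical optimizer would normally replace the coarse search used here.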

Spontaneous recovery from extinction can be handled similarly. 
Suppose that conditioning has been continued over several periods 
until all elements in S* are connected to A_j, and then extinction is
carried out during a single period until p_j goes to zero. At the
termination of extinction, all elements in S’ but none in S are connected to A_j.
Reasoning as in the preceding paragraph, we can set f(0) equal to
zero in the formula of theorem a under Stimulus fluctuation model,
and, substituting p_j(t) for f(t), obtain for the predicted curve of
spontaneous recovery


$$p_j(t) = J\,(1 - a^{t})$$


Since the same parameters, J and a (both derived from j and j’ of 
Table 4), appear in the functions for both regression and recovery, it is 
possible to make quantitative predictions for one situation after the 
parameters have been evaluated in the other [25, 54]. Functions 
applicable to various other distributional phenomena have been 
derived from the statistical theory [25, 26, 40, 54] by the same general 
methods illustrated here. 
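
To illustrate how parameters evaluated in one situation carry over to the other, the sketch below computes both predicted curves from a single (J, a) pair. The parameter values are hypothetical, standing in for estimates obtained from a regression experiment.

```python
def regression(t: float, J: float, a: float) -> float:
    """After conditioning within one period (f(0) = 1): J + (1 - J) * a**t."""
    return J + (1 - J) * a ** t


def recovery(t: float, J: float, a: float) -> float:
    """After extinction within one period (f(0) = 0): J * (1 - a**t)."""
    return J * (1 - a ** t)


J, a = 0.40, 0.50  # hypothetical values, as if estimated from regression data
for t in (0, 1, 2, 4, 8):
    print(t, round(regression(t, J, a), 3), round(recovery(t, J, a), 3))
```

Both curves approach J as t grows, and at every t they differ by exactly a^t, which offers a simple cross-check between the two kinds of experiment.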

Stimulus manipulation. The simplest type of experiment on stimulus 
compounding provides the most direct means for evaluating sizes of 
stimulus sets. Suppose that we wish to compare the numbers of
elements, N_1 and N_2, in the sets S_1 and S_2 corresponding to two stimuli
which are assumed on empirical grounds to have no common elements.
In the interest of simplicity it will be convenient also to suppose we
are dealing with a situation for which it is reasonable to assume equal
θ values over both sets. Our first step is to give appropriate training
to experimental subjects until probability of some response A_j is
brought approximately to unity in S_1 and zero in S_2. (Successful
achievement of this step will corroborate our assumption that the sets
are nonoverlapping.) Then we will present the two stimuli together
and evaluate the probability of A_j in the presence of the combination.
By theorem c under Stimulus-response relations, above, p_j should be
related to N_1 and N_2 by the relations


$$p_j = \frac{N_1}{N_1 + N_2}$$

and

$$1 - p_j = \frac{N_2}{N_1 + N_2}$$

from which it obviously follows that

$$\frac{p_j}{1 - p_j} = \frac{N_1}{N_2}$$


Substituting observed values for p_j and 1 − p_j, we have a numerical
estimate of the relative size of sets S_1 and S_2.
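
As a small worked sketch of this estimate, the function below converts an observed probability of A_j to the compound into the ratio N_1/N_2 = p_j/(1 − p_j). The function name and the observed value .75 are hypothetical.

```python
def relative_set_size(p_compound: float) -> float:
    """Estimate N_1 / N_2 from the probability of A_j to the compound stimulus."""
    if not 0 < p_compound < 1:
        raise ValueError("p_compound must lie strictly between 0 and 1")
    return p_compound / (1 - p_compound)


ratio = relative_set_size(0.75)  # hypothetical observed proportion
print(ratio)  # 3.0
```

An observed proportion of .75 would thus suggest that S_1 contains three times as many elements as S_2.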

This technique can be used to prepare a rigorous quantitative test 
of our stimulus model. By repeated application of the technique, we 
select a set of nonoverlapping stimuli for which the corresponding sets,
S_1, S_2, . . . , S_K, are all of the same size, i.e.,


$$N_1 = N_2 = \cdots = N_K = N$$


Then by appropriate training [see 83 for an account of an actual
experiment of this type] we bring probability of a response A_j to
approximately unity in the presence of some of the stimuli, say the
odd-numbered ones, and to zero in the presence of the remainder. Now by
applying theorem c under Stimulus-response relations we can make
exact a priori predictions of response probabilities in the presence of
any desired combination; e.g., given a combination of S_1, S_2, and S_3,


$$p_j = \frac{N_1 + N_3}{N_1 + N_2 + N_3} = \frac{2N}{3N} = \frac{2}{3}$$


or given the combination S_1, S_2, S_4, S_6, S_8,


$$p_j = \frac{N_1}{N_1 + N_2 + N_4 + N_6 + N_8} = \frac{N}{5N} = \frac{1}{5}$$
and so on. 
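
The prediction for any compound can be computed mechanically. The following sketch assumes equal θ values, nonoverlapping sets, and training carried to probabilities of unity and zero, as in the text; the function name, the set size of 10, and the second compound shown are hypothetical.

```python
from fractions import Fraction


def compound_probability(sizes, conditioned, presented):
    """Proportion of elements in the presented sets that are connected to A_j.

    sizes: dict mapping set index -> number of elements N_i
    conditioned: indices of sets whose elements are all connected to A_j
    presented: indices of the sets presented together
    """
    total = sum(sizes[i] for i in presented)
    connected = sum(sizes[i] for i in presented if i in conditioned)
    return Fraction(connected, total)


# Eight hypothetical nonoverlapping sets of equal size; odd-numbered sets conditioned.
sizes = {i: 10 for i in range(1, 9)}
odd = {1, 3, 5, 7}
print(compound_probability(sizes, odd, [1, 2, 3]))     # 2/3
print(compound_probability(sizes, odd, [1, 3, 4, 6]))  # 1/2
```

Exact fractions are used so that predictions such as 2/3 are not blurred by rounding before they are compared with observed proportions.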

For a second example of predicting consequences of stimulus 
manipulations, let us consider an experiment on stimulus generalization.
Suppose that we condition a response A_j to a stimulus, say a light
of wavelength 500 mμ. Assume that the probability of A_j in the
presence of the CS is zero before conditioning and unity at the end of
conditioning. Now suppose that we test with lights of 520 and 540 mμ
and obtain .90 and .60 as empirical estimates of p_j in the presence of
the two test stimuli. Assuming that the stimulus set S_500, corresponding
to the CS, comprises N elements all of which were connected to A_j at
the end of conditioning, we conclude that the sets S_520 and S_540,
corresponding to the test stimuli, have 90 and 60 per cent, respectively,
of their elements in common with S_500. Having established these
correspondences between empirical and theoretical variables, we are
ready to test the model by making some predictions. We might, for
example, replicate the experiment with new subjects but this time
carry conditioning only far enough so that p_j = .5 in the presence of
the CS. We could predict by application of theorem c under Stimulus-
response relations that the probabilities of A_j in the presence of the
test lights should be, for S_520,


$$p_j = .5 \times .90 = .45$$


and for S_540,


$$p_j = .5 \times .60 = .30$$


Although this example is purely hypothetical, the predictions do not 
seem implausible in the light of data reported by Guttman and 
Kalish [49]. 
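
The arithmetic of this hypothetical example can be written out as a short sketch. It assumes, as the example does, that elements of a test set lying outside S_500 remain unconnected to A_j, so that the predicted probability is the CS probability multiplied by the proportion of shared elements. The names are illustrative only.

```python
def predict_test_probability(p_cs: float, overlap: float) -> float:
    """Predicted probability of A_j to a test stimulus.

    p_cs: probability of A_j in the presence of the CS
    overlap: proportion of the test set's elements shared with the CS set
    """
    return p_cs * overlap


# Overlaps estimated from the first (fully conditioned) experiment, where p_cs = 1.
overlaps = {520: 0.90, 540: 0.60}
predictions = {wavelength: predict_test_probability(0.5, ov)
               for wavelength, ov in overlaps.items()}
print(predictions)  # {520: 0.45, 540: 0.3}
```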

Simple learning. For purposes of theoretical treatment the simplest 
learning experiments are those involving two alternative response
classes and determinate reinforcement. By the latter we mean that a
reinforcing event corresponding to one or the other of the defined 
response classes occurs on every trial, and that we can specify in 
advance of a trial the probability that any given response will be 
followed by any given reinforcing event. 

From our previous discussions of reinforcement and contiguity 
theory it follows that acquisition of a runway response with 100 per 
cent reinforcement should represent one of the simplest determinate 
cases. The animal’s behavior during any time interval is categorized 
into two classes, running and not-running, and the former is reinforced 
at the termination of each experimental trial. If we make the simplifying
assumption that all stimulus elements in the set corresponding to
the situation which begins each trial have equal sampling probabilities
(“equal θ case”), then by theorem b under Learning, above, the
probability of running changes from one trial to the next in accordance
with the relation


$$p_{n+1} = (1 - \theta)\,p_n + \theta$$


As it stands, this difference equation is not convenient for experimental
application; however, we can prove by mathematical induction that if
the difference equation holds for any pair of successive trials, then over
a series of trials response probability must change according to the
formula


$$p_n = 1 - (1 - p_1)(1 - \theta)^{n-1}$$


Now by substituting for p_n in this function its equivalent in terms of
mean latency from the relation L = h/p given in Part II above under
Formulation of a Probabilistic Response Model, we finally obtain a
function


$$L_n = \frac{h}{1 - (1 - h/L_1)(1 - \theta)^{n-1}}$$


suitable for experimental application. The values of h and θ can be
determined by curve-fitting procedures from an empirical curve of
mean latency against trials. With estimates of h and θ in hand, we can
then generate independent tests of the theory by deriving other func- 
tions containing these parameters to predict changes in variances or 
other statistics of the latency distribution during the acquisition series. 
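
The relation between the trial-to-trial difference equation, its closed-form solution, and the latency function can be checked numerically. The sketch below assumes the forms p_{n+1} = (1 − θ)p_n + θ, p_n = 1 − (1 − p_1)(1 − θ)^(n−1), and L_n = h/p_n; the parameter values are hypothetical and chosen only for illustration.

```python
def iterate_difference_equation(p1: float, theta: float, n: int) -> float:
    """Apply p_{k+1} = (1 - theta) * p_k + theta a total of n - 1 times from p1."""
    p = p1
    for _ in range(n - 1):
        p = (1 - theta) * p + theta
    return p


def closed_form(p1: float, theta: float, n: int) -> float:
    """p_n = 1 - (1 - p1) * (1 - theta)**(n - 1)."""
    return 1 - (1 - p1) * (1 - theta) ** (n - 1)


def mean_latency(L1: float, h: float, theta: float, n: int) -> float:
    """L_n = h / (1 - (1 - h / L1) * (1 - theta)**(n - 1)), using L = h / p."""
    return h / (1 - (1 - h / L1) * (1 - theta) ** (n - 1))


# Hypothetical values: h = 2 sec per response, initial mean latency 20 sec.
h, L1, theta = 2.0, 20.0, 0.2
for n in (1, 5, 10, 20):
    p_n = closed_form(h / L1, theta, n)
    # The closed form must agree with repeated application of the trial operator.
    assert abs(p_n - iterate_difference_equation(h / L1, theta, n)) < 1e-12
    print(n, round(p_n, 3), round(mean_latency(L1, h, theta, n), 2))
```

The latency curve starts at L_1 and declines toward h, the time required for a single response, as p_n approaches unity.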

A second example of determinate reinforcement is provided by the 
verbal conditioning experiment. In the two-choice version, the sub- 
ject responds to the ready signal on each trial by predicting which of 
two reinforcing events, E_1 or E_2, will occur. Again making the equal θ
assumption, we can apply theorem b under Learning, above, to obtain
the expected probability of A_1 (the response of predicting E_1) on trials
following occurrences of E_1 or E_2: if E_1 occurs on trial n − 1,


$$p_{1,n} = (1 - \theta)\,p_{1,n-1} + \theta$$

if E_2 occurs on trial n − 1,

$$p_{1,n} = (1 - \theta)\,p_{1,n-1}$$


where, as usual, p_{1,n} represents probability of A_1 on trial n. From these
difference equations, or “trial operators,” we can predict that the
probability of A_1 will increase following reinforcement of A_1 and
decrease following reinforcement of A_2, but we are not yet in a position to
make any predictions about the over-all course or limit of learning
under any particular reinforcement schedule.

Starting with the trial operators, we can, however, make a start in 
the desired direction by deriving a difference equation for the average 
change in response probability per trial under any given conditions. 
Suppose the design of an experiment prescribes noncontingent,
random reinforcement with E_1 and E_2 having probabilities π and 1 − π,
respectively, on any trial. Then to obtain the average change in
probability of A_1 on a trial, we need only weight each of the trial
operators by its probability of application and add them, viz., on the
average,

$$p_{1,n} = \pi\,[(1 - \theta)\,p_{1,n-1} + \theta] + (1 - \pi)(1 - \theta)\,p_{1,n-1}
         = (1 - \theta)\,p_{1,n-1} + \theta\pi$$

It can be shown by mathematical induction that if this equation is
applied repeatedly the resulting curve of probability against trials is
given by


$$p_{1,n} = \pi - (\pi - p_{1,1})(1 - \theta)^{n-1}$$


a negatively accelerated function which has π as an asymptote
regardless of the initial probability p_{1,1} or of the value of θ. As we have
indicated in Part II, considerable research has been directed toward
determining experimental conditions for which curves of learning
under new reinforcement schedules are predictable once θ has been
evaluated from data obtained under some one schedule. 
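
One way to see that averaging the two trial operators yields the stated curve is to simulate them directly. The sketch below applies the operators to many hypothetical subjects under noncontingent reinforcement and compares the mean probability on each trial with π − (π − p_{1,1})(1 − θ)^(n−1). The seed, the parameter values, and the number of simulated subjects are arbitrary assumptions.

```python
import random


def expected_curve(pi: float, theta: float, p_start: float, n: int) -> float:
    """Mean probability of A_1 on trial n: pi - (pi - p_start) * (1 - theta)**(n - 1)."""
    return pi - (pi - p_start) * (1 - theta) ** (n - 1)


def simulate_mean(pi, theta, p_start, n_trials, n_subjects, seed=0):
    """Average response probability per trial over simulated subjects."""
    rng = random.Random(seed)
    totals = [0.0] * n_trials
    for _ in range(n_subjects):
        p = p_start
        for trial in range(n_trials):
            totals[trial] += p               # probability in effect on this trial
            if rng.random() < pi:            # E_1 occurs
                p = (1 - theta) * p + theta
            else:                            # E_2 occurs
                p = (1 - theta) * p
    return [total / n_subjects for total in totals]


means = simulate_mean(pi=0.7, theta=0.1, p_start=0.5, n_trials=60, n_subjects=2000)
for n in (1, 10, 30, 60):
    print(n, round(means[n - 1], 3), round(expected_curve(0.7, 0.1, 0.5, n), 3))
```

Because reinforcement here does not depend on the response, the simulated mean should track the expected curve closely and level off near π.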

In more complicated cases, e.g., those in which probability of a
given reinforcing event on any trial is contingent upon the response 
made on the same or a preceding trial, or those in which probabilities 
of reinforcement vary in some systematic manner from trial to trial, 
the procedure for deriving experimental laws is essentially the same as 
that illustrated here for the simple noncontingent case [see 30 for 
details]. 




Discrimination learning. For the simplest discrimination experiments, 
i.e., those characterized by determinate reinforcement and by
controlled stimulus exposures which are independent of the organism’s
behavior, we should be able to derive predictive laws from the theory
already presented. Suppose a discrimination is to be formed between
two stimulus situations, S_I and S_II, which are presented in a random
sequence with equal relative frequencies. Response A_1 is to be
reinforced whenever S_I is present on a trial and A_2 whenever S_II is present.
To conceptualize this experiment we require three stimulus sets: S_1
representing stimulation available only in S_I; S_2 representing
stimulation available only in S_II; and S_c representing stimulation available on
all trials. In the terminology used by a number of recent investigators
[see, e.g., 67, 80] S_1 and S_2 represent relevant cues and S_c irrelevant
cues. For simplicity, we shall assume equal θ values for all elements in
all three sets; therefore, the probability that an element in S_c will be
sampled on any trial is equal to θ, and the probability that an element
in S_1 or S_2 will be sampled on any trial is equal to θ/2. Now beginning
with theorem b under Learning, above, we can proceed as in the
preceding section and obtain expressions for the proportions of
elements in S_1, S_c, and S_2, respectively, that are connected to A_1 after
n − 1 discrimination trials:


$$p_{11,n} = 1 - (1 - p_{11,1})\left(1 - \frac{\theta}{2}\right)^{n-1}$$

$$p_{c1,n} = \frac{1}{2} - \left(\frac{1}{2} - p_{c1,1}\right)(1 - \theta)^{n-1}$$

and

$$p_{21,n} = p_{21,1}\left(1 - \frac{\theta}{2}\right)^{n-1}$$


where p_{ij,n} is to be read “proportion of elements in set S_i connected to
A_j on trial n.” In order to predict response probability in either S_I or
S_II, we must take into account the sizes of the three stimulus sets, N_1,
N_c, and N_2. It will be convenient to introduce the abbreviations


$$w_1 = \frac{N_1}{N_1 + N_c}$$

and

$$w_2 = \frac{N_2}{N_2 + N_c}$$


for the proportions of “relevant cues” in the two situations. Now ap- 
plication of theorem c under Stimulus-response relations yields the 
desired expressions for probability of A_1 in situations S_I and S_II as a
function of number of trials:


$$p_{I1,n} = w_1\,p_{11,n} + (1 - w_1)\,p_{c1,n}$$

and

$$p_{II1,n} = w_2\,p_{21,n} + (1 - w_2)\,p_{c1,n}$$

If the initial values p_{11,1}, p_{21,1}, and p_{c1,1} are all assumed equal to
1/2, these expressions reduce to the simple, negatively accelerated
“growth” and “decay” functions


$$p_{I1,n} = \frac{1}{2}(1 + w_1) - \frac{w_1}{2}\left(1 - \frac{\theta}{2}\right)^{n-1}$$

$$p_{II1,n} = \frac{1}{2}(1 - w_2) + \frac{w_2}{2}\left(1 - \frac{\theta}{2}\right)^{n-1}$$


Thus it turns out that the predicted asymptotes, but not the slopes, 
of discrimination learning curves depend on the proportions of
common elements in the stimulus sets. As the proportion of common
elements tends to zero, w_1 and w_2 tend to unity, and the curves of p_{I1,n} and
p_{II1,n} tend to asymptotes of unity and zero, respectively. Empirical
estimates of the parameters w_1 and w_2 are obtainable directly from the
observed asymptotic relative frequencies of A_1 responses in S_I and S_II,
respectively. Once the parameters have been evaluated for a given
situation, we can derive a priori predictions for learning under
modified conditions (e.g., with unequal relative frequencies of presentation
of S_I and S_II or with partial reinforcement in one or both situations).
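
The discrimination curves can be sketched in code under the assumptions stated above: equal θ values, equal presentation frequencies, relevant cues sampled with probability θ/2 per trial, and common cues with probability θ. The inversion of the asymptotes, (1 + w_1)/2 and (1 − w_2)/2, into estimates of w_1 and w_2 follows from those formulas with initial values of 1/2. Function names and parameter values are hypothetical.

```python
def discrimination_probabilities(w1, w2, theta, n, p11=0.5, p21=0.5, pc1=0.5):
    """Return (P(A_1 | S_I), P(A_1 | S_II)) on trial n."""
    relevant = (1 - theta / 2) ** (n - 1)   # relevant cues: sampled on half the trials
    common = (1 - theta) ** (n - 1)         # common cues: sampled on every trial
    p11_n = 1 - (1 - p11) * relevant        # S_1 elements, always reinforced toward A_1
    p21_n = p21 * relevant                  # S_2 elements, always reinforced toward A_2
    pc1_n = 0.5 - (0.5 - pc1) * common      # S_c elements, A_1 reinforced half the time
    return (w1 * p11_n + (1 - w1) * pc1_n,
            w2 * p21_n + (1 - w2) * pc1_n)


def weights_from_asymptotes(asym_I, asym_II):
    """Invert the asymptotes (1 + w1)/2 and (1 - w2)/2 to estimate w1 and w2."""
    return 2 * asym_I - 1, 1 - 2 * asym_II


w1, w2, theta = 0.8, 0.6, 0.1  # hypothetical values
for n in (1, 10, 50, 200):
    p_I, p_II = discrimination_probabilities(w1, w2, theta, n)
    print(n, round(p_I, 3), round(p_II, 3))
```

With these values the curves start at .5 and approach .9 and .2, so observed asymptotes of .9 and .2 would return the estimates w_1 = .8 and w_2 = .6.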


METHODOLOGY 


Stages in theory construction. Having just emerged from the 
rough-and-ready business of everyday theory construction long enough 
for the analytical interlude of the preceding pages, I find the tidy and 
formal appearance of the system outlined in Part III under Formal 
Structure of Statistical Learning Theory as unfamiliar as that of a 
small boy in a Sunday suit. The formal structure was not blueprinted 
in advance; hence it must have evolved via a series of approximations. 
Reviewing the semihistorical account given in Part II, and imposing a 
certain measure of abstraction and idealization on the disorderly flow 
of events, we can distinguish the following phases in the cycle of opera- 
tions that has generated the present theory. 

Exploration. As the statistical approach attempts invasion of each 
new empirical area, we have to explore alternative modes of descrip- 
tion and measurement until we succeed in isolating simple empirical 
relations which exhibit symptoms of reproducibility and generality. 
The rapid growth of statistical learning theories from about 1948 has 
been possible only because these exploratory analyses were essentially 
completed for a number of empirical situations by reinforcement 
theorists, especially Skinner and Hull, during the preceding decade.

Conceptualization. Once promising empirical variables have been
identified, we require a conceptual representation which will enable us 
to use mathematical reasoning in working out the consequences of 
assumptions about relations among the variables. The general method 
of conceptualization we have found fruitful is to represent collections 
of empirical events by mathematical sets and empirical relations by 
relations and functions on the sets. 

Axiomatization. The best constructs do not make a theory until they 
are interrelated by definite assumptions. Not all the assumptions on 
the roster are on an equal footing at any given time, however. Always 
some have merely the character of tentative hypotheses that will be 
dropped at a moment’s notice if they fail to produce. Others have sur- 
vived enough trials and tests to acquire tenure and will be the last to 
be relinquished when the theory runs into factual difficulties. The as- 
sumptions with most seniority are regarded as axioms when we turn to 
structural analysis of the theory, but we do not ordinarily attempt to 
achieve quite the degree of formalism that the logician associates with 
axiomatization. 

Although there can be no limitations in principle on the sources of 
a theorist’s hypotheses, strategic considerations may lead to some 
restrictions in practice. The investigators who have had a part in the 
development of statistical learning theories share a strong distrust 
of speculative theorizing. Like many of our fellows, we find it difficult 
enough to test theories that arise in close conjunction with experi- 
mental analyses, and usually beyond our capacities to test those that do 
not. A distaste for untestable theories does not, however, by any means 
consign us to a permanent role as passive followers of paths set by the 
experimentalist. When formulating theoretical assumptions we prefer 
to begin with severely limited hypotheses, often representing no more 
than simple smoothing and extrapolation of observed relations among 
empirical variables. After a theory has begun to take shape, we expect 
to derive predictions of new empirical phenomena and then have our 
turn at setting the pace for the experimenter. 

Mathematical analysis. Once assumptions have been stated in mathe- 
matical form, it is necessary to analyze the formal system with respect 
to internal consistency and fruitfulness in generating a multiplicity 
of relationships among the theoretical variables. The end result of this 
phase, often attained only after many revisions of the formal system, is 
a body of theorems which, once appropriately interpreted, may prove 
to be experimental laws. Now we have a model, i.e., a theory that 
describes a class of idealized situations in which only the variables of 
the theory play a role. The question remains how well the model will 
approximate actual experimental situations. 


Empirical test. The empirical facts or problems that give rise to a
particular line of theorizing usually mark also the point to which we 
return for preliminary assay of the theoretical product. We do not, 
however, consider that we have really tested a theory until we have 
established some reasonably firm rules of correspondence between 
theoretical and empirical variables and then have gone on to predict 
new empirical phenomena. 

Exploitation. Once the theory has begun to acquire some empirical 
standing, it can begin to serve us as a theory should. We always see 
facts in the light of theories and always hope to see them more clearly 
with better theories. A useful theory not only guides experimental 
designs and interpretations of data, but also provides machinery for 
formulating and testing new hypotheses. 

Generalization and extension. One of the distinctive characteristics 
of quantitative theories is a constant pressure in the direction of in- 
creasing generality. Each restriction and simplifying assumption 
imposed at a given stage of development serves as a challenge to 
further investigation. Whenever a restriction is removed, the outcome 
is a broader theory which includes its predecessor as a special case. 
A second medium of extension and generalization is the incorporation 
into the theory of hypotheses which have been formulated in terms of 
the theoretical model and confirmed by subsequent experimental tests. 

Of necessity, statistical learning theories have had their empirical 
origin in the simpler and better-analyzed experimental situations. It is 
only in these situations that one can hope to make apt initial choices of 
empirical variables and relationships. Further, it is likely that the 
statistical approach to learning theory will continue indefinitely to
devote considerable attention to the most elementary forms of learning. 
One of the reasons for this prognosis is the recognition that empirical 
simplicity is relative to the current degree of refinement of experi- 
mental and theoretical methods. What appear at the moment to be 
extremely simple learning situations become continually less so as they 
are more delicately probed with the aid of quantitative theories. 

Naturally we are eager to extend our theories to more complex 
situations whenever tactical circumstances permit. When a complex 
situation can be considered to be compounded of simpler ones, the
extension of the theory can often be accomplished with little modifica- 
tion. Thus, we have treated discrimination learning as a resultant of 
two or more concurrent acquisition processes and paired-associate 
learning in terms of concurrent discriminations. When complex forms 
of learning resist analysis into simpler components, we may find it 
possible to account qualitatively for certain aspects of the learning by
means of general laws which have been established in simpler
situations. We may, for example, hope to clarify distributional phenomena
even in complex learning experiments by application of our stimulus 
fluctuation principles. Retention from trial to trial or period to period 
should be affected in qualitatively predictable ways by stimulus fluc- 
tuation regardless of particular experimental arrangements. Finally, 
there is the possibility, as yet relatively unexplored, that the same 
methods used in developing present statistical theories of elementary 
learning may be applied directly to the construction of theories for 
more complex variables and relationships. 

Formal organization. Our cross-sectional analysis of statistical 
learning theory in its present form reveals numerous points at which 
ambiguity or indeterminacy might enter into the system: in the data- 
language descriptions of empirical variables, in the definitions of set- 
theoretical concepts and theoretical variables, in the correspondence 
between theoretical concepts and empirical variables, in the estima- 
tion of parameter values. We should like to eliminate all traces of 
indeterminacy from the system, but that is too much to hope for. 
Our descriptions and analyses of empirical situations are never com- 
plete or final; therefore, at any stage of a scientific investigation there 
is a residue of uncertainty which must be reflected in our theoretical 
system. 

The strategy adopted for dealing with uncertainty and ambiguity is 
one of the distinctive aspects of the present approach to learning 
theory. One frequently hears the argument that so long as an empirical 
science is in a primarily exploratory stage, theories must be informal 
and qualitative. I do not question that informal and qualitative theo- 
rizing is sometimes necessary and even rewarding, but I do have 
doubts as to both the necessity and the wisdom of being long satisfied 
with it. The disadvantage of permitting a haze of ambiguity to cover 
an entire theory is that the theorist, like anyone trying to navigate in a 
fog, can never really tell how far he has come or whither he is heading. 
Although we cannot get rid of ambiguity entirely, we can localize it by 
making our theoretical concepts and assumptions precise and permit- 
ting indeterminacy only in the correspondences between theoretical 
and empirical variables. In the early stages of theory construction, 
it may still be impossible to localize the sources of an erroneous predic- 
tion, but as rapidly as we accumulate instances of successful prediction 
and description in any empirical area, we become committed to
particular rules of correspondence, or coordinating definitions, and the
entire theory becomes determinate. 

It will be clear from our analysis of the present theory that all 
linkages among constructs are explicit and determinate and all 
derivations of theorems are accomplished by exact mathematical
reasoning.²⁴ Interpretive rules, on the other hand, are somewhat open
ended. We cannot, for example, say once and for all what kinds of
events or operations will serve as reinforcing events. But in all the 
experimental situations that have been used for empirical tests of the 
theory we have had to make specific assumptions as to which opera- 
tions correspond to reinforcing events, and if we wish to retain 
the ground we have gained, we must remain committed to these 
correspondences. 

I would like to emphasize that my brief for rigor in theorizing does 
not imply any great love of formality for its own sake. In the develop- 
mental stages of a science it is not healthy for theories to stand still 
long enough for exhaustive logical analysis. The kind of formalization 
I consider necessary to sound theory construction consists in progres- 
sively sharpening the definitions of concepts and exposing concealed 
assumptions at the same time that the theory continues to undergo 
correction and refinement in the light of experimental applications. 

Mensurational and quantificational procedures. Presentations 
of even the most formal and quantitative learning theories do not 
include discussions of the logic of measurement. To one with a back- 
ground in mental testing or traditional psychophysics it may seem 
that the learning theorist must either be shirking his scientific responsi- 
bilities or else be possessed of some unfair advantages over workers in 
those fields. I believe that neither is the case. The advantageous
position of the learning theorist arises from the fact that he has not been
lured into searching for elaborate scales of measurement before the 
development of quantitative theories. 

There may, of course, be differences in usage of the term measurement.
By it I mean the coordination of an empirical system to a particular
type of mathematical system²⁵ within the framework of a theory. In
the case of fundamental physical measurement, the theories involved 
are so firmly accepted and so inextricably imbedded in our habits of 
thinking about nature that reference to theories is usually suppressed. 
In the case of derived physical measurements, e.g., those having to do
with heat, electrical conduction, the theoretical reference remains 
explicit; and the generally healthy condition of the theories is reflected 
in the associated scales of measurement. The severe growing pains of 
psychological measurement are largely attributable, I would suggest, 
to unfortunate early experiences in the house of dualism. Once it has 
been assumed that there must be psychological scales, or dimensions 
of measurement, paralleling physical scales, what is more natural than 
to carry out in a psychological testing situation some operations that 
yield numbers and assume that the numbers represent measurements 
on a “psychological scale”? This accomplished, the way is open for 
endless discussion of subtle mathematical questions whose answers 
have no empirically testable consequences and which are related in no 
direct way to the development of substantive theories. 

24 A strict axiomatization of a portion of the present theory was completed by 
Professor Patrick Suppes and the writer during our period in residence at the Center 
for Advanced Study in the Behavioral Sciences. This study is in preparation for 
publication. 

25 An elucidation of the properties of this mathematical system has been given by 
Campbell [14]. 

Investigations of learning begin with measurements of empirical 
variables but without an a priori assumption that the numbers ob- 
tained really represent measures on psychological scales. The investi- 
gator who measures the latency of a runway response in a learning 
experiment is not operating under the assumption that there must be 
scales of “psychological time” and “psychological distance” cor- 
responding to ordinary physical time and distance; therefore, he is not 
immediately confronted with such questions as whether he is justified 
in using a ratio scale or whether his units are additive. The investigator 
may still become embroiled in logical difficulties by making the as- 
sumption that he is really measuring “learning,” but modern learning 
theorists usually avoid this trap also. In the reinforcement theories of 
Hull and Skinner and in the statistical theories of the writer and others, 
one bypasses the hopeless problem of defining some unitary measurable 
variable to serve as a referent for the term “learning” and proceeds 
directly to relate such physically measurable behavioral variables as 
latency to physically measurable independent variables by means of 
laws or theories. 

Once a theory has begun to take shape, measurement is a matter of 
assigning numerical values to the variables and parameters of the 
theory. The problems of measurement may still be difficult enough, 
but they are technical, not philosophical. In the case of statistical 
learning theories, the one special problem of measurement is the prob- 
lem of statistical estimation. Having cast our methodological lot with 
the statistical theories of other disciplines, we share not only their 
problems, but also their methods and results. Also, new problems of 
statistical estimation arise in learning theory that have not been solved 
elsewhere, and the solution of these is becoming an important area of 
theoretical research [see 13, 72]. 
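
As a concrete illustration of what such estimation involves, the sketch below fits the single rate parameter of a linear learning model to a mean learning curve by least squares over a grid of candidate values. It is a minimal sketch and not the estimation procedure of the works cited above. It assumes the mean-curve form p_n = pi - (pi - p_1)(1 - theta)^(n-1) for a response reinforced with fixed probability pi on every trial, and all function names are hypothetical.

```python
def predicted_mean_curve(theta, pi, p1, n_trials):
    """Expected response probability on trials 1..n_trials.

    Assumed form: p_n = pi - (pi - p1) * (1 - theta) ** (n - 1).
    """
    return [pi - (pi - p1) * (1 - theta) ** (n - 1)
            for n in range(1, n_trials + 1)]


def estimate_theta(observed, pi, p1, grid_size=999):
    """Least-squares estimate of theta over an evenly spaced grid in (0, 1).

    `observed` is a list of observed response proportions, one per trial.
    This is an illustrative estimator only, not a maximum-likelihood one.
    """
    best_theta, best_sse = None, float("inf")
    for k in range(1, grid_size + 1):
        theta = k / (grid_size + 1)
        predicted = predicted_mean_curve(theta, pi, p1, len(observed))
        sse = sum((o - q) ** 2 for o, q in zip(observed, predicted))
        if sse < best_sse:
            best_theta, best_sse = theta, sse
    return best_theta


# Usage: recover a known theta from noise-free synthetic "data".
synthetic = predicted_mean_curve(theta=0.2, pi=0.75, p1=0.5, n_trials=30)
theta_hat = estimate_theta(synthetic, pi=0.75, p1=0.5)
```

With real data the observed proportions carry sampling error, so the estimate would itself have a sampling distribution. That is the technical, rather than philosophical, problem referred to above.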


RANGE OF APPLICATION 


Numerous questions can be asked about the empirical scope and 
fertility of statistical learning theories. Probably the reader will have 
been able to derive his own answers to most of them from the material 
given in Part II of this report. Here I shall limit myself to brief sum- 
mary comments on some of the specific questions raised in the Project 
A outline. 

I would prefer to define the present scope of the system, not in 
terms of the problems for which it is always right, but in terms of the 
area in which it is presently undergoing development in association 
with experimental researches. So defined, the area of application is 
roughly coextensive with that in which the experimental analysis of 
learning has reached the stage of yielding data suitable for quantita- 
tive treatment. I say “roughly” because, on the one hand, some prob- 
lems in this area have not so far been touched by the statistical ap- 
proach and, on the other, predictions derived from statistical theories 
often run ahead of available experimental data. Generally speaking, 
applications of statistical learning theory in its present form are limited 
to situations in which measures are taken on well-defined response 
classes and in which reinforcing events can be identified. Among the 
standard experimental situations dealt with so far are classical and 
instrumental conditioning, runway and T-maze learning, simple 
discriminative learning, simple verbal learning in Thorndikian, 
paired-associate, and guessing experiments, and simple escape and avoid- 
ance learning. Within these situations, only restricted sets of variables 
have been treated. For example, considerable progress has been made 
in dealing with random reinforcement schedules, but only a bare start 
in dealing with magnitudes of reinforcing stimuli; relationships in- 
volving stimulus variability have been formalized but those involving 
stimulus traces have not. In nearly all cases, tests of theoretical predic- 
tions have been most successful under conditions of temporally spaced 
trials. Under conditions of extremely massed trials or continuous work, 
complications enter that have not been analyzed in terms of statistical 
models. 

Rigorous empirical tests have been carried out only in a very 
restricted set of experimental situations. Statistical learning theorists 
are not the first to find that the assumptions underlying quantitative 
tests of a theory are rarely satisfied except in experiments carried out 
especially for the purpose. This state of affairs is bound to continue. 
What we hope for, and in some instances have reason to expect, is that 
once particular concepts and principles of the theory become estab- 
lished by rigorous tests in special experiments, they will prove useful 
aids to the interpretation of empirical phenomena in a wider range of 
situations, perhaps in some cases extending even beyond the laboratory 
walls. 


EVALUATION OF THE SYSTEM 


One function always expected of a theory is to replace the detailed 
descriptions of fact provided by the experimenter’s protocol with 
descriptions which, though less complete, are more compact and more 
useful. How well this function is served depends in part upon how well 
the laws derived from the theory fit the facts. In the case of a quantita- 
tive theory, it is usually easy to decide whether predictions are in 
agreement with data (or at least whether there is significant disagree- 
ment) but often not so easy to decide whether an experimental 
application has resulted in confirmation or refutation of the theory. 

Quantitative theories are occasionally completely wrong but they 
are rarely completely right. Among the researches cited in Part II, 
we have seen numerous instances in which statistical learning theory is 
superior to any competitor in handling a particular kind of data, but 
still falls short of what we would like to achieve with it. Each step for- 
ward in our coverage of empirical data brings us at the same time to 
new problems and a new view of what remains to be accomplished. 
Thus the first applications of the theory to simple selective learning 
under random reinforcement were intended to test predictions con- 
cerning curve forms and asymptotes. The results in these respects left 
little to be desired, but they immediately suggested the possibility of 
evaluating the parameters of the system under one set of experimental 
conditions and then making entirely a priori predictions of behavior 
under other conditions. 

Some of these more exacting tests of the theory have led to accurate 
predictions of quantitative relationships and some have not. Should the 
entire set of findings be taken as evidence for the theory or against it? A 
simple balance sheet would not be particularly illuminating. We can 
say that in each of the empirical situations to which it has been applied, 
the theory has generated accurate descriptions of some, at least, of the 
principal trends in the data. Perhaps more important, the disparities 
between theory and observation have led to more detailed experi- 
mental analyses and, in some cases, to new empirical distinctions and 
classifications which appear to further the interpretation and organiza- 
tion of learning phenomena. 

For the experimentalist, evaluation of the theory comes down to 
evaluation of the research instigated and organized by it. Lacking any 
generally accepted standards for this kind of assessment, I have tried 
to report in Part II, as objectively as my personal involvement would 
permit, the history of the theory in mediating research. This account 
may permit one who has also the benefit of hindsight and detachment 
to provide the value judgments. By way of summary, I shall simply list 
here by topics the chief lines of empirical research instigated by the 
theory up to the present time: 


1. Effects of competing responses [20], magnitude of reinforcement 
[22], and deprivational conditions [6] upon curves of operant con- 
ditioning (bar pressing in a free responding situation) 

2. Effects of stimulus variables upon curves of classical condition- 
ing [5] and verbal conditioning [11] 

3. Conditions of constancy and modifiability of acquisition and 
extinction rates [27, 36, 64, 66] 

4. Resistance to extinction in relation to partial reinforcement 
under conditions of widely spaced trials [92, 93] 

5. Forms and asymptotes of acquisition curves in relation to con- 
ditions of random reinforcement and nonreinforcement [1, 2, 37, 65, 
76] 

6. Effects of new random reinforcement procedures: probability of 
reinforcement variable over trials [31], probability of reinforcement 
contingent on outcomes of preceding trials, probability of reinforce- 
ment controlled by ambiguity of the reinforcing stimulus²⁶ 

7. Extensions of the treatment of random reinforcement in simple 
human learning to escape learning [88, 89] and avoidance [8] situa- 
tions 

8. Simple human discrimination learning in relation to stimulus 
variables and partial reinforcement [17, 31, 33, 34, 44, 63, 79] 

9. Quantitative prediction of stimulus compounding [34, 39, 78, 
83] 

10. Spontaneous recovery, regression, and related distributional 
phenomena in relation to number and spacing of acquisition and 
extinction trials [36, 40, 54, 66, 69] 

11. Quantitative analyses of paired-associate and serial learning of 
verbal responses [3] 

12. Learning in elementary forms of social interaction, e.g., two- 
person games [4, 31] 


This list has been organized in terms of research cited in Part II of this 
report and associated with the Indiana branch of statistical learning 
theory. Among the studies associated with related statistical or proba- 
bilistic models are those of Bush and Mosteller and their associates on 
parameters of learning functions under partial reinforcement and 
learning in simple social situations [13, chaps. 12, 13]; those of Restle 
on the role of relevant and irrelevant cues in discrimination learning 
[80]; and those of Wyckoff on “observing responses” in discrimination 
learning [95]. 

26 Estes, W. K., & Johns, M. Probability-learning with ambiguity in the rein- 
forcing stimulus. Amer. J. Psychol., 1958, 71, 219-228. 

What specific aspects of statistical learning theories have been re- 
sponsible for the research instigated by them? One of the most critical 
features is probably a matter of timing. Unlike some of their predeces- 
sors, these theories have taken root in areas where the way has been 
prepared by considerable experimental analysis and the isolation of 
quantifiable empirical variables. The experimental spadework has 
provided both a tissue of empirical relationships from which promising 
theoretical assumptions could be extracted and also possibilities of 
prompt empirical testing for the embryo theories. The quantitative 
character of the theories tends in itself to facilitate research. The exact 
and specific predictions generated by them call irresistibly for ob- 
servational tests. When experimental outcomes confirm the predic- 
tions, the investigator has reason to feel rewarded; he has won a point 
in his running game against nature. Disparities between quantitative 
predictions and data are momentarily less satisfying than successful 
correspondences, but in a way more important for the long-term 
fruitfulness of the system. If one restricted applications of a theory to 
problems for which successful outcomes were highly probable, a period 
of unadulterated pleasure might ensue, but in time the system would 
grind to a halt. Although maintenance of the investigator’s morale 
requires a sprinkling of happy outcomes, the theory is nurtured more 
by its errors of prediction. 

Associated with any fruitful theory are constant attempts to refine 
and extend the network of predictive relationships. A successful ap- 
plication consists in collecting and processing information as prescribed 
by the theory, i.e., evaluating theoretical variables, and then correctly 
predicting a relationship among empirical variables. When in any 
instance this much has been accomplished by application of the 
statistical model, the next step has been to ask whether, using the 
same information, i.e., the same parameter values, further predictions 
could be generated relating the given variables to others or specifying 
relationships among the variables in new situations. 

Sooner or later, these attempted extensions of the predictive net- 
work lead to disparities between theory and data. And at this point 
again, strategic considerations are important. One must yield to the 
facts, but not too readily. Our increasing confidence in the basic 
soundness of the statistical approach is not only a result but also a 
cause of its fruitfulness in leading to new lines of research. Our initial 
response to an erroneous prediction is not immediate modification of 
the theory, but continued experimentation, with the expectation that 
in each new situation suitable analyses will enable us to distinguish 
conditions under which the theory does and does not hold. Isolation of 
such conditions may provide the starting point for revision or exten- 
sion of the theory. As we have seen in a preceding section, the history of 
the present system has been structured largely by a series of these 
critical situations in which a mixture of successes and failures of predic- 
tion has led both to new experimental analyses and to modifications of 
the formal model. 

At the time of writing, we can distinguish a number of key prob- 
lems which represent foci of current research efforts and whose resolu- 
tion will probably determine the next lines of theoretical development. 
One of these problems arises from the ubiquitous tendency for param- 
eters representing rates of learning to vary systematically with 
probability of reinforcement and with previous learning history. A 
second is associated with stimulus overlap in discrimination experi- 
ments, a third with the complex pattern of dependencies between 
effects of nonreinforcement and other experimental conditions. More 
could be enumerated, but I believe that if even these three problems 
could be added to those already handled effectively, we would have 
something bearing more than casual resemblance to a workable 
descriptive theory of elementary learning. 

The various components and aspects of statistical learning theories 
that we have reviewed differ widely with respect to probable longevity. 
Particular mathematical forms and formulas are likely to be revised 
many times, subject, however, to the restriction that the well-con- 
firmed experimental laws derivable from the theory in its present 
form should appear as approximations or special cases in successor 
theories. Some of the more general features of the present system, 
e.g., the probabilistic response model, the view of learning as a sam- 
pling process, the method of defining and representing reinforcing 
events, the exploitation of contiguity principles as a source of hypoth- 
eses relative to conditions of reinforcement and nonreinforcement, the 
assumptions of additivity and transposability of stimulus-response 
relations, may be expected to weather many seasons. 

In some instances, concepts and methods developed within the 
present system may prove useful in other areas of application. Espe- 
cially likely candidates for this interdisciplinary function are the 
techniques of representing classes of empirical variables by mathe- 
matical sets, the probabilistic methods of deriving relations among 
response measures, the general procedure for formulating basic 
assumptions in terms of difference equations which represent the 
momentary effects of causal variables and then from these deriv- 
ing descriptive laws for particular combinations of experimental 
conditions. 


INTERRELATIONS WITH OTHER SYSTEMS 


Classical learning theories. It does not seem possible to make any 
direct comparison between the statistical approach to learning theory 
and that of such “field” or “cognitive” theorists as Tolman, Lewin, 
or Köhler. The two approaches are characterized by different descrip- 
tive frames of reference and different immediate objectives; and there 
is almost no overlap in problems investigated, methods, or results. 
It is possible that some of the methods of statistical learning theories 
may prove applicable to empirical problems that have interested 
cognitive theorists, but there is at present no real basis for prognosti- 
cating the forms or probable outcomes of such applications. 

The present theory embodies many features of Skinner’s “descrip- 
tive behaviorism,” especially the emphasis on descriptions of learning 
in terms of functional relationships between experimentally defined 
classes of behavioral and environmental events and the operational 
treatment of reinforcement. I would consider the most important 
difference between the two approaches to be one of emphasis. Al- 
though he speaks of response probability as a systematic dependent 
variable, Skinner has preferred to concentrate his research almost 
entirely on the formulation of laws that can be shown to hold uni- 
formly for individual organisms in highly reproducible situations. 
Investigators associated with the present system are interested also in 
statistical regularities which can be demonstrated only by averaging 
data over groups of subjects or over repeated experiments with in- 
dividual subjects. When there is a choice, all would agree, I think, that 
laws which provide unerring predictions for individual cases are the 
most satisfying. However, there is no guarantee that the choice is 
always open to us. In psychology, as in other fields of science, there 
may be important predictive relationships that can be studied only at a 
statistical level of analysis. And I have been able to find no tangible 
empirical basis for anticipating that with progressive refinements of 
experimental control, probabilistic laws of behavior will always con- 
verge to deterministic ones. 

A number of ideas which have played important roles in the de- 
velopment of statistical learning theory bear obvious signs of Guthrie’s 
influence. These include especially the stimulus-response contiguity 
principle and the conception of gradual learning as an outcome of all- 
or-none laws operating on many components of a situation. To what 
extent the utilization and modification of these concepts within the 
present system are in harmony with Guthrie’s thinking I shall not try 
to decide. The quantitative methodology of statistical learning theory 
has so little in common with the informality and intuitive emphasis of 
Guthrie’s approach that comparative analyses for any but historical 
purposes are apt to be unrewarding. 

Whether statistical learning theory should be regarded as either a 
competitor or a successor to Hull’s system is hard to say. The statistical 
approach grew out of attempts to refine and quantify certain aspects of 
contemporary reinforcement theory, and the contributions of Hull 
and his followers to the latter are too well known to require comment. 
At present, the empirical scope of statistical learning theory is much 
more restricted than that envisaged by Hull’s quantification program. 
On the other hand, I believe it can fairly be said that in the common 
areas of application, the statistical theory is more determinate and 
testable. In any case, it should be possible within a reasonable time to 
determine which approach is the more advantageous as a research 
instrument and as a basis for further theory construction in the com- 
mon areas. As I have indicated elsewhere [28], my own forecast for 
the immediate future is that Hull’s system, together with the other 
broadly programmatic theories of recent decades, will be superseded 
by a loosely related federation of more restricted theories, statistical 
or otherwise, with the basic frame of reference and descriptive vocabu- 
lary of stimulus-response reinforcement theory serving a unifying 
function by providing a common connective link between model 
builders and experimenters. 

Stochastic models. The stochastic model developed by Bush and 
Mosteller [12, 13] is similar in many respects to the statistical model 
for acquisition phenomena described in this paper. The communalities 
are not entirely accidental, for those investigators and I have kept in 
relatively close communication, with numerous exchanges of methods 
and results over a period of years. It will be instructive to compare the 
two programs, bringing out points of similarity and difference with 
respect to methodology, empirical scope, and types of mathematical 
models employed. 

Comparisons among contemporary quantitative learning theories 
are facilitated by the fact that nearly all of them treat learning as a 
stochastic process. By this I mean that response probabilities serve as 
systematic dependent variables and laws derived from the theories 
take the form of mathematical operators or equations describing 
changes in response probabilities during learning. The theories devel- 
oped by Bush and Mosteller, Detambel [18], Miller and McGill [74], 
Restle [80, 81], Wyckoff [96], and me and my colleagues at Indiana all 
follow this pattern, and there are indications that even Hull’s quanti- 
tative theory, as revised by Spence [86], is coming into the fold. In 
fact, all these theories except that of Hull employ substantially the 
same response model. Differences in approach among the theorists are 
associated in large part with different starting points. Miller and McGill, 
Restle, Wyckoff, and Detambel all began with specific empirical 
problems. I and my colleagues have begun theory construction in 
some cases with specific empirical problems and in some cases with 
attempts to formalize and test explanatory hypotheses; the latter 
include, for example, hypotheses having to do with stimulus fluctua- 
tion and with contiguity-interference principles. Bush and Mosteller 
began with a particular limited mathematical system which seemed 
likely to have applications to learning and other psychological 
phenomena; they have proceeded to develop this mathematical system 
in great detail and have applied it to a number of experimental 
problems, e.g., symmetric choice problems, avoidance learning, and 
imitation, thus generating limited descriptive theories for these 
situations. 

The empirical scope of Bush and Mosteller’s work has been con- 
fined largely to certain aspects of acquisition and extinction, especially 
in situations involving partial reinforcement. Some correspondences 
between their approach and mine in this area may be illustrated in 
terms of one of the experimental situations that both have dealt with, 
the verbal conditioning experiment. In our approach to this situation, 
Straughan and I [37] showed that starting with assumptions of stimu- 
lus sampling and association by contiguity, one can derive the quanti- 
tative laws (discussed in a preceding section) 


$$p_{n+1} = (1 - \theta)p_n + \theta$$ 


expressing the increment in response probability resulting from a rein- 
forcement of the given response, and 


$$p_{n+1} = (1 - \theta)p_n$$ 


expressing the decrement resulting from reinforcement of an opposed 
response. Bush and Mosteller [13, chap. 13] arrive at the same laws by 
a different route. Their general model is assumed to be applicable to 
any learning situation in which events, e.g., reinforcements or non- 
reinforcements, produce changes in response probabilities describable 
by the linear transformation 


$$Qp_n = \alpha p_n + (1 - \alpha)\lambda$$ 


where $p_n$ represents response probability before the given event has 
occurred, and $Qp_n$ the new value of response probability after the event. 
In Estes and Straughan’s equations, $\theta$ represents the stimulus sam- 
pling ratio; in Bush and Mosteller’s, $\alpha$ and $\lambda$ are free parameters which 
must be evaluated in terms of the conditions of a particular application. 
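
The role of the two free parameters is perhaps easiest to see by applying the linear operator repeatedly. The short derivation below is an illustrative aside and is not taken from Bush and Mosteller's presentation. It assumes that the same event recurs on every trial and that $0 \le \alpha < 1$.

```latex
\begin{aligned}
Q p   &= \alpha p + (1 - \alpha)\lambda, \\
Q^2 p &= \alpha\,[\alpha p + (1 - \alpha)\lambda] + (1 - \alpha)\lambda
       = \alpha^2 p + (1 - \alpha^2)\lambda, \\
Q^n p &= \alpha^n p + (1 - \alpha^n)\lambda
       \;\longrightarrow\; \lambda \quad (n \to \infty).
\end{aligned}
```

On this reading $\lambda$ is the limit that response probability approaches under indefinite repetition of the event, and $\alpha$ governs the rate of approach.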
To deal with the two-choice guessing situation, Bush and Mostel- 
ler define two operators, say 


$$Q_1 p_n = \alpha_1 p_n + (1 - \alpha_1)\lambda_1$$ 
and 
$$Q_2 p_n = \alpha_2 p_n + (1 - \alpha_2)\lambda_2$$ 


to represent the effects of reinforcement and nonreinforcement, 
respectively. By considering known experimental facts, they place 
restrictions on the parameters. It is known, for example, that in this 
type of experiment, 100 per cent reinforcement and 100 per cent 
nonreinforcement of a response lead to asymptotic probabilities of 
unity and zero, respectively. In order to represent this fact in the 
model, Bush and Mosteller impose the restriction $\lambda_1 = 1$ and $\lambda_2 = 0$. 
Other experimental facts concerning asymptotic probabilities under 
partial reinforcement appear to require the further restriction 
$\alpha_1 = \alpha_2 = \alpha$. Now their equations take the form 


$$Q_1 p_n = \alpha p_n + 1 - \alpha$$ 
and 
$$Q_2 p_n = \alpha p_n$$ 


for this situation. Since the name given a parameter is of no conse- 
quence, their parameter $\alpha$ could be relabeled $1 - \theta$, and their 
equations would become 


$$Q_1 p_n = (1 - \theta)p_n + \theta$$ 
and 
$$Q_2 p_n = (1 - \theta)p_n$$ 


It is clear that, except for unessential differences in notation, the equa- 
tions resulting from specialization of Bush and Mosteller’s general 
model have the same form as those derived by Estes and Straughan 
from the assumptions of statistical learning theory. Bush and Mosteller 
use the same general methods of derivation as do I and my colleagues, 
so the mean curves of learning derived for given experiments with 
various schedules of random reinforcement turn out to be identical in 
form no matter with which of these approaches one has started. 
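
The correspondence just described can be checked numerically. The sketch below is an illustration under stated assumptions, not either group's published procedure. It writes the general linear operator and the two sampling-theory laws as functions, confirms that the operator reduces to those laws when alpha = 1 - theta with lambda set to 1 or 0, and iterates the expected one-trial change for a response reinforced with fixed probability pi regardless of the subject's behavior. All function names are hypothetical.

```python
import math


def linear_operator(p, alpha, lam):
    """General linear operator: Qp = alpha * p + (1 - alpha) * lam."""
    return alpha * p + (1 - alpha) * lam


def reinforce(p, theta):
    """Sampling-theory increment: p' = (1 - theta) * p + theta."""
    return (1 - theta) * p + theta


def nonreinforce(p, theta):
    """Sampling-theory decrement: p' = (1 - theta) * p."""
    return (1 - theta) * p


def mean_curve(p1, theta, pi, n_trials):
    """Expected response probability on trials 1..n_trials.

    Assumes the response is reinforced with probability pi on each trial,
    independently of what the subject does (noncontingent schedule).
    """
    curve = [p1]
    for _ in range(n_trials - 1):
        p = curve[-1]
        curve.append(pi * reinforce(p, theta) + (1 - pi) * nonreinforce(p, theta))
    return curve


# The specialised operators coincide with the sampling-theory laws.
theta, p = 0.1, 0.3
same_up = math.isclose(linear_operator(p, 1 - theta, 1.0), reinforce(p, theta))
same_down = math.isclose(linear_operator(p, 1 - theta, 0.0), nonreinforce(p, theta))

# Mean curve for pi = 0.8, compared with pi - (pi - p1) * (1 - theta) ** (n - 1).
curve = mean_curve(p1=0.5, theta=0.1, pi=0.8, n_trials=200)
closed_form_trial_11 = 0.8 - (0.8 - 0.5) * (1 - 0.1) ** 10
```

Because the one-trial change is linear in p, averaging over the two possible events gives p_{n+1} = (1 - theta) p_n + theta pi, and the iterated curve approaches pi whichever of the two starting points is used to justify the operators.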

The fact that two models have been found to deliver the same 
learning functions when applied to one or more common situations 
does not imply that they will always do so, for the models may differ 
with respect to properties which do not reveal themselves in those par- 
ticular applications. From a mathematical viewpoint there are a num- 
ber of important differences between the linear operator model de- 
veloped by Bush and Mosteller and the corresponding model asso- 
ciated with statistical theory. The former may be characterized as an 
infinite, Markovian, stochastic model. By stochastic we refer simply to 
the fact that changes in behavior during learning are represented in 
terms of changing response probabilities. By Markovian we mean that 
the entire course of learning is, in principle, predictable given the set 
of response probabilities characterizing a subject’s state at any one 
time. The model is infinite in the sense that response probability is 
permitted to vary over an infinite set of possible values. 

By contrast the set-theoretical model developed by Estes and Burke 
[32], which is basic to the learning functions discussed in earlier sec- 
tions of this review, may be characterized as a finite, non-Markovian, 
stochastic model: finite because response probability is restricted to a 
finite set of values; non-Markovian because in the general case, predic- 
tion of the course of learning requires not only information concerning 
response probabilities at some particular time but also a complete 
specification of the distribution of $\theta$ values over the subsets of stimulus 
elements connected to the various response classes. In the “equal $\theta$” 
case, however, the set model is Markovian, and, because it is also 
finite, has even simpler mathematical properties than Bush and 
Mosteller’s infinite model. 
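
The sense in which the set model is finite can be made concrete with a small simulation. The sketch below is an illustration under stated assumptions rather than Estes and Burke's formal development. It assumes a population of N stimulus elements, each sampled independently with the same probability theta on every trial (the “equal theta” case), a two-response situation in which every sampled element becomes connected to whichever response is reinforced on that trial, and reinforcement of the reference response with fixed probability pi. The names used are hypothetical.

```python
import random


def simulate_set_model(n_elements, theta, pi, n_trials, seed=0):
    """Simulate one subject under an equal-theta stimulus-sampling model.

    Returns, for each trial, the number of elements connected to the
    reference response at the start of that trial. Response probability
    on the trial is taken to be that count divided by n_elements.
    """
    rng = random.Random(seed)
    connected = [False] * n_elements      # no element connected initially
    counts = []
    for _ in range(n_trials):
        counts.append(sum(connected))
        reinforced = rng.random() < pi    # is the reference response reinforced?
        for i in range(n_elements):
            if rng.random() < theta:      # element i is sampled on this trial
                connected[i] = reinforced
    return counts


# Usage: with N = 10 the response probability can only take the
# eleven values 0, 0.1, ..., 1.0, whatever theta and pi may be.
N = 10
counts = simulate_set_model(n_elements=N, theta=0.2, pi=0.7, n_trials=50)
probabilities = [k / N for k in counts]
```

In this sketch the whole state of the subject is the set of connected elements, and with a common theta the count of connected elements is enough to predict the next trial. The linear operator model, by contrast, lets response probability take any value in the unit interval.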

What is the import of these mathematical differences for problems 
of learning theory? One obvious conclusion is that where both are 
applicable, a choice between the linear operator model and the equal 
$\theta$ case of the set model comes down pretty much to a matter of per- 
sonal preference on the part of the investigator. Theoretically it is 
possible to derive differential predictions from the two models 
(in the application to verbal conditioning experiments, for example, 
this could be done for variances around mean learning curves), but 
I doubt that the differences would be empirically detectable with 
present levels of experimental control and mensurational precision. 
Bush and Mosteller have investigated the mathematical properties of 
the infinite model extensively, and many of their results, e.g., those 
having to do with bounds on mean curves, the combining of classes 
theorem, and statistical estimation procedures, are applicable to the 
finite model as well. In the case of problems for which stimulus varia- 
bles are important, the set model will be the natural choice since it 
provides a natural means for representing stimulus properties. Even 
here there is a parallelism between finite and infinite models, for, as 
Bush and Mosteller have pointed out [13, chap. 2], one can construct 
an infinite set model from which the operators of an infinite stochastic 
model are derivable. 

It is interesting to note that other investigators have arrived by 
very different routes at stochastic models similar in formal properties to 
those discussed above. Consider, for example, the work of Miller and 
McGill on free-recall verbal learning [74]. They started with no ap- 
parent attachment to any particular variety of learning theory, and 
set themselves the task of constructing a model that would account 
in detail for a specific set of data. Their work is remarkable for the 
extent to which the form of their model was dictated by the quanti- 
tative properties of the data. Only after their model, an infinite, 
Markovian, stochastic model in the classification used above, had 
taken definite form did its close interrelations with those of Bush and 
Mosteller and Estes and Burke become apparent. The difference 
equations, or operators, expressing the effects of successful recalls and 
failures of recall, respectively, in Miller and McGill’s model turn out 
to be identical in form with those expressing effects of reinforcement 
and nonreinforcement in the others. 

In the area of discrimination learning, Restle [80, 81] has devel- 
oped a model which, from a mathematical viewpoint, may be re- 
garded as a modification of Estes and Burke’s “equal $\theta$” case. Restle 
assumes that two processes go on concurrently in discrimination learn- 
ing; the subject simultaneously associates relevant cues with appro- 
priate responses and “adapts” (learns to ignore) irrelevant cues. The 
equations describing the combined process are relatively complex, but 
for a situation in which all cues are relevant, they reduce to those of the 
“equal θ” model. 

Extrapolating somewhat the trends disclosed by the present re- 
view, I would guess that the outcome of the various contemporary 
research programs in quantitative learning theory will be, not a num- 
ber of competing schools or systems, but a stock of common methods 
and a descriptive frame of reference upon which reasonably general 
agreement can be reached. The mathematical fruits of these researches 
—e.g., set models, linear operator models, Markov models, techniques 
for dealing with response sequences—quickly become common prop- 
erty and frequently find application in problem areas quite different 
from those in which they have originated. The general conception of 
learning as a probabilistic or statistical process has been taken up by so 
many contemporary investigators of diverse backgrounds and research 
interests that it seems likely to dominate forthcoming lines of theo- 
retical development. 

Within the collection of limited theories generated by the various 
statistical and probabilistic approaches, we may distinguish three main 
types which pose somewhat different problems relative to integration 
and generalization. 

1. At one extreme are miniature theories in the strict sense, i.e., 
those associated with models that have been constructed to order for a 
particular set of data. Only under favorable conditions of diligent 
cultivation and cross-fertilization do such theories show any great 
tendency to grow and expand. As one may see vividly in Hilgard’s 
review [52], it is all too common for miniature theories to bloom and 
wither away in a single season, leaving almost no trace on the broader 
field of learning theory. Some of the measures which can be taken to 
avoid isolation and sterility of a miniature theory are illustrated in the 
work of Miller and McGill [74]. After constructing a model to repre- 
sent a particular type of verbal learning data, they proceeded to 
study the general class of models to which theirs belonged and thereby 
to make contact with related approaches. 

2. A family of formally related miniature theories may be gen- 
erated by a series of particular interpretations of a general mathe- 
matical model or system. This technique is exemplified in Part 2 (ap- 
plications) of Bush and Mosteller’s book [13]. The common mathe- 
matical system does not guarantee the emergence of a general theory 
but it may facilitate attempts to generalize and integrate the local 
models. 

3. A theory may be both limited and general: limited in that it is 
restricted to relationships among a small set of variables, but general 
in that these variables and relationships are assumed to play similar 
roles in a variety of empirical situations. Illustrative of this category 
are the theories of reinforcement, stimulus sampling, and stimulus 
fluctuation discussed in earlier sections of the present report. This 
approach stands or falls on the consequences of testing the general 
assumptions in a series of particular situations. When successful, it 
generates a family of both formally and substantively related miniature 
theories. Such an outcome probably represents the closest approximation 
to general learning theory that we can hope to achieve in the 
foreseeable future. 
INTRODUCTION 


The experimental data emanating from the Wisconsin Primate Lab- 
oratory during the last quarter-century created in the author vast skepti- 
cism concerning the learning systems which were developing and attain- 
ing popularity. There appeared, however, to be no point in opposing or 
attacking such systems, since their fundamental limitations lay not in the 
underlying logic, but in the fact that they were based on circumscribed 
and inadequate information. It became apparent that a major contribu- 
tion to theoretical psychology could be made only by developing an 
alternative system which could incorporate these data that failed to fit 
into the current theoretical schemata. 

Both simple observation and experimentation made it obvious that the 
role of the internal drives had been overemphasized in the dominant con- 
temporary theories. This overemphasis appeared not only in the unten- 
able position of drive-reduction learning, but was also indicated by the 
undue percentage of studies, stemming from many laboratories, in which 
some deprivation condition was the primary variable under consideration. 
Our own investigations made it quite clear that internal states are 
variables of small importance in most learning situations and that in 
many situations, their importance lies more in the suppression than in 
the augmentation of behaviors essential to successful environmental adap- 
tations by organisms. 

Similarly, it was evident that overemphasis upon internal motiva- 
tional agents had led to underestimation of the role of external motiva- 
tional agents—the stimuli producing forced movements or tropisms in 
primitive organisms, releaser functions in higher fish, birds, and reptiles, 
and exploratory-curiosity motivation in our common laboratory animals, 
including man. In an effort to assign proper emphasis to external stimuli 
as motivating agents, we conducted a series of studies on exploration and 
curiosity motives, and these data, supplemented by data on the subprimates, 
now provide a wealth of information on the importance of incentives. 
It may be noted that these data of ours are far more in keeping 
with Tolman’s general position than with that of any other learning 
theorist. 

We became convinced that many of the shortcomings of contempo- 
rary theoretical systems stemmed from the fact that there exist practically 
no data on the ontogenetic development of learning, perception, and 
motivation in any animal other than man, and unfortunately, it is diffi- 
cult to assess with accuracy the relative roles of maturational and learn- 
ing factors in the human organism’s development. In an effort to de- 
termine how and when various perceptual, learning, and motivational 
processes develop, we initiated a major program in which macaque 
monkeys were separated from their mothers at birth. After three years 
of effort we have obtained a large body of facts which may be exchanged 
for the deductive psycho-ontogeny of the past. We are quite certain now 
that form discrimination is almost entirely a function of maturation 
rather than the learned formation of cell assembly systems or any other 
postulated learning mechanism. We know that exploratory motives are 
not derived from internal drives, which in many cases they antedate 
ontogenetically; we have presumptive evidence that many or most social 
motives also are not derived from internal drive states. We know, too, 
that single-problem learning is a much simpler process than the forma- 
tion of learning sets. These are all data which may be used in a formula- 
tion of a system, if adequate frameworks and integrations can be de- 
veloped. 

Learning theory, after Tolman, repressed the higher mental processes 
—complex learning and problem solution. Tolman had, of course, at- 
tempted to deal with these processes in terms of inferential expectation, 
insight, or creative ideation, but Tolman failed here because at the time 
nothing was known of the mechanisms that would enable us to trace the 
development and organization of these complex behaviors. We are uninterested 
in any system that cannot in an orderly way attack the problems 
of complex behaviors because our paramount concern transcends the 
study of the comparative psychology of the subprimate forms. Further- 
more, we are firmly convinced that the analysis of simple behavior gives 
relatively little information about the nature of complex behavior, 
whereas the successful analysis of complex behavior may give vast insight 
into the nature of simple behavior. 

We believe that learning set formation and error factor analysis are 
fundamental and effective techniques for analyzing complex learned be- 
haviors and for understanding the simpler learned behaviors. Further- 
more, we believe that the detailed understanding of learning sets and 
error factors will lead to the integration of a very considerable body of 
psychological information. Firm in this belief, we agreed in 1954 to cover 
the topic of “Systematic Implications of ‘Learning Sets’ and Related 
Matters.” 

When we accepted the task of contributing to Project A, we were in 
no position to attempt the development of a psychological system in the 
sense that this term is presently understood. Whether this would be true 
if we could start now may be open to question, but unfortunately we 
cannot live life backward. Actually, the materials here presented will be 
basic to any system we may present in the future. 

In 1949 the author [20] demonstrated that rhesus monkeys learn suc- 
cessive nonspatial discrimination problems with progressively greater 
facility and referred to this phenomenon as the result of the formation 
of a “learning set,” a term abbreviated as “LS.” Just as learning is gen- 
erally regarded as a hypothetical construct—an intervening variable— 
so, also, is learning set treated. Learning set formation has also been 
called “interproblem learning” to contrast it with “intraproblem learn- 
ing,” which is limited to the learning that occurs during the solution of a 
single problem. 

Study of the effect of the learning of a single problem on the learning 
of subsequent problems has a long psychological history—the problem 
area of transfer of training. Learning set formation represents a partic- 
ular type of transfer of training, the transfer between many problems of 
a single class instead of the more commonly studied transfer between 
problems of disparate classes or transfer between a few problems of a 
single class. Transfer of learning between problems within a single class 
was studied before Pavlov, by Pavlov [53], and after Pavlov, but early 
investigators failed to develop techniques for effective analysis or to rec- 
ognize the significance of the phenomenon. 

Previous workers studied interproblem learning over a narrow range 
of problems and usually trained their subjects to mastery, or near mastery, 
of all problems. After some equivocation, the author [21] departed 
radically from these procedures; he ran each individual problem for a 
small number of trials and analyzed learning in relatively large blocks 
of problems. 


INTRAPROBLEM AND INTERPROBLEM LEARNING CURVES 


The apparatus and test situation which we have commonly used in 
learning set studies is shown in Fig. 1. In the discrimination problem 
being tested two stimulus objects differing in multiple characteristics 
cover the two foodwells of the test tray. In this instance the barrel-shaped 
stimulus object is correct and rewarded, the cube incorrect and not rewarded. 
The positions of the objects on the tray shift from left to right 
in a balanced, predetermined order in the series of six trials. A noncorrection 
method is used. Regardless of the animal’s performance level, a 
new problem employing new objects follows directly after the completion 
of trial 6. Ten different sequences of position rewards are randomly presented 
in each block of 10 six-trial problems. Animals are given a number 
of problems in a single session, each problem presenting new stimulus 
objects. In some experiments, monkeys have completed as many as 14 
problems a day (84 trials) every day for several months. 

FIG. 1. Wisconsin General Test Apparatus. (Figure label: one-way vision screen.) 

Interproblem learning experiments are not, of course, limited to the 
apparatus, method, or kind of learning task described. They may utilize 
any apparatus or task the experimenter chooses, and procedures must be 
devised to fit the experimenter’s needs. The only invariable requirement 
is that there be multiple problems of comparable difficulty level. 
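
The six-trial procedure described above lends itself to a short sketch of how one block of position sequences might be drawn. The sketch assumes that "balanced" means the correct object appears equally often on the left and on the right within a problem; the actual sequences used at Wisconsin are not given here, and the function names are hypothetical.

```python
import itertools
import random


def balanced_sequences(n_trials=6):
    """List every left/right sequence with equal numbers of L and R.

    Assumption: 'balanced' means the rewarded object is on each side
    on exactly half of the trials of a problem.
    """
    half = n_trials // 2
    sequences = []
    for left_trials in itertools.combinations(range(n_trials), half):
        seq = "".join("L" if t in left_trials else "R" for t in range(n_trials))
        sequences.append(seq)
    return sequences


def block_schedule(n_problems=10, n_trials=6, rng=None):
    """Draw n_problems different balanced sequences in random order."""
    rng = rng or random.Random()
    return rng.sample(balanced_sequences(n_trials), n_problems)


# One block of 10 six-trial problems; the seed only makes the draw repeatable.
schedule = block_schedule(rng=random.Random(0))
```

Each entry of `schedule` gives the side of the rewarded object on trials 1 to 6 of one problem; new stimulus objects would be assigned to every problem separately.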

FIG. 2. Family of learning set curves. 

With rhesus monkeys as subjects, learning sets evolve even though 
each problem is run for only a limited number of trials [21]. If six-trial 
nonspatial discrimination (hereafter called “discrimination”) problems 
are used, the development of the interproblem learning curves and the 
change in the form of the intraproblem learning curves are very clearly 
illustrated. As is shown in Fig. 2, the learning of the early problems takes 
place slowly, and the learning curve is typical of what has been called 
trial-and-error learning. Little, if any, demonstrable gain occurs during 
the early trials of any particular problem, and the form of the intra- 
problem learning curve, or the obtained segment of a particular intra- 
problem learning curve, is positively accelerated. As successive problems 
are presented, progressively greater performance gain occurs during the 
early intraproblem trials, and the form of the learning curve becomes 
sigmoidal and then changes to a curve of negative acceleration. By the 
time a hundred problems have been presented, there is indication of the 
formation of a sharp break in the learning curve; now, the trial 1-trial 
2 segment of the curve is best described by a different mathematical 
function from that which best describes the segment from trials 2 to 6. 
From approximately one hundred problems onward, the intraproblem 
learning curve partakes of the properties of the classical insight learning 
curve. Performance changes from chance to perfection or near perfection 
in a single trial, and subsequent errors are insignificant. From 90 to 97 
per cent correct responses are made on the second trial, and individual 
animals may run dozens of successive problems with no trial 2 errors, or 
no errors throughout trials 2 to 6. If the monkey by chance makes a cor- 
rect response on trial 1, it solves the problem without error, and if it by 
chance makes an error on trial 1, it immediately shifts to the correct 
object and makes no subsequent errors. 

Thus, the formation of an LS results in a complete transformation in 
the form of the intraproblem learning curves. The early problems are 
learned slowly, the learning curves are positively accelerated, and each 
learning curve is continuous. The late problems are learned rapidly or 
immediately, and the resulting learning curve is discontinuous, having 
two distinct components, a rapid component from trial 1 to trial 2 and a 
slow component from trials 2 to 6 [20]. 


FORM OF LEARNING SET CURVE 


The nature of the development of interproblem learning curves is 
well illustrated by plotting the percentage of correct responses made on a 
particular trial, such as trial 2, or a sequence of trials, such as trials 2 to 
6, during training on successive blocks of problems. In other words, just 
as the individual-problem learning curve is plotted with trials on the 
abscissa and performance efficiency on the ordinate, so is the LS curve 
commonly plotted with problems on the abscissa and performance effi- 
ciency on the ordinate. Interlocking information may be given by plot- 
ting successive problems or problem-blocks as the parameter on the 
individual-problem graph, or by plotting successive individual trials or 
selected trials or groups of trials as the parameter on the LS graph. These 
facts are illustrated in Fig. 3a and 3b. From these data it is self-evident 
that intraproblem learning becomes discontinuous after the LS has been 
formed, but that the interproblem learning curve, or LS curve, is con- 
tinuous. 
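
The two ways of plotting described above can be sketched as a pair of small tabulations. The record format (one list of 1's and 0's per problem, trial 1 first) and the sample records are invented for illustration; they are not data from any of the studies cited.

```python
def ls_curve(problems, block_size, trials=(2,)):
    """Interproblem (LS) curve: per cent correct on the chosen trials
    for each successive block of problems.

    problems: list of per-problem outcome lists, 1 = correct, 0 = error,
    with trial 1 at index 0. trials uses 1-based trial numbers.
    """
    curve = []
    for start in range(0, len(problems) - block_size + 1, block_size):
        block = problems[start:start + block_size]
        scores = [p[t - 1] for p in block for t in trials]
        curve.append(100.0 * sum(scores) / len(scores))
    return curve


def intraproblem_curve(problems):
    """Intraproblem curve: per cent correct on each trial, averaged
    over the given problems."""
    n_trials = len(problems[0])
    return [100.0 * sum(p[t] for p in problems) / len(problems)
            for t in range(n_trials)]


# Invented records for four six-trial problems (illustration only).
records = [
    [0, 0, 1, 0, 1, 1],
    [1, 0, 0, 1, 1, 1],
    [0, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 1],
]
trial2_by_block = ls_curve(records, block_size=2, trials=(2,))
trials2to6_by_block = ls_curve(records, block_size=2, trials=range(2, 7))
early_problems = intraproblem_curve(records[:2])
```

Calling `intraproblem_curve` on successive blocks yields the family of curves with problem-block as parameter, while calling `ls_curve` with different `trials` arguments yields the family with trial as parameter.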

The classical discrimination LS curve for rhesus monkeys that have 
been carefully tamed and preadapted to the Wisconsin General Test 
Apparatus (WGTA) is negatively accelerated with no indication or sug- 
gestion of discontinuity. Fortunately, we now have LS data on rhesus 
monkeys on a variety of learning problems and discrimination LS data 
on a considerable number of primate species and some nonprimates. 
Most rhesus LS curves, including discrimination, discrimination-reversal, 
and oddity, are negatively accelerated, but the LS curve for moderately 
difficult discrimination problems, such as the planometric (flat) problems 
studied by Harlow and Warren [28], has an initial flat portion 
during which little or no improvement is to be found. Similar results 
have been reported for individual rhesus monkeys on oddity problems 
[45] and may be demonstrated in occasional individual cases on almost 
any kind of learning problem. Furthermore, inadequate taming or abbreviated 
adaptation to the test situation will characteristically produce 
an initial flat portion in the learning curve for any animal on any test. 
In some of the more primitive primates, including the marmoset and the 
squirrel monkey, a prolonged initial flat portion is found in which there 
is little or no learning over a long series of discrimination problems. The 
data on the nonprimate forms are not of such technical elegance as to 
make direct comparison possible, but such data as exist are concordant 
with the trends described. In general, it may be stated that LS curves 
which have an initial flat function subsequently assume a negatively accelerated 
function, i.e., the curves become S-shaped with a shorter or 
longer foot to the S. 

These data also make it quite obvious that there is no single or ideal 
form of LS curve, but, rather, the form is a function of the problem, the 
subject, and the experimenter’s whimsies, recognized or unrecognized. 
All the LS data, however, yield continuous learning curves, and this con- 
tinuity of the LS curves is in striking contrast with the discontinuity of 
the intraproblem learning curves typically found late in LS formation. 
Regardless of any differences in the form of intraproblem and interproblem 
learning curves it would seem almost self-evident that some close 
relationship exists between single-problem learning and multiple-problem 
learning. Furthermore, as we shall subsequently see, this presumption is 
strongly supported by detailed analyses of the error factors (EF’s) whose 
elimination underlies learning. Other presently existing techniques for 
demonstrating or analyzing the relationship between intraproblem learn- 
ing curves on successive blocks of problems are worthy of note. Meyer 
[43] plotted the learning curve derived from the day 1 error data (the 
first four problems in a series of discrimination reversals) and showed 
this curve to be the decay function commonly found when percentage of 
errors is plotted against trials in a single problem. He then grouped the 
rest of the problems into three successive blocks and obtained the aver- 
aged intraproblem learning curve for each block. Plotting these three 
curves on the same axis as the day 1 curve in such a way that the trial 
2 value of each averaged curve coincided with the point of equivalent 
value on the day 1 curve, he found two results making for startling order- 
liness, as shown in Fig. 4. First, the scores for the rest of the trials fit 
extremely closely to the function described by the day 1 trials data. 
Meyer, in fact, concluded that one function fitted all the data when they 
were plotted this way. Secondly, the interval between the points at which 
trial 2 values matched the day 1 values was constant from block to block. 
Unfortunately, no retest has been made of this ingenious and surprising 
finding, nor has any measure of its generality been attempted. 
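
The alignment procedure attributed to Meyer above can be sketched under one loud assumption: that the day 1 decay function is exponential, e(x) = a * exp(-b * x). The text says only "decay function," so the exponential form, the parameter values, and the synthetic block data below are all hypothetical.

```python
import math


def position_on_reference(error_value, a, b):
    """Return x such that the assumed reference curve a * exp(-b * x)
    equals error_value."""
    return math.log(a / error_value) / b


def align_block(block_errors, a, b):
    """Shift a block's averaged curve so its trial 2 value lies on the
    reference curve.

    block_errors[0] is trial 1, block_errors[1] is trial 2, and so on.
    Returns (x, per cent errors) pairs for trial 2 onward, one unit of x
    per trial.
    """
    x2 = position_on_reference(block_errors[1], a, b)
    return [(x2 + k, err) for k, err in enumerate(block_errors[1:])]


# Hypothetical reference parameters and a synthetic block whose
# trial 2 to 6 errors were generated from the same curve at x = 4..8.
a, b = 50.0, 0.3
block = [50.0] + [a * math.exp(-b * x) for x in (4, 5, 6, 7, 8)]
aligned = align_block(block, a, b)
```

If the aligned points of every block fall on the reference curve, one function describes all the data; comparing the trial 2 positions returned for successive blocks would show whether the interval between them is constant.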
LEARNING SET AND EFFICIENCY OF LEARNING 


Comparison of LS curves obtained in a few studies suggests that the 
number of trials presented per problem is an important determiner of the 
over-all slope of the LS curve and the efficiency with which a predeter- 
mined criterion is reached. Although major systematic studies manipu- 
lating this variable have not been done, comparisons among existing ex- 
periments indicate that the smaller the number of trials per problem, the 
greater the number of problems required to reach a given criterion of 
learning, but the fewer the total number of trials to reach this criterion. 
There appear to be two major aspects to the trials per problem variable: 
the number of trials per problem and the amount learned on any given 
problem. Because the relationship between these two aspects of the vari- 
able depends upon the stage of LS development, we would expect some- 
what different predictions about interproblem transfer at different devel- 
opmental periods. 

Early in LS acquisition the intraproblem learning curve is positively 
accelerated. Little learning takes place on the first few trials of any prob- 
lem. One would expect, then, that if transfer were dependent upon 
amount learned, a small number of trials during the early stages of LS 
development would produce little or no improvement from problem to 
problem, whereas a number of trials large enough to allow effective intra- 
problem learning would produce considerable transfer. During advanced 
stages of LS development, however, the intraproblem learning curve be- 
comes negatively accelerated and eventually discontinuous. Maximal 
intraproblem performance gain occurs by the second trial, with very 
small gain in the later trials. At this stage of LS development one would 
expect, if amount of intraproblem learning rather than sheer number of 
trials per problem were critical, that a few, in fact, two, trials per prob- 
lem would be most efficient. 

As we will subsequently show, this hypothetical analysis very closely 
fits the facts. The available data support the hypothesis that the amount 
learned on a given problem is an important variable determining the 
amount transferred to the succeeding problem. Transfer from problem 
n - 1 to problem n, according to this hypothesis, is a monotonic increasing 
function of the amount learned on problem n - 1, rather than 
a function of the sheer number of trials presented on that problem. A 
few interesting predictions follow from this hypothesis. For one thing, 
from knowledge of the changes in the intraproblem learning curve dur- 
ing LS formation, one can estimate what would constitute the most effi- 
cient way to plan LS experiments. Maximal efficiency would be the 
attainment of a given criterion of LS performance in the fewest over-all 
number of trials. In general, the technique is to run the animal, on any 
given problem, through the trials on which the gains are greatest. To 
run the animal on further trials, where there is diminishing gain, would 
be wasteful of trials. Thus, on the early problems, when the intraproblem 
curves are S-shaped, it would be most efficient to run the animal a few 
trials beyond the point of inflection because it is around this point that 
the increase in learning is greatest. Several trials after the point of in- 
flection we begin to enter the realm of diminishing returns, and the 
problem should be discontinued and the next problem begun. Late in 
LS formation, when the intraproblem curve is a negatively accelerated 
growth curve, it would be most efficient to run the animal for two trials 
per problem and no more, since it is from trial 1 to trial 2 that the great- 
est increment in learning is made. Between the very early and very late 
problems the number of trials per problem would be gradually reduced. 
Although the currently available data do not warrant greater speci- 
ficity of speculation than we are engaging in at present, nevertheless the 
trial at which the animal should be switched to the next problem may be 
precisely determined if we assume some exactly specifiable relationship 
between learning and transfer, e.g., amount transferred is directly propor- 
tional to the amount learned (T = KL). Marvin Levine at our labora- 
tory has demonstrated as an exercise that when the relationship is stated 
in such a mathematically precise manner, exact predictions may be made. 
Other predictions which follow from our assumption that transfer is 
a direct function of amount of learning deal with variables producing 
differences in intraproblem learning. Thus, within a group of animals 
receiving identical LS training procedures we would predict a significant 
positive correlation between average intraproblem performance and the 
speed with which a given LS criterion is reached. Also, if number of 
trials is held constant, the more difficult the intraproblem learning conditions 
(e.g., greater similarity of the stimuli), the slower the LS acquisition. 
As we shall see later, the dependence of interproblem transfer upon 
intraproblem learning is subject to an important limitation. The capacity 
to acquire LS may not always be present in an organism even though it 
can learn single discriminations. The hypothesis of transfer, therefore, 
must be modified to hold only when animals of equal capacity are con- 
sidered. Within this limit, however, the hypothesis promises to be fruitful. 
The available data appear to substantiate the hypothesis that it 
is the amount of intraproblem learning which determines the interprob- 
lem transfer. Hayes, Thompson, and Hayes [29] found that a chimpanzee 
which began training at a fixed rate of two trials per problem remained 
at about chance level for 300 problems but improved rapidly when 
shifted to 10 trials per problem. Unpublished Wisconsin data on two 
rhesus monkeys trained under similar schedules yielded comparable 
results. Miller, Murphy, and Finocchio [48] demonstrated the development 
of LS using three trials per problem from the beginning. Early 
learning was slow. After 192 problems, correct performance on trial 
2 approximated 60 per cent. Similar results were obtained by Davis, 
McDowell, and Thorson [9] on four-trial discrimination problems. After 
128 problems, the animals had not yet attained 60 per cent correct 
responses on trial 2. 

With respect to performance late in LS development, several studies 
indicate that a small number of trials produces efficient learning. In one 
study, Harlow [20] switched the animals after 32 problems of 50 trials each to 
6-trial problems without significant disruption in the rate of improvement 
in LS development. Braun, Patton, and Barnes [6] showed efficient ac- 
quisition late in learning when the number of trials per problem had been 
reduced to three. Finally, Hayes, Thompson, and Hayes [29] report that 
two trials per problem are effective for the chimpanzee in advanced 
stages of discrimination LS formation, and these data are in keeping 
with unpublished results obtained on monkeys at Wisconsin. 
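
The efficiency argument of this section can be given a rough numerical form. The sketch assumes that transfer is proportional to amount learned (T = KL) and that each trial costs the same, so that the most economical stopping point is the number of trials with the greatest gain per trial. This measure, the function names, and both curves are illustrative assumptions; the sketch is not Levine's derivation.

```python
import math


def gain_per_trial(curve):
    """For each candidate number of trials n >= 2, return the gain in
    probability correct over trial 1, divided by n.

    curve[i] is the probability of a correct response on trial i + 1.
    """
    base = curve[0]
    return {n: (curve[n - 1] - base) / n for n in range(2, len(curve) + 1)}


def most_efficient_trials(curve):
    """Number of trials per problem with the largest gain per trial."""
    gains = gain_per_trial(curve)
    return max(gains, key=gains.get)


# Hypothetical early-problem curve: S-shaped, inflection at trial 10.
early = [0.5 + 0.45 / (1.0 + math.exp(-(t - 10))) for t in range(1, 21)]

# Hypothetical late-problem curve: nearly all gain from trial 1 to trial 2.
late = [0.5, 0.95, 0.96, 0.97, 0.97, 0.98]

best_early = most_efficient_trials(early)
best_late = most_efficient_trials(late)
```

Under these assumptions the S-shaped curve is best cut off a few trials past its point of inflection, and the late curve after two trials, which agrees with the verbal argument.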


A MATHEMATICAL FORMULATION OF LEARNING SET 


Currently, an attempt is being made to give mathematical formulation 
to LS phenomena. Restle is expanding a mathematical model developed 
to handle simple discrimination learning.¹ He conceives of the 
stimulus situation in simple discrimination learning as containing two 
types of cues: cues which are relevant or correlated with the reward and 
cues which are irrelevant or uncorrelated with the reward. During learn- 
ing, relevant cues become conditioned and irrelevant cues become 
adapted. Both processes are exponential, and the rates are determined by 
the numbers of both types of cues. 
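
A minimal sketch may help fix ideas about the two exponential processes. It is written in the spirit of the description above and should not be read as Restle's own equations: it assumes that the common rate theta equals the proportion of relevant cues, that conditioned cues always yield the correct response, that unconditioned relevant cues and unadapted irrelevant cues contribute at chance, and that adapted cues drop out of the situation. The function name and cue counts are hypothetical.

```python
def p_correct(n, relevant, irrelevant, theta=None):
    """Probability of a correct response on trial n (n = 1, 2, ...).

    relevant, irrelevant: numbers of cues of each type.
    theta: fraction of remaining cues conditioned (or adapted) per trial;
    by assumption it defaults to relevant / (relevant + irrelevant).
    """
    if theta is None:
        theta = relevant / (relevant + irrelevant)
    remaining = (1.0 - theta) ** (n - 1)      # exponential decline
    conditioned = relevant * (1.0 - remaining)
    unconditioned = relevant * remaining
    unadapted = irrelevant * remaining
    # Conditioned cues count fully, undecided cues count at chance (0.5),
    # and adapted irrelevant cues no longer enter the denominator.
    numerator = conditioned + 0.5 * (unconditioned + unadapted)
    return numerator / (relevant + unadapted)


# Hypothetical situation: 3 relevant and 7 irrelevant cues.
curve = [p_correct(n, relevant=3, irrelevant=7) for n in range(1, 11)]
```

The curve starts at chance, rises monotonically toward 1, and rises faster when relevant cues make up a larger share of the situation.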

By adding a few postulates, he has extended the model to handle LS 
data. His basic postulate is that in the LS situation there exist certain 
cues which are relevant and common to all problems. In oddity LS, 
“oddness” provides such a cue. In discrimination LS, the cue is the ob- 
ject reinforced on the preceding trial. In LS formation there are, then, 
three types of cues: type a cues, which are relevant throughout and per- 
mit transfer from problem to problem; type b cues, which are relevant 
within any one problem but which vary from problem to problem; and 
type c cues, which are irrelevant within any one problem. Type c cues 
become adapted in a manner analogous to their adaptation in simple dis- 
crimination. Type b cues, though relevant within any one problem, are 
irrelevant in the long run, and therefore eventually become adapted. The 
type a cues, of course, become conditioned. 


¹ Dr. Frank Restle, Michigan State University, has given us permission to 
present this outline form of his mathematical formulation of learning set theory. 
Stating in mathematical terms the assumption that the processes of 
conditioning and adaptation proceed exponentially, Restle is able to de- 
rive equations expressing the probability of a correct response at any 
point on an LS curve. Also, he has derived theoretical descriptions of 
findings which already exist in the LS literature. He has produced curves 
which closely fit data obtained by Riopelle [57] in a reversal learning 
situation, and a curve describing the within-problems learning curve 
found by Meyer [43]. Using data obtained by Warren [68], he has pre- 
dicted performance in a discrimination learning situation in which com- 
binations of cues are relevant (color + form, form + size, size + color, 
and color + size + form), on the basis of performance in situations 
where only one of the three types of cues is relevant. 

Although Restle notes several limitations which currently exist in his 
formulation, it is nevertheless clear that his model provides the most 
rigorous attempt to date to give theoretical organization to LS data. 


ONTOGENETIC AND PHYLOGENETIC DEVELOPMENT AND 
LEARNING SET FORMATION 


The importance of LS has sometimes been questioned on the grounds 
that it is merely an instance of transfer of training. The assumption has 
also been made that intraproblem learning and interproblem learning are 
the same process. Interproblem learning, according to this view, is no 
new phenomenon of particular psychological importance. There is evolv- 
ing, however, a considerable body of evidence showing that, in spite of 
the similarities between intraproblem and interproblem learning, the LS 
learning is dependent upon some capacity factor or factors transcending 
those needed for intraproblem learning of equivalent problems. 

Probably the most definitive measure of learning difficulty is the 
absolute or relative age at which various problems can be solved by rep- 
resentative members of a species, particularly if past experience is equated 
or controlled. This technique has been used with human beings in the 
construction of mental maturity scales and intelligence tests and has given 
us such constructs as maturation and mental age. The age at which most 
of our standard laboratory learning problems can be solved by infant 
and young monkeys, separated from their mothers at birth and raised 
in a standardized laboratory environment, is now being determined by 
the Wisconsin workers [27]. One of the most striking facts already dis- 
closed is the large temporal separation between the development of ef- 
fective individual-problem learning and the development of LS’s for the 
same kind of problem. 

Rhesus monkeys can solve individual discrimination problems in the
WGTA situation by 60 days of age. The learning is reasonably rapid
and efficient on the first problem, requiring from 50 to 100 trials. In 
spite of the fact that these young animals are capable of effective intra- 
problem learning, they appear to be completely incapable of forming a 
discrimination LS. They can learn a discrimination problem or a series 
of discrimination problems, but there is no indication, or practically no 
indication, of interproblem transfer. The capacity to form a discrimina- 
tion LS certainly does not develop before 150 days of age in the rhesus 
monkey; at 200 days of age these animals show only limited interprob- 
lem learning capability, and full LS capacity probably does not develop 
until well into the second or third year of life. 

The operation of some capacity factor or factors in the formation of 
LS’s is illustrated as clearly by the phylogenetic as by the ontogenetic 
data, although these data are limited both in terms of species and prob- 
lems which have been studied. All available evidence indicates that sub- 
primate animals possess only the most rudimentary capabilities for solving 
discrimination LS’s. Koronakos and Arnold [38] have recently presented 
evidence that rats may show limited interproblem discrimination learn- 
ing. Twenty rats were trained on each of seven discrimination problems 
to a criterion of 80 per cent correct responses in 20 consecutive trials or 
on a total of 80 trials if this criterion was not attained. Five subjects met 
the criterion for all problems, and their composite performance curve 
shows improvement, attaining a maximum of approximately 70 per cent 
correct responses on trials 1 to 20 of problem 6. It must be recognized 
that the number of problems was small and the basis for the improvement 
is not specifically defined.
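
The stopping rule described for the rat study can be sketched in code. This is a minimal illustration and not the scoring procedure reported by Koronakos and Arnold: it assumes that "80 per cent correct responses in 20 consecutive trials" means at least 16 correct responses within any sliding window of 20 trials, and the function name and its parameters are hypothetical.

```python
def trials_to_criterion(outcomes, window=20, required=16, max_trials=80):
    """Return the 1-based trial on which the criterion is first met, else None.

    outcomes: sequence of booleans, True for a correct response.
    Training is assumed to stop after max_trials if the criterion is not met.
    """
    outcomes = list(outcomes)[:max_trials]
    correct_in_window = 0
    for i, hit in enumerate(outcomes):
        correct_in_window += int(hit)
        if i >= window:
            # Drop the trial that has just left the sliding window.
            correct_in_window -= int(outcomes[i - window])
        if i + 1 >= window and correct_in_window >= required:
            return i + 1
    return None


# Ten errors followed by sixteen correct responses: the window first holds
# sixteen correct responses on trial 26.
print(trials_to_criterion([False] * 10 + [True] * 16))
```

With `window=25` and `required=20` the same check would describe a criterion of 20 correct out of 25 consecutive responses.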

Somewhat more precise evidence for discrimination LS formation has 
been demonstrated by Warren and Baron [69] for the cat. Four subjects 
were trained on 340 discrimination problems. The first 4 problems were 
learned to a criterion of 20 correct out of 25 consecutive responses, and 
the others were presented for fixed periods of trials, 50 trials for problems 
5 to 80, 25 trials for problems 81 to 140, and 10 trials for problems 141 
to 300. The asymptote for performance on trials 2 to 10 is approximately 
70 per cent. The trial 2 performance data are not given. The authors 
state that: “Although the data presented here show that the capacity 
for acquiring learning sets is not restricted to the order Primates, it is 
clear that the learning set curves for cats quickly reach an asymptotic 
level of performance which is much lower than for monkeys” [69, p.

Actually the LS formation of nonprimates and primates is differentiated
not only by the asymptotic level of performance but also by the
level of performance throughout most of the learning. For the primates
the intraproblem learning curve after LS formation shows clear evidence
of discontinuity, a phenomenon not reported for any subprimate form 
to date. 

An extremely interesting picture relating LS capacity to evolutionary 
development is obtained by comparing discrimination LS formation by 
representative families and genera within the primate order. The rela- 
tively primitive marmoset and squirrel monkey have been shown by 
Miles and Meyer [47] and by Miles [46] to develop discrimination LS’s 
much more slowly than the rhesus monkey. Performance by the three 
tested subjects of each species is hardly above chance after 200 six-trial 
problems, and at the end of 1,000 problems the trial 2 performance is 
far from perfect, as is evident in Fig. 5. It should be pointed out,
however, that the trial 2 performance is far better than any that has
been demonstrated for any subprimate form, and the intraproblem learning
curve late in LS formation is similar to that found for the rhesus monkey.
As would be predicted in terms of cortical structure, the performance of
the squirrel monkey is superior to that of the marmoset, and there is
very little overlapping on LS performance.

Fig. 5. Learning set performance of squirrel monkeys and
marmosets compared with rhesus monkeys. Reproduced from
J. comp. physiol. Psychol. [47].

Additional data on LS formation among various genera within the 
primate order have been supplied by Riopelle [58], who tested squirrel 
monkeys, cebus monkeys, and spider monkeys on series of six-trial dis- 
crimination problems. Figure 6 shows that performance of the cebus 
monkeys lay between that of the squirrel monkeys and the spider 
monkeys, and the performance of the spider monkeys closely approxi- 
mated that of rhesus macaques. It is interesting in passing to note that 
the cebus monkey, the most able of the subanthropoids in instrumental, 
tool-using tasks, proves to be inept on the “academic-type” discrimination
problems, and that the spider monkey performs very efficiently on
discrimination problems despite its missing or vestigial thumbs.

Comparable discrimination LS performance has been obtained on
rhesus and Java macaques, and similar performance was obtained on a 
single well-motivated langur at the Wisconsin laboratory. There is every 
reason to believe that the mangabey and baboon would perform equally 
effectively. The small variance in LS performance in the catarrhine 
monkeys contrasts with the great interspecies differences found in the 
platyrrhine forms. All existent discrimination LS data on all measured 
species are in keeping with the anatomical data bearing on cortical com- 
plexity, and it is obvious that LS techniques are powerful measures for 
the intellectual ordering of primate and possibly even nonprimate forms.

The chimpanzee and human data are not directly comparable with
the monkey data because of experiential factors with human subjects and 
motivational differences in the testing situations with chimpanzees and 
human subjects, but such data as exist are in accord with phyletic pre- 
dictions. Although Hayes, Thompson, and Hayes state that “the data 
available suggest that when pre-experimental experience is equated, man, 
chimpanzee, and rhesus monkeys differ very little in their ability to 
acquire sets for object-discrimination learning” [29, p. 103], their position 
must be challenged on a number of bases, including data of other
experiments. The human subjects used by Hayes were quite immature,
ranging from two to four years of age; our data suggest that infant 
rhesus monkeys of roughly comparable maturational age are also inept in 
LS formation. The comparison of the behavior of young children and 
adult monkeys departs from a strict interspecies comparison. Further- 
more, human children and presumably young chimpanzees are harder 
to motivate than well-tamed monkeys, which typically respond with the
machinelike characteristics of institutionalized subnormal children. Hayes,
Thompson, and Hayes are obviously aware of these problems relating 
to species differences, differences in living conditions, and differences in 
adaptation to the test situation. The remarkable behavior of the oc- 
casional mature chimpanzee [52] that remains emotionally tractable and 
test-motivated makes it appear unlikely that this species would negate 
the generalization that position in the primate order and LS capability 
are highly related. The data obtained by Harlow [20] and Roberts [60] 
on nursery school children indicate that these subjects possess LS ability 
well beyond that of mature monkeys. The relative status of the mature 
rhesus monkey and man is self-evident. 

By and large the phylogenetic data demonstrate that LS formation 
is closely related to evolutionary position, as conventionally described, and 
to cortical complexity in so far as this characteristic has been effectively 
measured. These data contrast with the little that is known about single- 
problem discrimination learning, for at present there is no evidence to 
indicate that single problems are learned more rapidly by members of one 
genus, family, or order than another within the class of mammals if 
conditions of maturation and past experience are, or can be, equated and 
if equally effective test conditions, adapted to the subjects, are used. 
Hebb has even gone so far as to conclude: “In the infant, the evidence 
is conclusive that the rate of the first learning is slowest in the highest 
species, quite apart from maturation” [30, p. 115]. We would question
this conclusion because initial discrimination learning in the infant 
monkey is at least as rapid as initial discrimination learning in the infant 
rat, but we would concede that single-problem learning is relatively 
uncorrelated with phyletic position among mammals, whereas LS learn- 
ing is highly related to phyletic position. These data, therefore, support 
the position that some capacity factor or factors operate to differentiate 
single-problem and multiple-problem acquisition. 

One may conclude, then, that there is a capacity factor or factors 
which influence speed of LS formation, asymptote of maximally efficient 
LS performance, and even the ability to form effective LS’s. The factor 
or factors may also determine the minimal number of trials per problem 
essential to establish the LS, and probably determine the maximal com- 
plexity of problems amenable to LS formation, such as conditioned 
response, spatial discrimination, nonspatial discrimination, oddity prob- 
lem, or problems of greater complexity. 


LEARNING SET FORMATION AND CORTICAL LOCALIZATION 


Above and beyond the ontogenetic and phylogenetic data which assign
a higher level of capacity to LS than to single-problem learning,
there exist data from psychophysiology offering support to the separability
of these two learning functions. In recent years, the effect of various 
temporal lobe lesions on visually guided behavior has attracted wide 
interest. Much of the temporal lobe lies within the classically described 
posterior associative cortex, and although the temporal lobe is rostral 
to the classical visual associative cortex, studies involving stimulation [54]
or extirpation [44] give unequivocal proof that the temporal lobe, or 
portions of the temporal lobe, serves visual associative functions. 

It was first suggested by Riopelle, Alper, Strong, and Ades [59] 
that large temporal lesions which involve both the hippocampus and 
hippocampal gyrus abolish object-discrimination LS’s without producing 
equivalent deficit in single-problem discrimination learning. Such a find- 
ing might be taken as providing additional evidence which differentiates 
single-problem and multiple-problem learning. Supporting data were 
provided by Chow [7], who showed that bilateral extirpation of the 
neocortex abolishes the LS for pattern discriminations during the brief 
period of postoperative testing, but at the same time, it does not affect to 
a measurable degree performance on individual pattern-discrimination 
problems. Meyer's [44] study lends general support to the position that 
LS’s are more vulnerable to temporal resection than are single problems, 
but the issue is clearly complicated by the monkey’s past experience, the 
size of lesion, and individual differences. The greater susceptibility of 
multiple- as contrasted with single-problem learning to temporal lesions 
may result from either quantitative or qualitative differences in difficulty. 
Both Warren and Harlow [70] and Chow [7] have shown that pattern- 
discrimination LS’s are more susceptible to temporal lesions than are 
object-discrimination LS’s. Here again the differences could be at- 
tributable to qualitative or quantitative differences in difficulty inherent 
in the two kinds of LS’s. The problem is, however, unresolvable at our 
present state of knowledge. Regardless of the mechanisms involved, the 
operative studies suggest some separability of single-problem learning 
from LS learning and give supporting evidence to that already provided 
by the ontogenetic and phylogenetic data. 


THE SIGNIFICANCE OF LEARNING SET THEORY 


Up to this point we have concerned ourselves primarily with the 
technical aspects of LS formation. We conceive of LS, however, as more 
than a technique; we believe it to be a model with broad explanatory 
capabilities for a wide range of psychological phenomena. 

One of the most striking phenomena associated with LS formation is 
the change in problem difficulty following multiple-problem practice. 
The initial problems of a class are learned slowly in a trial-and-error
fashion. After LS formation the same kind of problem is solved
immediately in an insightful fashion. Furthermore, we believe that this
phenomenon has great generality and that all insightful learning of all 
problems by all animals is the resultant of LS formation. In other words, 
no animal can solve problems insightfully, or with maximal efficiency, 
without a history of earlier solutions of similar problems. 

The evidence for learning-setless insight was summarized by the 
author in an earlier publication [23, pp. 457-466]. Köhler’s [37]
observations of insightful learning by chimpanzees have not stood the test of
time or experimentation as indicated by the researches of Schiller [61],
Birch [5], and Bingham [4]. Maier’s [41] discovery of reasoning in rats 
represented an intellectual tour de force independent of the data. It is, of 
course, impossible to control experience with human adult subjects. The 
Tolman [67] insight or inferential expectation studies were not adequately 
controlled, and the interpretations have been questioned by subsequent 
investigators [11, 34]. The two classical studies on insight learning by 
young children in problem situations [2, 42] demonstrated a surprising 
absence of insight even though at least some of the situations were of a 
kind that must have appeared frequently in the children’s past experience, 
and so the children must have had opportunities for LS formation. As of 
the present time there is no satisfactory evidence for the existence of 
unlearned insight. 

Using monkey subjects whose experience was controlled, we have 
obtained evidence that insightful learning through LS formation is a 
widely generalized principle. The phenomenon appears in discrimination 
learning, discrimination-reversal, oddity learning, and nonoddity learn- 
ing, and doubtless would appear in many other problems. 

It is the author’s hypothesis that all concepts such as triangularity, 
middle-sizedness, redness, number, and smoothness evolve only from LS 
formation. Unfortunately, most of the major concept-formation studies 
on naive subjects antedated LS theory, and as a result, the data provide 
no definitive test. The study by Andrew and Harlow [3], involving the 
training and testing of monkeys on a concept of triangularity, approaches 
an LS model for testing concept formation. These investigators trained 
16 monkeys to discriminate a triangle and circle and then tested for 
generalization on 100 new pairs of stimuli. Performance was only 6 per 
cent better than chance. The monkeys were then trained to select the 
more triangular member in 50 of the original 100 pairs, after which 
they were given generalization tests on the other 50 pairs. Performance 
was now beyond 70 per cent correct responses, indicating a significant 
improvement in the formation of a generalized concept of triangularity. 
It is probable that unlimited training on a single problem would never 
produce significant concept formation; indeed, the inverse should occur:
with increased practice on a single problem, generalization tendencies
should decrease, thus narrowing the possibilities for the formation of a 
concept. 

It is not surprising that none of the classical concept-formation studies 
obtained efficient generalization to a wide range of stimuli by training on 
a single problem. Broad stimulus generalization, which is what we call 
concept formation, was obtained only after extensive training on a 
wide range of problems of a class, even though the informal nature 
of the experimental designs makes impossible real quantification of the 
mechanism involved. The studies by Fields [12, 13], Gellermann [14, 
15], Karn and Munn [33], Smith [63], Neet [51], Andrew and Har- 
low [3], Weinstein [71], and Hicks [31] make it quite clear that con- 
cept formation is dependent on an LS mechanism, and this generalization 
holds for all mammalian species that have been tested. 

In view of the extensive literature on transposition, both human [1, 
40] and subhuman [35, 65, 66], it is surprising that no attempt has been 
made to extend these researches beyond the model of single-problem
training and subsequent transposition testing. Actually, Spence in his 
original study on the intermediate-size problem tested his chimpanzees on 
a short series of single problems followed by individual tests and re- 
ported: “The data of two subjects, Lia and Cuba, suggest that response 
might have been on the basis of configurational or relational properties of 
the stimulus complexes” [66, p. 270]. Such findings suggest that a 
thorough attack on the family of transposition problems by means of LS 
methods might provide data making possible the integration of divergent 
theories. 

Consideration of the changing intraproblem learning curves leads to 
predictions relating motivation to learning. Early intraproblem learn- 
ing is slow and attended with multiple errors. Late intraproblem learn- 
ing is rapid and essentially errorless. Errors lead to response refusals 
and relatively random intertrial behavior, characteristic of responses 
to frustration. One would, therefore, predict a progressive reduction 
in responses indicative of emotional disturbance as an LS develops. 
General observation supports this position even if experiments controlling 
learning experience as contrasted with LS learning experience are 
lacking. One would also predict that frustration would be minimized 
if the problems presented early in LS formation were run until a con- 
siderable degree of mastery had been obtained, and that late in LS 
formation frustration would be uncorrelated with problem length. Gen- 
eralizing broadly to human behavior, we hold that original learning 
within an area is difficult and frustrating, but after mastery, learning 
within the same area becomes simple and effortless. Both the intellectual 
and emotional problems associated with learning to read disappear so
completely that reading becomes an automatic process for most students.
Similar phenomena are associated with such learnings as those involved 
in arithmetic and foreign languages. It remains as a possibility that the 
resistance of elderly people to new learning may relate more to as- 
sociated problems of frustration than to learning limitations. 


THE NATURE OF ERROR FACTOR THEORY 


The conventional graphical way of presenting learning data is in 
terms of a curve demonstrating improved performance, defined as a 
reduction in the number or percentage of errors or response time or an 
increase in the number or percentage of correct responses, over a series 
of trials. The effect of other variables such as drives, incentives, temporal 
ordering of trials, and attributes of the stimuli on efficiency of problem 
solution are indicated by plotting families of learning curves on a single 
graph. This treatment of problem solution carries the suggestion that the 
learning involved is some unfractionable unit, such as habit strength, 
with which other variables interact. 

It is the author’s contention, however, that in most learning problems 
multiple factors are operating differentially on every trial to determine 
the course of the plotted learning curve. Such a concept is implicit in 
Spence’s [64] analysis of discrimination learning, in which changes in 
two factors, a positional and a nonpositional factor, are described. The 
long struggle to attain precise nonspatial delayed responses presented a 
wealth of factual and theoretical information on antagonistic operation 
of positional and nonpositional factors during nonspatial delayed response 
learning [22, pp. 223-231]. 

As long as one studies only single problems it is difficult or impossible 
to conduct more than limited analyses of the multiple factors which 
operate to affect learning on the various trials. Single-problem learning 
at the level of difficulty of nonspatial discrimination is usually slow, and 
the patterns of rewards and nonrewards over the long succession of trials 
are too complex for differential analysis. The considerable variability 
among animals, particularly on problems which are learned slowly, adds 
to the difficulty of analysis. Still another complication is that imposed 
by the stimulus preferences animals bring to the experimental situation, 
and with single problems these preferences may distort the learning 
data. 

Effective analyses of the factors underlying learning are possible only 
if one utilizes data from relatively large numbers of problems given to 
each of a number of subjects. The time factor in the collection of such 
extensive data is considerably reduced if the problems are presented for 
only a small number of trials, as has been done in the LS studies. Two
techniques have proven useful in this kind of analysis: one is the
utilization of a number of different balanced sequences controlling the
positions of the correct and incorrect stimulus objects within each problem
and over blocks of problems. The author's original analysis [21, p. 27] 
employed ten such sequences, which included four sequences in which the 
position of the correct stimulus object shifted on the second trial, and six 
sequences in which the first-trial position of the stimuli remained con- 
stant on the second trial. On two of these last six sequences, the position 
of the positive stimulus remained constant through the first three trials. 
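
The composition of such a set of sequences can be checked mechanically. The ten sequences themselves are not listed here, so the sketch below uses hypothetical left/right sequences, invented only to match the stated counts (four shifting on trial 2, six constant on trial 2, two of those six constant through trial 3); the names `SEQUENCES` and `classify` are likewise assumptions.

```python
from collections import Counter

# Hypothetical six-trial sequences giving the side (L or R) of the correct
# object on each trial. They are invented for illustration and are not the
# sequences used in the original analysis [21].
SEQUENCES = [
    "LRLLRR", "RLRRLL", "LRRLRL", "RLLRLR",  # position shifts on trial 2
    "LLRLRR", "RRLRLL", "LLRRLR", "RRLLRL",  # constant on trial 2, shifts on trial 3
    "LLLRRR", "RRRLLL",                      # constant through trial 3
]


def classify(sequence):
    """Label a sequence by how long the correct position stays fixed."""
    if sequence[0] != sequence[1]:
        return "shift_on_trial_2"
    if sequence[1] != sequence[2]:
        return "constant_through_trial_2"
    return "constant_through_trial_3"


counts = Counter(classify(s) for s in SEQUENCES)
print(dict(counts))
```

Each hypothetical sequence also places the correct object equally often on the left and on the right, which is one simple sense in which a sequence may be called balanced.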

A second technique involves presentation of a single stimulus for a 
predetermined number of trials, followed by the conventional two- 
stimulus discrimination presentation. This method offers many pos- 
sibilities for controlling experimental conditions relating to position, 
stimulus-object properties, and incentives. Actually, the general method 
was devised to test generalization [16, 65] and incentive conditions [18] 
before its potentialities for error factor (EF) analysis were explored [26]. 

Analysis of the data from a discrimination LS experiment employing 
multiple positional sequences enabled the author [21] to demonstrate 
in 1950 that it was possible to identify and trace the course of several 
distinct classes of errors made by monkeys on successive trials and over 
the series of problems. These classes of errors were described as being 
produced by EF’s, which may be conceived of as reaction tendencies, or 
interactions between or among reaction tendencies, in the monkeys, lead- 
ing to ordered, but inappropriate, responses to the problems presented. 
As these orderly but inappropriate responses are reduced, the percentage 
of errors diminishes progressively, and learning is said to be taking place. 

In his first analytical study, the author identified four EF’s operating 
in the discrimination situation and named them stimulus-perseveration, 
differential-cue, response-shift, and position-habit EF. 

The operation of the stimulus-perseveration EF is deduced from 
errors involving repetitive choice of the incorrect stimulus object. An 
exact determination of the number and percentage of errors attributable 
to stimulus perseveration is difficult or perhaps impossible because these 
errors are confounded with those produced by other EF’s. One can, 
however, by various computational devices [21] approximate the number 
and percentage of stimulus-perseveration errors, and a very good estimate 
can be made of the relative importance of this EF during the course of 
both intraproblem and interproblem learning. 

It is presumed that this EF results from the subjects’ stimulus 
preferences, absolute or relative, innate or learned. If an animal’s re- 
sponses were completely dominated by stimulus preference, and if this 
were not influenced by learning, no errors would be made on half the 
problems and 100 per cent errors would be made on the others.
Observation of the monkey’s behavior indicates that certain classes of stimuli,
including large stimuli, tall, unstable stimuli, and many-pointed, bright, 
metallic stimuli characteristically elicit avoidant or hesitant behavior. 
Other stimulus objects elicit positive or approach responses although at 
the present time we cannot define their characteristics precisely. In a 
series of problems generalization from the physical properties of pre- 
viously encountered stimuli, correct or incorrect, must also operate to 
produce preferential responses, either adient or avoidant. 

Even after considerable pretraining and adaptation, stimulus-preference
errors are important. Early in discrimination learning, a monkey
may choose the unrewarded stimulus on every trial of a six-trial problem,
and frequently it makes four or five consecutive errors. To date, several
variables have been found to affect the course of stimulus perseveration
during LS formation. The actual number and percentage of stimulus-perseveration
errors will be a function of the criterial trial or trials chosen,
amount and nature of adaptation, discrimination learning experience, 
and the stimuli chosen for presentation. The age of the animal appears 
to be important in that stimulus perseveration is stronger and more per- 
sistent for young monkeys than for mature monkeys with comparable 
laboratory experience. The importance of stimulus perseveration, of 
course, varies with successive blocks of problems during LS formation. 
Figure 7 shows the reduction of perseverative errors during the course 
of testing on 312 discrimination problems [21]. It will be noted that both 
the numbers of consecutive errors and the maximal length of con- 
secutive-error sequences are progressively reduced during learning. 
Usually, the response tendencies producing stimulus perseveration are 
among the first to be suppressed or inhibited. 
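
The two quantities just mentioned, the number of consecutive-error sequences and their maximal length, can be tallied from raw trial records. The sketch below is only a descriptive count under assumed conventions (each problem is a list of booleans, True for a correct response, and a "consecutive-error sequence" is taken to be a run of two or more errors); it is not the computational device of the original analysis [21], and it does not separate perseverative errors from those produced by other EF's.

```python
from itertools import groupby


def error_run_lengths(correct):
    """Lengths of each unbroken run of errors in one problem's trial record."""
    return [len(list(run)) for is_correct, run in groupby(correct) if not is_correct]


def summarize(problems):
    """Count runs of two or more errors and find the longest run in a block."""
    runs = [n for trials in problems for n in error_run_lengths(trials)]
    repeated = [n for n in runs if n >= 2]
    return {"repeated_error_runs": len(repeated), "longest_run": max(runs, default=0)}


# Hypothetical records for an early and a late block of six-trial problems.
early_block = [
    [False] * 6,
    [False, False, False, False, True, True],
    [True, False, True, True, True, True],
]
late_block = [
    [False, True, True, True, True, True],
    [True] * 6,
]
print(summarize(early_block))
print(summarize(late_block))
```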

Bearing on the learned generalization aspect of stimulus perseveration 
and its suppression is an experiment by Riopelle [57], who studied the
role of specific stimulus objects in a succession of problems with rhesus
monkeys by presenting each day five discrimination problems on a six- 
trial basis and then presenting, again, the stimulus objects from either 
the first or the fourth problem of the day. The reappearance of the 
objects, however, was with reversed reward values, i.e., a
discrimination-reversal problem. Five new problems and one reversed problem were
tested each day for 63 days. Learning data on the new problems over 
the course of the experiment closely paralleled other discrimination LS 
data. Reversal-problem learning developed more slowly. Until midway 
in training, the monkeys chose the previously correct object on trial 1 of 
the reversal problems significantly more often than chance, but late in
training they approached chance on trial 1 performance. Through the 
first 21 days, the subjects showed marked stimulus perseveration on trial 
2 of the reversals, but during the last 42 days, performance closely 
paralleled that on the new problems. For trials 3 to 6, stimulus persevera- 
tion was marked on reversals only during the first week of training. Com- 
parison of performance on reversals utilizing problem 1 and problem 
4 revealed that negative transfer was greater for problem 4 reversals 
through the first two-thirds of training, but during the last third there 
was no difference. Riopelle termed these phenomena of progressive re- 
duction in transfer “transfer suppression,” and his findings provide 
evidence that monkeys are able to transfer learned stimulus values and, 
more important, they can learn to suppress this transfer effect. 

A more persisting EF than stimulus perseveration is that of differential 
cue. The operation of the interacting response tendencies which produce 
these errors was described in detail and given theoretical importance by 
Spence [64] in 1936. Spence pointed out that on any particular trial 
of a nonspatial discrimination there is ambiguity between the object 
rewarded and the position rewarded. When a subject displaces a stimulus 
and obtains a reward, both the stimulus object hiding the food and the 
position of the object and food are simultaneously rewarded. To solve 
the nonspatial problem, however, the animal must learn to choose the 
object previously rewarded whether or not on a particular trial it occupies 
the position previously rewarded. 

The importance of differential-cue errors may be determined by com- 
paring the difference in the number of errors made, e.g., on trial 2 of 
problems in which the position of the correct stimulus does not change 
from trial 1 with problems in which the position of the correct stimulus 
does change. If the number of problems under each of the two condi- 
tions is equal, and if the number of errors made on the problems with 
changed position is divided by the number of errors made on the prob- 
lems with unchanged position, one obtains a ratio, the comparable-trial 
error ratio (CTE ratio), which approaches unity as the response
tendencies toward position are inhibited or suppressed. By holding
position and object constant for more than two trials and meeting the 
conditions of changing the position of the correct stimulus on half the 
problems, it becomes possible to measure differential-cue errors on trials 
subsequent to trial 2. 
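
The CTE ratio as defined above reduces to a single division, which the following sketch makes explicit. The data layout (one pair per problem, recording whether the correct object changed position on trial 2 and whether trial 2 was an error) and the example counts are hypothetical and serve only to show the arithmetic.

```python
def cte_ratio(problems):
    """Comparable-trial error ratio for trial 2.

    problems: iterable of (position_changed, error_on_trial_2) pairs.
    Returns errors on changed-position problems divided by errors on
    unchanged-position problems.
    """
    changed = [error for moved, error in problems if moved]
    unchanged = [error for moved, error in problems if not moved]
    if len(changed) != len(unchanged):
        raise ValueError("the ratio assumes equal numbers of problems per condition")
    if sum(unchanged) == 0:
        raise ValueError("no errors on unchanged-position problems; ratio undefined")
    return sum(changed) / sum(unchanged)


# Hypothetical early block: 6 errors in 10 changed-position problems,
# 3 errors in 10 unchanged-position problems.
early = [(True, True)] * 6 + [(True, False)] * 4 + [(False, True)] * 3 + [(False, False)] * 7
# Hypothetical late block: 2 errors under each condition.
late = [(True, True)] * 2 + [(True, False)] * 8 + [(False, True)] * 2 + [(False, False)] * 8

print(cte_ratio(early))  # 2.0
print(cte_ratio(late))   # 1.0
```

A ratio well above unity, as in the first block, indicates that a change in position is still producing extra errors; a ratio near unity indicates that position no longer influences the choice.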

The combined existing data on the differential-cue EF indicate that 
it is considerably more persisting than that of stimulus perseveration. 
Harlow’s [21] data (see Fig. 8) showed that this EF persisted on both 
trials 2 and 3 throughout the first 200 problems but was not important 
during the next 112 problems. Braun, Patton, and Barnes [6] reported 
the differential-cue factor to be operating during problems 289 to 384, 
and it apparently was operating during a portion of the next 96
problems. The results reported by Davis, McDowell, and Thorson [9] are
similar to those obtained by Braun et al. The differences probably result
from the varying number of trials given per problem—six by Harlow, 
three by Braun et al., and four by Davis et al. 

It would be predicted that the greater the number of trials of
ambiguous reward before the shift of the rewarded object to a new
position, the greater would be the error frequency on the shift trial
and, consequently, the higher the comparable-trial error ratio. This
prediction is generally borne out by the data [6, 9, 21].

Perhaps the most persistent EF is response shift, which may be 
described as a strong tendency of the monkey to respond to both stimuli 
in the object-discrimination learning situation. Response shift was initially 
discovered in the six-trial discrimination problem when the data re- 
vealed that more errors were made on trial 4 following three successive 
correct responses than following an initial error and two subsequent 
correct responses. The fact that fewer errors were made on trial 4 after 
the sequence with the initial error trial is particularly surprising because 
the initial incorrect choice implies that the trial 1 negative object was
preferred or, at least, that it had no strong negative value for the 
animal. 
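
The trial 4 comparison that revealed response shift can be sketched as follows. The data layout (one list of correct/incorrect outcomes per problem) and the sample values are assumptions made for illustration; they do not reproduce the original records.

```python
from typing import List, Optional, Sequence, Tuple


def trial4_error_rates(
    problems: Sequence[Sequence[bool]],
) -> Tuple[Optional[float], Optional[float]]:
    """Return trial 4 error rates after two kinds of trial 1-3 histories.

    Each problem is a sequence of outcomes for trials 1, 2, 3, 4, ...
    (True = correct). The first value is the error rate after three
    successive correct responses; the second is the error rate after an
    initial error followed by two correct responses. None is returned
    for a history that never occurs in the data.
    """
    def error_rate(outcomes: List[bool]) -> Optional[float]:
        if not outcomes:
            return None
        return sum(1 for correct in outcomes if not correct) / len(outcomes)

    after_three_correct = [p[3] for p in problems
                           if tuple(p[:3]) == (True, True, True)]
    after_initial_error = [p[3] for p in problems
                           if tuple(p[:3]) == (False, True, True)]
    return error_rate(after_three_correct), error_rate(after_initial_error)


# Invented outcomes for eight problems, four trials each.
problems = [
    [True, True, True, False], [True, True, True, True],
    [True, True, True, False], [True, True, True, True],
    [False, True, True, True], [False, True, True, True],
    [False, True, True, False], [False, True, True, True],
]
print(trial4_error_rates(problems))  # (0.5, 0.25)
```

A higher first value than second, as in the invented sample, is the pattern the text attributes to response shift.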

In Harlow’s [21] original analysis, response-shift errors were demon- 
strated only on trials 4 to 6, being obscured on the earlier trials by other 
confounding error sources. Davis, McDowell, and Thorson [9], on 
the other hand, were able to demonstrate response-shift errors from trials 
2 through 4 during problems 257 to 448. By this time the confounding 
EF’s of stimulus perseveration and differential cue had been suppressed 
or inhibited to a degree that permitted the more persistent response-shift 
EF to be identified in the earlier trials of the individual problems. 

The existence of the response-shift EF had doubtless been demon- 
strated in 1947 by Moss and Harlow [50], who used the technique of 
training to a single stimulus and then testing in the conventional two- 
stimulus situation on the following trial. The monkey was presented on 
trial 1 with a single stimulus object covering one foodwell; the other food- 
well was uncovered. On half the problems, the trial 1 stimulus was 
rewarded, and on the remaining problems it was not rewarded. Trial 2 
presented the previously encountered stimulus over one foodwell and a 
new object over the other foodwell. The trial 1 reward condition also 
prevailed on trial 2; i.e., if the single stimulus was rewarded on trial 1, 
it was rewarded on trial 2, and the new stimulus was unrewarded. If the 
trial 1 stimulus was unrewarded, it was also unrewarded on the second 
trial, and the new stimulus was rewarded. 

This method gives unequivocal evidence of a response-shift factor 
since performance following an unrewarded stimulus is clearly superior 
to performance following a rewarded stimulus, and this difference re- 
mains constant throughout training on hundreds of problems [21, 26]. 
If the single stimulus used on trial 1 is placed over a centered foodwell 
rather than over one of the two peripheral foodwells, the typical 10 per 
cent error difference quintuples (unpublished experiment by Schrier and 
Harlow). This almost unbelievable demonstration of the power of the 
response-shift EF is in keeping with its importance and persistence and 
indicates why no investigator has found response shift to be completely 
suppressed or inhibited even though hundreds of problems involving 
thousands of trials may have been completed by every subject. 

We conceive of the response-shift EF as resulting from an exploratory 
tendency [24] in the subjects. Furthermore, it would appear that this is 
primarily a tendency to explore the test situation in terms of its incentive 
possibilities. This is particularly striking in that it occurs in animals with 
a history of extensive experience with food incentives. Additional data 
bearing on this point will be presented subsequently. 

Response shift could well be a function of a number of interacting 
variables, one of which might conceivably be work inhibition. We regard
this explanation as extremely remote, however, since Davis [8] found 
little evidence that work inhibition is either an important or consistent 
variable influencing performance of rhesus monkeys. Response shift in 
monkeys may well be related to the spontaneous alternation originally 
reported in rat maze-behavior by Dennis [10]. 

Position-habit errors, which are consistent responses to either the 
right or left foodwell regardless of the position of the correct object, 
appear to represent an essentially unimportant error source for primates. 
Although many monkeys show clear-cut preference for either the right 
or left side of the cage, these preferences operate as a negligible error 
source in discrimination learning. The data [6, 9, 21, 57] show that 
positional error tendencies either do not exist or are suppressed during 
the first hundred problems. A most significant finding of Riopelle [57] 
is that the monkey’s trial 1 choices reflect positional preferences through 
hundreds of problems although these preferences are apparently com- 
pletely suppressed during the subsequent trials of the individual problems. 

The four error-producing factors which we have described as operat- 
ing in the discrimination learning situation are not regarded as com- 
prising a complete system of EF’s for either the discrimination learning 
problem or any other problem. Any condition which leads to in- 
discriminate responses to either or both of the stimuli will produce 
errors, and included in such conditions are external disturbances, internal 
emotional disturbances, and inadequate attention. All the described states 
would tend to produce random or unsystematic errors, i.e., errors without 
regard to objects, position, or arrangement of trial sequences. 

It is obvious that the number of EF’s varies in terms of the com- 
plexity of problems. A spatial discrimination with identical objects would 
eliminate any possibility of the operation of the stimulus-preference and 
differential-cue EF’s. The oddity problem involves all the EF’s described 
as operating in object discrimination learning and other EF’s too [49]. 
Particularly important here is the rewarded-object perseveration error, 
the tendency to respond to the object last correct without regard to the 
configurational cue of oddity. Similarly, the oddity-nonoddity problem 
must introduce at least one additional EF, a tendency to respond in terms 
of the preceding configurational cue. 

Recently Levine and Schrier have suggested a reformulation of the 
definition of EF’s. They define EF’s as unique patterns of responses made 
to irrelevant aspects of the stimulus situation, i.e., responses to those 
stimuli which are not correlated with the reward to the same degree as 
the correct object or cue. The irrelevant stimulus acts as an “error-factor 
producer” (EFP) when the responses of the subject are correlated with 
the stimulus for a given number of trials. Thus, in object discrimination,
position preference means that the subject is responding consistently
to one side. Stimulus preference means that the subject is responding 
consistently to the incorrect stimulus. 

They distinguish four classes of stimuli which may serve as EFP’s.
The first class contains those cues which are spatially differentiated, 
though constant from trial to trial. This includes the cues associated 
with each position in object discrimination and is the basis of position 
preference. The remaining three classes contain the cues which change 
from trial to trial. The second stimulus class consists of changing cues in 
what is traditionally described as the external environment. All environ- 
mental stimuli which appear and disappear from trial to trial, or which 
shift position or change in some way, are potential EFP’s. The animal 
may respond systematically to these. Whether it does or not will depend 
on its capacity to perceive these changes and on the laws governing 
the manner in which stimuli gain control over behavior. One may note 
here that irrelevant stimuli which vary in this manner will, if responded 
to by the animal but not perceived by the experimenter, produce be- 
havior which appears random, i.e., uncorrelated with any aspect of the 
situation. A tester, e.g., who periodically unconsciously tapped his foot,
would be the source of such a stimulus. The third class of stimuli 
contains those which result from the behavior of the subject. A response 
to the right side yields different stimulation from a response to the left 
side. An organism, then, can respond to its own preceding responses and 
yield such behavior sequences as response alternation, double alternation, 
and so forth. How long a sequence of behavior the animal responds to 
again depends upon its capacity and can probably be decided only 
empirically. Included in this class of stimuli are those deriving from
emotional reactions and preresponse movements. Since these stimuli are 
normally not recorded by the experimenter, they probably constitute the 
major source of random response sequences. The fourth class of stimuli 
to which the animal can respond comprises rewards and nonrewards on 
preceding trials. Response shift is considered to be a factor in this class. 
If the animal goes to object A consistently on the first few trials and 
then switches to the B object, the interpretation would be that the 
response-reward sequence on the first few trials has determined the 
switch on the critical trial. Another type of factor falling into this 
category is that of “rewarded-object perseveration,” in which the
subject’s response is determined by whether or not it was rewarded on the
preceding trial.
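
The idea that an EF is a response pattern correlated with an irrelevant class of stimuli can be illustrated with a small sketch. The three indices below (one each for the first, third, and fourth stimulus classes) and all names are hypothetical conveniences, not measures proposed by Levine and Schrier.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Trial:
    side: str        # "L" or "R": position responded to
    obj: str         # "A" or "B": object responded to
    rewarded: bool   # outcome of the response


def position_consistency(trials: List[Trial]) -> float:
    """Class 1 (constant spatial cues): share of responses to the favored side."""
    left = sum(t.side == "L" for t in trials)
    return max(left, len(trials) - left) / len(trials)


def side_alternation_rate(trials: List[Trial]) -> float:
    """Class 3 (own preceding response): share of trials on which side changes."""
    pairs = list(zip(trials, trials[1:]))
    return sum(a.side != b.side for a, b in pairs) / len(pairs)


def shift_after_reward_rate(trials: List[Trial]) -> Optional[float]:
    """Class 4 (preceding reward): share of rewarded trials followed by a
    switch to the other object, i.e., a response-shift pattern."""
    pairs = [(a, b) for a, b in zip(trials, trials[1:]) if a.rewarded]
    if not pairs:
        return None
    return sum(a.obj != b.obj for a, b in pairs) / len(pairs)


# Invented four-trial record.
trials = [Trial("L", "A", True), Trial("L", "A", True),
          Trial("L", "B", False), Trial("L", "A", True)]
print(position_consistency(trials))     # 1.0
print(side_alternation_rate(trials))    # 0.0
print(shift_after_reward_rate(trials))  # 0.5
```

The second stimulus class (changing external cues) is omitted because, as the text notes, such cues are often not recorded by the experimenter and so cannot be scored from the trial record.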

The authors point out that certain implications follow from this 
systematic cataloguing of the stimuli: 

1. Random behavior may be defined as response sequences which 
do not correlate with any of the recorded changes in the stimulus
situation. Since the major source of stimuli producing random sequences is
probably the behavior of the subject, they predict that animals that are 
very active between trials should yield a more random-appearing 
sequence of responses than placid animals. 

2. In delayed response, anything which eliminates the ability to re- 
tain the cue information, such as a prefrontal operation, would alter 
the pattern of EF’s. Whereas a normal animal just learning delayed re- 
sponse can respond to its preceding behavior and preceding rewards, the 
animal no longer able to retain information over the delay period should 
no longer be able to retain the information about its preceding response 
or the preceding reward condition, since these took place even before 
the critical cue. With such an animal, one would expect either an in- 
crease in random sequences or in the frequency of position preferences. 

3. The oddity task has the feature that learning can be manifested 
on the first trial of a given problem. If the task is arranged so that a 
series of one-trial problems is presented, then, from the standpoint of 
EF’s, the situation becomes simpler than it is on a single, multiple-trial 
oddity problem or a series of multiple-trial problems. In the latter the 
subject can alternate objects, can manifest object shift, and can show 
rewarded-object perseveration. The last is a particularly powerful EF.
These EF’s cannot occur in the one-trial learning situation. They predict, 
therefore, that the animal would learn oddity faster with one-trial prob- 
lems than it would with the equivalent number of trials presented in a 
single problem or series of multiple-trial problems. 

Levine and Schrier have also considered the relation of problem dif- 
ficulty to EFP’s. They acknowledge that tasks can be conceived of as 
falling along a continuum of number of EFP’s, and they offer some 
evidence that the greater the number of EFP’s, the more difficult the 
task. They argue, however, that the converse is not true. One cannot 
say that the more difficult a problem, the more EFP’s present. Thus, in- 
creasing the similarity of the stimuli to be discriminated increases the 
number of errors but does not alter the number or nature of the EFP’s. 
The patterns and frequency of the EF’s themselves would be different 
in the two situations, but the stimuli which control the behavior in one 
situation would also exist in the other. It is as though increasing the 
similarity of the stimuli increases the potentiality of the irrelevant stimuli 
for eliciting the EF’s. Other variables which affect errors without altering 
EFP’s are spatial discontiguity of the objects from the foodwells, the size 
of the patterns to be discriminated, and in general, the detectability of 
the stimuli to be discriminated. Thus, to increase problem difficulty, one 
can increase the number of EFP’s or one can increase the eliciting 
potential of the irrelevant cues which already exist by making the 
relevant stimulus harder to discriminate or detect. 

APPLICATION OF ERROR FACTOR THEORY


Although a wide range of learning tasks can be ordered in terms of 
the complexity of EF’s which operate, EF theory offers little new in so far 
as classification of problem difficulty per se is concerned. The progressive 
increase in number of ambiguous factors as we pass from spatial dis- 
crimination, to object discrimination, to increasingly complex multiple- 
sign problems has already been described [22]. Error factor analyses, 
however, permit us to predict and subsequently to validate the nature 
of the underlying conditions producing positive and negative transfer
among many problems and provide parsimonious explanations for some
previously inadequately explained phenomena.

Fig. 9. Comparison of discrimination learning set curve
[Curves shown: discrimination reversal problems and discrimination
problems, plotted against problems.]

There is an impressive literature which discusses in theory the relative 
difficulty of a discrimination and a discrimination-reversal, the common- 
sense assumption being that it is much harder to change a well-learned 
response than to learn it. Having been reared in this tradition, the author 
was unprepared to find that rhesus monkeys that had formed a dis- 
crimination LS formed a discrimination-reversal LS with considerable 
facility, as Fig. 9 indicates [20]. 

Only after we had developed EF theory did the answer become
obvious. The learned inhibition of all the error-producing factors which
operate in discrimination learning transfers positively to the
discrimination-reversal learning situation with a single exception, that
of stimulus perseveration. Successful discrimination and
discrimination-reversal learning necessitate suppression of positional,
differential-cue, and response-shift EF’s. The only negative effect of
the previous discrimination learning on reversal is the exaggeration of
the stimulus-perseveration EF. The original learning produces a high
object-preference which must be overcome in the reversal phase of the
problem. But this analysis makes it obvious that there are more factors
exhibiting positive transfer than negative transfer, and furthermore,
that positive transfer operates on two of the more persistent EF’s.

Reid’s [56] apparently puzzling finding that overlearning facilitates 
rather than hinders black-white discrimination-reversal learning has been 
confirmed [55], and a number of explanations have been offered. Per- 
haps the simplest explanation is the one we have presented above: dis- 
crimination learning and reversal are not antithetical processes. So far 
as EF elimination is concerned, the two learning problems have more 
in common than at variance. From this point of view, there is no 
reason to predict anything but increased reversal efficiency with dis- 
crimination overlearning. Certainly there is no reason at the present time 
to assume, as do Reid [56] and Pubols [55], that a different kind of 
learning, a “response of discriminating,” takes place, particularly during 
the overlearning period. 

Any radical change in a problem situation, such as shifting from 
discrimination to oddity, oddity to nonoddity, or discrimination to dis- 
crimination-reversal, results in a very striking phenomenon: the first 
problem or block of problems following the change may result in a 
cascade of errors, accompanied by other behavioral signs such that the 
subject may be described as “emotional,” “disorganized,” or “frustrated.” 
This finding has been frequently reported in series of black-white dis- 
crimination-reversals for mature rats [see 55, 56] and infant monkeys 
[27]. The first reversal may entail a very large number of errors, perhaps 
more errors than were made in the original learning. The subsequent 
reversals and rereversals are usually made more efficiently, with error 
scores falling below those in the original learning. Similar results have 
been reported for problem changes and reversals in a wide variety of 
learning situations. Changing a solvable problem into one that is in- 
solvable produces extreme examples of disruption, which in this instance 
cannot be overcome. The resulting behaviors have been described in the 
frustration literature, which is outside the scope of this paper. 

It is our belief that EF analysis offers possibilities of determining the 
effects of reversal or change in a problem more efficiently and objectively 
than these changes have previously been analyzed. We believe that 
change in a problem will result in the reappearance of previously sup- 
pressed EF’s, and that the nature of this EF reappearance can be pre- 
dicted with considerable success in terms of the nature of the problem 
and the subject’s past history. Furthermore, we would predict that the
degree of problem change or shift and the resulting increase in problem 
difficulty will be related in a quantitative manner to the type and 
number of EF’s which reappear. 

The actual test of this hypothesis is difficult since one must show 
not only that problem change or reversal is disruptive but that the 
nature and the order of appearance of the suppressed EF’s which this 
change releases are highly systematic. An approach to the problem has 
been made by Moon and Harlow, who studied oddity learning in 
monkeys that had previously learned object discrimination [49] and 
uncovered two categories of errors that appeared only when the subject 
was attempting solution of the problem in terms of the reactions ap- 
propriate to object-discrimination learning. Subsequently, Moon and 
Harlow analyzed the oddity-nonoddity learning of the same monkeys 
and found that during the first hundred oddity-nonoddity problems the 
monkeys rapidly came to respond to the nonoddity trials as if they were 
object-discrimination trials. In other words, their responses were not 
appropriate to the previously learned oddity problem, but to a more 
primitive problem, object discrimination. 

We would predict that had the nonoddity problem been unsolvable 
and had testing been continued, the typical form of the errors would 
have become progressively more primitive, regressing to position habits, 
stimulus perseveration, and eventual randomness—inattentive or emo- 
tional responses. The experimental testing of this hypothesis poses a few 
difficulties, but it unquestionably can be done. In favor of such a hypoth- 
esis there already exists a considerable body of quantitative and semi- 
quantitative data of a nondefinitive type. 

This concept of regression to more primitive error patterns is 
reminiscent of Krechevsky’s classical studies of “hypotheses” in rats [39]. 
Subjects tested on insolvable problems for hundreds of trials showed fre- 
quent periods of consistent behavior such as response to the right position, 
left position, light selection, or light avoidance. Furthermore, these islands 
of consistent, patterned behavior were commonly surrounded by long 
series of random or inconsistent responses. Even though different rats 
and strains of rats showed preferences for different response patterns, 
the general picture is quite clear: Krechevsky was probably measuring 
unlearned patterns, not patterns arising from change or shift in a 
previously learned response. 

Progressive reappearance of previously suppressed EF’s following 
protracted failure in a problem-solving situation would appear to be a 
mechanism of high adaptive value. If an animal possesses a repertoire of 
LS’s permitting facile solution of a half-dozen problems, it should utilize 
all these patterns or “hypotheses” when a new problem appears. Reaction 
of the animal with initial randomness upon the appearance of any new
problem would represent a waste of its previous learning. The initial use 
of existing LS’s and the consequent appearance of complicated EF 
patterns is probably efficient, and regardless of efficiency, it appears as 
a rather generalized approach by all higher primates. Indeed, one of the 
difficulties from which human subjects often suffer when faced with a 
simple problem is a stubborn insistence on trying to solve it by some 
hypothesis or LS of a level of complexity inappropriate to the presented 
task. 

If it is assumed that learning is the elimination of all EF’s operating 
within a particular problem, it should be possible for animals to learn 
problems partially by eliminating individual EF’s before formal training 
in the test situation begins. Actually, psychologists have been doing this 
for decades, e.g., “adapting” rats on a straightaway before training them 
on a multiple-unit maze, thereby doubtless reducing error-producing 
factors in advance of the “learning” situation. 

At Wisconsin we systematically engage in “adaptation” procedures in 
order to prepare the animal emotionally for experimentation, but this 
preliminary training must also reduce EF’s, though we have conducted 
few formal experiments to test this assumption. We consistently start 
experimental work with naive monkeys by giving them experience on 20 
objects presented individually over a single, centered foodwell. We 
probably reduce stimulus preference by this technique, but we have 
not as yet conducted a formal experiment to determine the extent to 
which this is accomplished by our single-stimulus preliminary training. 

An intriguing experiment by Jarvik [32] demonstrates that essentially 
perfect discrimination performance can be accomplished without any 
paired-stimulus training. Jarvik [32] soaked small square pieces of bread 
in either a saccharin solution or a solution containing capsicum, dehydro- 
cholic acid, and quinine and dyed one kind red and the other, green. 
Half the subjects had the saccharin bread red, half green. The animals 
were given preliminary experience with one piece of each color of bread, 
the positive or saccharin-soaked bread being presented first. The subjects, 
10 chimpanzees and 8 monkeys, were then tested for 25 trials with both 
stimuli present. No errors were made by 12 of the subjects, and a 
maximum of 2 errors was made by 2 chimpanzees. These data clearly 
show that immediate discrimination learning may be effected by estab- 
lishing strong stimulus preferences before instituting formal training. 

The importance of stimulus preferences is demonstrated by data that 
we are presently obtaining on groups of 60-, 90-, and 120-day-old 
monkeys and on 16 young adults. Approximately 25 per cent of the 
animals, on their initial discrimination problem, which in this experiment 
is 500 trials in length, solved the problem with few or relatively few 
errors. Strong preferences apparently exist in some monkeys for certain
stimuli in spite of our best efforts to prevent such an occurrence. 

In 1944 the author developed the technique of adapting monkeys to 
the WGTA by presenting a series of stimuli, one at a time, on the test 
tray. Each stimulus was presented an equal number of times over the 
right and the left foodwell, and each trial was rewarded. The extremely 
efficient performance by the subjects on their initial and subsequent dis- 
crimination problems [19] suggested that the adaptation procedures 
actually constituted training procedures. 

The truth of this presumption has now been established by an un- 
published study by Harlow and Meyer, who demonstrated that the above 
procedures, which never involve paired-stimulus presentation, reduce 
the differential-cue EF significantly. The monkey learns to suppress the 
effect of reward of position and respond consistently to object rewards. 
Lest it be suggested that this is accomplished by eliminating position 
preferences, it should be stated that direct tests indicate that these 
procedures have not affected position preferences. Position preferences 
existed before these single-stimulus training procedures and equally 
strong preferences existed afterward. Since obtaining these results, we 
have abandoned the two-hole board for our standard adaptation tech- 
nique and resorted to the one-hole board, for we prefer to minimize 
learning prior to the formal experimentation. 

Still another error factor has been demonstrated to be amenable to 
experimental manipulation: response shift. In this instance the EF was 
exaggerated, not reduced, through a modification of reward conditions. 
Schrier and Harlow [62] tested eight Java monkeys on 432, 10-trial 
discrimination problems, three problems per day, rewarding the subject 
with one, two, or four food pellets for correct responses within a problem. 
The three reward conditions were balanced over each three-day test 
period. Over-all learning curves, i.e., curves based on trials 1 through 10,
were progressive for all incentive conditions combined and for each 
incentive condition plotted independently. Frequency of response-shift 
errors was inversely related to the amount of incentive, but neither dif- 
ferential-cue errors nor position-preference errors related in any system- 
atic way to incentive amount. Analysis of trial 2 responses following 
correct trial 1 responses showed an interesting relation to incentive 
amount. Such responses under intermediate- and high-reward conditions 
showed a fairly constant percentage of errors over the course of the 
experiment, but under the low-reward condition the percentage of errors 
increased significantly from 40 per cent in the first third of the experi- 
ment to 50 per cent in the next third and 65 per cent in the final third. 
The monkeys actually had “learned” to perform below chance on trial 
2, although they received no food reward for these response-shift errors. 




In summary it may be stated that in spite of the limited research con- 
ducted thus far on EF analysis, a number of interesting phenomena are 
evolving. It is a reasonable hypothesis that the suppression of all EF’s 
defines perfect learning, and that learning is nothing but suppression or 
inhibition of EF’s. Several EF’s have been shown to be subject to sup- 
pression or augmentation by training outside the test situation. Finally, 
in at least one case, we find completely different interaction between an
incentive condition and the EF’s operating within a problem. These data 
suggest that EF’s are functional units whose suppression underlies 
learning. 


LEARNING SET, ERROR FACTORS, AND UNIPROCESS 
LEARNING THEORY 


The development of the thesis that learning involves nothing other
than the elimination of responses and response tendencies inappropriate
to a particular learning situation leads us to question the commonly held
concept that learning is the resultant of two processes, excitation, which
follows reward, and inhibition, generated when no reward is received. It
seems more consistent with EF theory to assume that instead of two
processes only a single process underlies learning, and the reasoning
favoring this assumption is presented at length in a recent paper [26].
Also reported in that paper is an experiment testing duoprocess as
opposed to uniprocess theories by means of LS methods. We found under
carefully controlled conditions that trial 1 reward and trial 1 nonreward
give rise to LS curves which are alike in form and parallel each other
throughout their course. Furthermore, the constant difference between
the curves, as shown in Figs. 10 and 11, can be explained by the
operation of the independently measured response-shift EF. Constant
differences are, however, immaterial inasmuch as it is differential curve
form rather than a constant rate difference which would result if reward
and nonreward showed differential strengthening and weakening effects
during the course of learning.

Fig. 10. Discrimination learning set curves based on trial
2 responses following rewarded and unrewarded trial 1
responses. Reproduced from Psychol. Rev. [26].

Fig. 11. Discrimination learning set curves based on trials
2-6 responses following rewarded and unrewarded trial 1
responses. Reproduced from Psychol. Rev. [26].
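
The distinction drawn above between a constant difference and a difference in curve form can be illustrated numerically. The sketch below is not the analysis reported in [26]; the function name and the per-block percentages are invented, and the spread index is only one simple way of expressing how far two curves depart from parallelism.

```python
from statistics import fmean
from typing import Sequence, Tuple


def difference_profile(
    after_nonreward: Sequence[float],
    after_reward: Sequence[float],
) -> Tuple[float, float]:
    """Compare two learning set curves block by block.

    Returns (mean_difference, spread), where spread is the largest
    deviation of any block's difference from the mean difference.
    A spread near zero means the curves are parallel (same form,
    constant offset); a large spread means the forms differ.
    """
    if not after_nonreward or len(after_nonreward) != len(after_reward):
        raise ValueError("curves must be non-empty and of equal length")
    diffs = [n - r for n, r in zip(after_nonreward, after_reward)]
    mean_diff = fmean(diffs)
    spread = max(abs(d - mean_diff) for d in diffs)
    return mean_diff, spread


# Invented per cent correct on trial 2 over four blocks of problems.
nonrewarded_trial1 = [60, 70, 80, 88]
rewarded_trial1 = [50, 60, 70, 78]
print(difference_profile(nonrewarded_trial1, rewarded_trial1))  # (10.0, 0.0)
```

In the invented sample the offset is a constant 10 percentage points, the pattern the text treats as attributable to response shift rather than to separate reward and nonreward processes.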


Since the publication of the study just discussed, we have com- 
pleted two additional experiments testing the contrasting duoprocess- 
uniprocess learning theories. A group of 10 monkeys was tested on a total 
of 720 two-trial discrimination problems. A single stimulus object was 
presented on trial 1, and it was rewarded on half the problems and not 
rewarded on the others. After a delay of 5, 10, or 20 sec, trial 2 began, 
introducing a pair of stimuli, which included the singly presented 
stimulus of trial 1. The LS curves for these two-trial problems with 
intertrial delay lack the regularity of the LS curve for six-trial problems 
without intertrial delay, but the form of the six trial 2 curves following 
rewarded and nonrewarded trial 1’s, plotted independently for reward 
condition and delay condition, is similar for each delay interval. The
differences in performance following the various delay intervals are
not great, but the relative superiority of the nonrewarded over the
rewarded trial remains quite constant at all delay intervals (see Fig. 12).
Certainly, these data give no indication of the operation of two different
mechanisms, one associated with reward and the other with nonreward.

Fig. 12. Discrimination learning set curves for intertrial delay problems. (a) 5-sec
delay. (b) 10-sec delay. (c) 20-sec delay. In each instance, the solid-line curve
is for the trial 1 reward condition and the broken-line curve is for trial 1 not
rewarded.

The next experiment involved testing the same 10 monkeys on a 
total of 960 delayed response trials, 120 trials on each of four delay
conditions, 0, 5, 10, and 20 sec, and for each of two prechoice or
incentive conditions. Each trial was initiated with one of the two identical
stimulus objects covering one foodwell on a tray and the second stimulus 
2 in. behind the remaining foodwell. For one incentive condition, food 
was visible in the uncovered foodwell. The experimenter tapped three 
times behind the well, then covered it with the block and started the 
timer. At the end of the appropriate delay period, the tray was pushed 
forward to within the animal’s reach. For the second incentive condition, 
food was in the covered foodwell. The experimenter tapped behind the 
uncovered foodwell, which was visibly empty, then covered it with the 
second block. The remainder of the trial procedure was the same as for 
the first condition. 
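
The arithmetic of this design can be checked with a short sketch. It assumes the reading that 120 trials were given in each of the eight delay-by-incentive cells; the names used (DELAYS_SEC, INCENTIVE_CONDITIONS, build_design) are hypothetical labels introduced only for illustration and are not taken from the original procedure.

```python
from itertools import product

# Delay intervals (sec) and prechoice incentive conditions as described in the text.
DELAYS_SEC = (0, 5, 10, 20)
INCENTIVE_CONDITIONS = ("food visible before delay", "food not visible before delay")

# Assumption: 120 trials in every delay-by-incentive cell.
TRIALS_PER_CELL = 120


def build_design():
    """Return a mapping from (delay, incentive condition) to number of trials."""
    return {cell: TRIALS_PER_CELL for cell in product(DELAYS_SEC, INCENTIVE_CONDITIONS)}


design = build_design()
total_trials = sum(design.values())

for (delay, incentive), n in design.items():
    print(f"{delay:>2}-sec delay, {incentive}: {n} trials")
print("total trials:", total_trials)
```

Under this reading the four delays crossed with the two incentive conditions give eight cells, and eight cells of 120 trials account for the stated total of 960.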

Superior performance was obtained for those trials in which the subjects 
saw food in the well during the predelay period. Long experience with 
delayed response testing suggests that the critical factor is attention; it 
is easier to attract and hold the animal’s attention if it sees the food in 
the well than if it sees only an empty well. The effects of the delays were 
thus much greater than in the discrimination learning experiment. The 
curves for the two incentive conditions (Fig. 13), which are based on the 
percentage of correct responses as a function of delay, are not significantly 
different in form, and the advantage of the visible reward over the 
nonvisible reward presentation condition is consistent. When the data are 
plotted separately for each of the eight conditions, essentially parallel 
learning curves are found for the 0- and 5-sec delay intervals, and the 
10-sec curves parallel the shorter-delay curves except for the last point. 
At the 20-sec delay interval the visible reward presentation produces a 
smooth, negatively accelerated curve, but the nonvisible reward condition 
yields a flat, chance curve which fails to rise above 56 per cent at any 
point. 


In general, we believe that the data of these two experiments give 
strong additional support to the uniprocess learning thesis. Temporal 
decay in both experiments produced compa[...] of trial 1 reward and 
nonreward in one instance, and visible reward and nonvisible reward in 
the other. Generally, [...] produced parallel learning when delay interval 
was held constant. The sole exception was a failure to obtain any learning 
under one test condition. 

The concept of uniprocess learning is by no means original. Guthrie 
[17] has long espoused such a position, and Hebb [30] can be similarly 
categorized. Both these authorities imply that the fundamental process is 
one of growth or excitation. Analysis of EF elimination led us, however, 
to examine the possibilities of a uniprocess position based on inhibition. 
Later, speculation on the nature of learning per se seemed to be con- 
sistent with an inhibition learning theory, for we could find no evidence 
that learning evolves from the formation of new associations. To this 
iconoclastic statement we now give fuller explanation. 

Let us first consider what learning really is. To obtain learning in 
any situation we must have a subject motivated to achieve an incentive, 
and the situation is so arranged that there are various impedimenta of 
varying orders of complexity in the way of the subject’s response to the 
incentive. The subject must circumvent these impedimenta if it is to 
learn. In the Skinner box the various patterns of temporal ordering are 
these impedimenta, and on the Klüver tray in the WGTA they are the 
patterns of objects placed over the foodwells. In a very real sense the 
subject in either of these situations, and in all other learning situations, 
is not responding to the impedimenta; it is responding to the incentive, 
and it is responding in spite of the impedimenta. Observation of re- 
sponses of monkeys in the WGTA situation makes it obvious that the 
monkey is responding to the food, and the animal makes or attempts 
the food-getting response in spite of the road-blocking stimuli. So long 
as an animal actually responds to the stimuli and not the incentive, 
learning is inefficient or impossible. During the adaptation process, we 
attempt to prevent our monkeys from responding to the stimulus objects 
by chewing them, tearing them, or playing with them in the cage. Those 
deviant monkeys that persist in responding to the stimuli per se never 
learn, and they make their scientific contribution in subsequent neuro- 
physiological experiments. The difficulty encountered in training infant 
monkeys, most chimpanzees, and all human children is their persistent 
tendency to disregard both appetite and hunger and to respond to the 
stronger motives afforded them by the impedimenta. 

Successful monkey learners, regardless of the situation, gradually in- 
hibit responses and response tendencies interfering with the minimal 
essential responses of prehending and/or ingesting the incentive, and as 
this is achieved, the latency of the correct response, the response re- 
maining after all the incorrect responses have been inhibited, gradually 
approaches, but never attains, the latency of the direct response to the 
incentive. The responses inhibited include any response to incorrect 
stimuli and any unnecessary response to the correct stimuli. For learning 
viewed in this way, we see no need to postulate formations of new 
associations. 

Recently the author has reviewed the experimental materials relating 
to the evolution of learning and attempted to fit them into a theoretical 
framework [25]. The learning of primitive organisms appears to be 
nothing more than the gradual suppression or inhibition of extraneous 
response tendencies. In view of modern genetic theory, which holds that 
evolution takes place by the selection of multiple small mutations, it is 
not unreasonable to hypothesize that the fundamental process of learning 
has probably remained essentially unchanged for near-countless 
millions of years. Furthermore, there is nothing relating to the slow, but 
progressive, evolution of the nervous system to suggest any great master 
mutation giving rise to a new learning mechanism. For countless genera- 
tions, development has gone along the line of slowly increasing the num- 
ber and complexity of neural structures without evidence for develop- 
ment of essentially new anatomical units which might give basis for a 
sudden change in the nature of learning. Unless the assumption is made 
that there have been one or more large mutations which could have 
resulted in new learning processes, inhibition learning theory is not 
inconsistent with evolutionary theory. Data such as these, of course, can 
never be definitive but must always be judged on a “best-guess” basis. 


CONCLUDING COMMENTS 


The materials on learning set and error factor theory have now 
been presented. We have described how our data led us to take sides on 
an issue which we believe to be a firm basis for the establishment of a 
system, the issue of uniprocess vs. duoprocess learning. We have made 
a rather arbitrary decision to investigate how far it is profitable to ex- 
ploit the concept that the single learning process is inhibitory, and that 
learning consists only in the elimination of incorrect response tendencies 
elicited in the learning situation. We are satisfied with this arbitrary 
position because we believe it has a high probability of being funda- 
mentally meaningful. 

Experimental and observational data, in large part obtained from our 
“baby monkey” research since this paper was written, have strengthened 
our opinion that the most important mammalian motives are the in- 
centives provided by the physical stimuli of the external world—extero- 
ceptive, chemoreceptive, and nociceptive. No stimuli of any of these 
classes have developmental priority over the others, and the actual order 
of appearance in many cases is at variance with textbook descriptions. 
Thus, solid foods develop as effective incentives from responses to 
exteroceptively motivated curiosity-manipulatory responses. If we were to 
follow the common psychological paradigm of “derived motives,” 
responsiveness to solid food would be a derived motive conditioned to 
curiosity responses. 

It is these external incentives that form the motivating conditions 
constantly eliciting behavior in the waking organism. The animal is no 
more able to avoid responding to these incentives than it is able to ignore 
hunger, thirst, or elimination. Attainment of these incentives in sufficient 
amount produces satiation, and the primary function of drive states is to 
delay this satiation effect. We grant the fact that primary drive states 
may have other functions, such as alteration of threshold states. 

The direct response to these incentives is blocked, made difficult, or 
made variable in learning situations. The suppression of the extraneous, 
or nonadaptive, responses or response tendencies elicited by the blocking 
(problem) agents constitutes learning, but their suppression is not a 
function of the response to the incentive, e.g., the eating of food. The re- 
sponse to the incentive gives the animal information about the character- 
istics of the incentive, such as its quality and quantity. This learning 
about particular incentive characteristics may influence the likelihood 
that subsequently the incentive will consistently elicit responses over less 
prepotent external stimuli simultaneously acting upon the organism. 

We have already presented our basic theoretical position in regard to 
learning, but we will summarize it again. When a psychologist decides 
to study learning in an animal, he initially trains the animal to form an 
expectancy that some response or sequence of responses in a particular 
situation will be followed by an incentive. This is done by “shaping” the 
behavior of the rat or pigeon in the Skinner box, by training the rat on 
a straightaway before subsequently running it on a maze, and by adapt- 
ing the monkey to the test tray in the WGTA. In all these learning situ- 
ations the animals are trained to make some response or responses that 
lead to an incentive. The subjects learn that responsiveness in a particular 
situation will provide an incentive or that there is a certain probability 
that it will provide an incentive. 

Let us now take any discrimination problem and note what “learn- 
ing” is from this point forward. Having mastered the straightaway, the 
rat may be put in a maze. As Snygg has so clearly shown, it is possible 
to design multiple-unit mazes in such a way that the natural response 
tendencies of the rat will lead it usually to make errorless performances. 
It is also possible to devise mazes which will invariably produce errors. 
If the maze patterns are such that they elicit errors, the rat must inhibit 
responsiveness to those segments which lead to errors before the experi- 
menter will state in print that the rat learned the maze. 

The pigeon has been trained to press a lever in a Skinner box according 
to some predetermined schedule. It is presented with a green light, 
and the previously introduced schedule of reward is continued. The bird 
is now presented with a red light and not rewarded. At first the pigeon 
responds as it did to the green light in the total test situation, but grad- 
ually it learns to inhibit this response. When the inhibition is complete, 
the pigeon is described as having learned the discrimination. 

When infant monkeys 60, 90, 120, or 150 days of age face their 
first discrimination problem, choice between two multidimensional ob- 
jects on a test tray, their performances fall into two classes. In this learn- 
ing situation, learning to push the object which covers the baited food- 
well, some infants make no errors or as few as five errors before reaching 
the criterional level of learning. The other monkeys make many errors, 
indeed, they may make 90 per cent or more errors on the first 25 trials, 
and as many as 70 errors totally before mastering the problem. The 
meaning of these data is perfectly clear: if the preferred stimulus is the 
one rewarded, “learning” may take place in zero trials; if the nonpre- 
ferred stimulus is the one rewarded, the monkey must come to inhibit 
these response tendencies before learning is complete. 

Our theoretical position differs from that of Hull in that his is a 
duoprocess reinforcement theory and ours a uniprocess nonreinforcement 
position. Similarly, our position differs from Spence’s, whose 1936 treat- 
ment of discrimination learning, like Hull’s, favors reinforcement and 
two learning processes, excitation and inhibition. To Spence, however, 
we are indebted for the concept of differential-cue errors. Guthrie’s posi- 
tion resembles ours in that he favors nonreinforcement, but differs from 
ours in that he proposes the formation of new associations whereas we 
propose the elimination of preestablished nonadaptive responses or re- 
sponse tendencies. Our position has much in common with that of Tol- 
man in the treatment of incentives. Because of the incompleteness of our 
theory at the present time, more detailed comparison with current the- 
ories is best left to the future. 

Our purpose in this paper has been to present a point of view, or 
pretheory, based on a quarter-century of experimental work at the 
Wisconsin Primate Laboratory. Since certain of the present systematic 
considerations are novel, and have not before been brought together in 
this way, the primary initial task was necessarily one of exposition. 
Though at many points questions germane to the Project A discussion 
outline are touched upon, more detailed analysis of the present 
formulation would be pretentious at this phase. 


INTRODUCTION 


In a sense, the reader of this chapter is confronted with an insoluble 
problem. He is being asked to follow an analysis of a system that does 
not exist, for, despite the fact that the earliest experiments in learning 
were concerned with rote learning, and despite the existence of the 
enormous amount of data that has been gathered in this rather limited 
learning situation, no comprehensive, systematic treatment of these data 
exists. Partly for this reason, and partly because the field of rote learning 
is idiosyncratic in other respects, the outline that is being used as a guide 
in many of the other papers in this series cannot be followed in detail. 

The fact that systematic treatment of rote-learning data exists only in 
rudimentary form is rendered even more remarkable by the circumstance 
that the overwhelming majority of rote-learning experiments have fol- 
lowed, more or less closely, the tradition established by Ebbinghaus. 
Thus, the would-be systematizer has not been plagued with data gathered 
by experimenters holding to different systematic and presystematic view- 
points. What the systematic biases of those who did gather these data 
may have been, we shall have occasion to examine presently, and 
certainly no implication is intended that psychologists in general were in 
agreement with these points of view. It is sufficient to note here that, in 
general, the psychologists who held widely different systematic views 
from those of workers in the area of rote learning tended to reject the 
rote-learning situation altogether either as being artificial or as being less 
fruitful than other areas of learning experimentation. Their opinion that 
rote-learning data are relatively worthless has led them to investigate and 
to speculate about learning with reference to other learning situations, 
thereby leaving the bulk of rote-learning data to be gathered by those 
whose systematic leanings were more or less homogeneous. 

The field of rote learning, of course, is not entirely devoid of theory. 
The Mathematico-deductive Theory of Rote Learning of Hull and others 
[7] represents an attempt to bind together a considerable number of the 
independent and dependent variables of rote learning within a single 
systematic framework. Unfortunately, and perhaps because of the for- 
midable nature of the theory, this work does not seem to have had a 
great deal of influence upon subsequent experimentation, and only a 
minute fraction of rote-learning experiments since 1940 have been 
squarely coordinated with the theory. On the other hand, if there has 
been a dearth of comprehensive systematizing in the field of rote learn- 
ing, there has certainly been no shortage of special theories and limited 
hypotheses. This plethora of special hypotheses suggests that those who 
have worked in the field of rote learning have been concerned with the 
explanation of the results of particular experiments or small groups of 
experiments rather than with the incorporation of large amounts of data 
into a single framework. 

There may be a reason for this. It is well known, of course, that there 
is a logic of science. It is, perhaps, less well understood that there is also 
a psychology of science and that psychologists, no less than other scien- 
tists, are subject to its laws. Such a psychology of science concerns itself 
with the ways in which scientists behave in their scientific endeavors, and 
we should not be surprised to find that their behavior does not always 
conform to the rules of the scientific game, that is, to the logic of science. 
For example, according to the logic of science, no theory can be main- 
tained in the face of one single, well-established, contradictory fact. 
Conant [1] and others, however, have pointed out that scientists do not 
often reject a theory, particularly one that is well established, on this 
basis. Instead, theories tend to persist until more adequate ones are 
presented and, indeed, even under these circumstances, scientists have often 
seemed loath to discard some of their theoretical beliefs. 

It is also true that a theory that appears to be removed from the 
historical context of the time of its emergence may die by neglect rather 
than by disproof. Hull’s theory of rote learning [7] is an example of this 
phenomenon. It is entirely possible that, had this theory been advanced 


540 ARTHUR L. IRION 


at the present time, it would have aroused a considerable amount of 
scientific activity among workers in the field of rote learning. Arriving 
on the scene too early, the theory has had virtually no impact upon the 
research in this area. This fact is one of the reasons why this theory will 
not be discussed in detail in this chapter. 

Now in considering the psychology of science, it is apparent that one 
of the functions of a systematic formulation is to facilitate communica- 
tion among research workers. Although this function of systems has been 
largely neglected, it is possible that this may be one of the chief reasons 
why systems are formulated and gain adherents. This factor may be 
particularly important in a young science in which the preconditions 
for realizing other goals of systematization (of the sort customarily em- 
phasized in the logic of science) may not have been achieved. Thus, 
two students of Lewin can discuss their experimental work with relatively 
complete understanding by making reference to the concepts employed 
by Lewin, even though their words might need a considerable amount of 
translation if they were to be clearly comprehended by a student of Hull 
or Guthrie. It is significant to note, therefore, that those who have 
worked in the field of rote learning have typically “spoken the same 
language.” In this way, they have fulfilled one of the functions of a sys- 
tem, and a very important one. It is possible that, once the communica- 
tion function of a system is established by other means, the need for 
systematization becomes considerably less pressing. If true, this would 
account, at least in part, for the relative lack of systematic theory in the 
field of rote learning. There is another reason, and probably a more 
important one, for the lack of systematization, but we shall discuss this 
cause in a later section. 


BACKGROUND FACTORS AND ORIENTING ATTITUDES 


It is not always true that the person who first works in a particular 
research area continues to dominate the development of that kind of 
research for any great length of time. Concerning the influence of 
Ebbinghaus on the field of rote learning, however, there can be no doubt 
that his influence has been, and continues to be, profound. Twenty-five 
years after the publication of Über das Gedächtnis [2], Titchener, a man 
who did not bestow indiscriminate praise upon the works of his col- 
leagues, could make the following statement [15]: 


Here, then, were pure perceptions, sights and sounds that had no meaning 
and no associates; here was material so varied and yet so simple, so rich 
and yet so uniform, that experiments could be made under laboratory 
conditions, and the results of one experiment could be compared directly with 
the results of another. 
It is not too much to say that the recourse to nonsense syllables, as a 
means to the study of association, marks the most considerable advance, in 
this chapter of psychology, since the time of Aristotle. 


Thirty-two years later, or nearly sixty years after the appearance of 
Ebbinghaus’s work, McGeoch began his Psychology of Human Learning 
[9] as follows: 


Systematic experimental study of human learning dates from 1885, when 
Ebbinghaus stated some of its fundamental problems, devised methods for 
studying them, and in many ways set the pattern for later research. His 
monograph, Memory: a contribution to experimental psychology . . . , is 
a landmark in the history of psychology and is still a model which will repay 
careful study. 


Today, seventy years after Ebbinghaus’s initial work, the situation 
has not changed materially. Many of the fundamental problems that 
Ebbinghaus stated are still being investigated and Ebbinghaus’s inven- 
tion, the nonsense syllable, is still in widespread use, as are several of his 
methods of investigation. Of course, the apparently great influence of 
Ebbinghaus is enhanced by an artifact. We tend to count as rote-learning 
experiments only those that follow more or less closely the tradition 
Ebbinghaus created. Thus, the work of Thorndike might not be con- 
sidered by some to be “pure” rote-learning experimentation. In the same 
way, many of the recent studies that employ rote-learning techniques to 
investigate problems that do not fall in the classical rote-learning tradi- 
tion (the use of such techniques to study personality variables, for ex- 
ample) may be ignored by some individuals when they are considering 
the rote-learning literature. Nevertheless, it would be as foolish as it 
would be inaccurate to discount the influence of Ebbinghaus entirely. A 
brief inquiry into the systematic preferences of Ebbinghaus, therefore, 
would seem to be in order. 

Ebbinghaus, whose work belongs to the latter part of the nineteenth 
century, could hardly have failed to have been strongly influenced by 
the British school of associationism and especially by the formulations 
of John Stuart Mill. In being influenced in this way, Ebbinghaus was 
certainly not alone among early German psychologists; in fact, one could 
make a fairly strong defense of the proposition that modern psychology 
was created by the synthesis of British associationism and German natural 
science. In a sense, the work of Ebbinghaus represents a fairly exact 
application of the doctrine of associationism to laboratory experimentation. 
The “ideas” of the associationists, into which mind was supposedly analyz- 
able, became the nonsense syllables which formed the units of analysis 
for Ebbinghaus, and quite obviously, most of the “laws” of association 
propounded by the associationists became subject to experimental attack 
by means of the techniques that Ebbinghaus devised. Beyond this, it is 
possible that the temperament of Ebbinghaus himself may have been 
responsible for the way in which the field of rote learning developed. 
More so than many another of his contemporaries (or ours, for that 
matter), Ebbinghaus was committed to the empirical approach. Thus, 
in his free translation of Ebbinghaus’s Abriss der Psychologie [3], Meyer 
[10] notes in the preface that he “became interested in this book because 
of the fact that the author has succeeded in keeping entirely free of all 
fads, and has presented only that which is generally accepted by psy- 
chological science. . . .” I shall not strain this point, but it is worth 
noting that the bulk of the work in rote learning has maintained this 
tradition, and indeed, this work has more than once been attacked 
because of its crass empiricism. 

One other chain of circumstances that has determined the present 
status of research and theory in rote learning should also be mentioned. 
Learning research, in general, has exhibited a marked change of orienta- 
tion in the comparatively recent past. The earlier experiments were 
largely exploratory in character, and although they frequently involved 
the systematic variation of variables, their usual purpose was to determine 
what the important conditions of learning might be. The later experi- 
ments tended to be crucial experiments that tested opposed theoretical 
predictions. Still more recently, experiments that systematically vary 
variables that are theoretically important have appeared. By the latter 
statement, I am referring to experiments that seek to determine the 
properties of intervening variables. Experiments, for example, that at- 
tempt to determine the functions of development of such intervening 
concepts as sHr, D, sOr, etc., fall within this classification. In such in- 
vestigations, the intervening variables become, through a process of in- 
duction on the part of the experimenter, the dependent variables. This 
general shift of orientation appears to have taken place during the mid- 
dle 1930’s. Of course, a definite date cannot be set. There were certainly 
experiments before 1930 that were theoretically oriented, and in a sense, 
the shift in orientation is still going on. Nevertheless, the fact remains 
that there has been a change of orientation. The important thing, from 
the standpoint of this discussion, is that the shift of orientation in the 
field of rote learning has been much more gradual than has been the shift 
in other areas of learning research. 

If we wish to understand why this should be so, we must keep in 
mind several facts. An examination of the experimental literature of 
rote learning reveals the fact that the overwhelming majority of authors 
have contributed but a single experiment to the literature. That these 
are frequently converted Ph.D. dissertations is something that we shall 
have to consider in a moment. If we plot number of authors against 
number of papers per author, a typical J function is obtained, so that the 
number of individuals who have contributed 10 or more papers to the 
literature of rote learning is exceedingly small. This means, of course, 
that at any one time the field of rote-learning research tends to be domi- 
nated by a handful of individuals. The same general phenomenon may 
be observed with respect to laboratories. In any given ten-year period, 
most active laboratories will produce one or two papers in the field of 
rote learning while a very few laboratories will be tremendously pro- 
ductive. If we examine the literature of rote-learning experimentation 
for the fifteen-year period immediately preceding the time when learning 
experiments began to shift toward a systematic, theoretical orientation 
(1920 to 1935), we can observe that a small number of persons and 
schools dominated the scene. 

The University of Chicago, for instance, clearly led the field with 
respect to number of papers produced, and although most of these were 
of the one-paper-to-an-author variety, footnote credits as well as the 
nature of the experiments make it clear that the influence of such men 
as Carr and Robinson must have been considerable. It would be more 
difficult to select the individuals who exerted the greatest influence on 
the development of rote learning during this period, but Carr, McGeoch, 
and Robinson would unquestionably be among the first five or six indi- 
viduals to be named in this respect. Thus, the general empirical biases of 
Chicago Functionalism were well represented. Those who knew John 
McGeoch, who was undoubtedly the most important single contributor to 
the rote-learning literature of his period, may have objected to certain 
aspects of his work, but I have never heard him criticized for spending 
a disproportionate amount of time and effort attempting to systematize 
large amounts of rote-learning data in a single, precise, theoretical 
framework. 

It is also, perhaps, significant that there appears to have been a con- 
siderable amount of academic inbreeding among those who worked in 
the area of rote learning. McGeoch was a student at Chicago in the days 
of Carr and Robinson. Melton was a student of McGeoch’s at Washing- 
ton, then was with Robinson at Yale, and later returned to succeed 
McGeoch as head of the department at the University of Missouri, which 
was, in the days of McGeoch and Melton, one of the important centers 
of rote-learning research. Bunch, of course, was a student at Chicago in 
the days of Carr. Hovland was at Yale in the days of Robinson, although 
in this case the influence of Hull is clearly discernible. Underwood 
studied under Melton at Missouri and McGeoch at Iowa and has since 
become one of the major contributors to this area of research. Unques- 
tionably, the preceding few sentences present an oversimplification of 
the setting in which rote-learning research has been accomplished. 
Nevertheless, it is instructive to note that approximately 40 per cent of the 
rote-learning studies published in the Journal of Experimental Psychology 
during the five years from 1949 through 1954 were written by students 
of Carr, McGeoch, or Melton. 


SOME CHARACTERISTIC PROBLEMS AND THEORIES 


In the field of rote learning, a relatively small number of phenomena 
has received the bulk of research attention, and in each case, one or more 
relatively cogent hypotheses have been advanced. Before turning our 
attention to these subjects, however, a word should be said about three 
problems that underlie most of the research in rote learning. 

The problem of the unit of material to be learned. One assumption 
has been made, more or less implicitly, from the time of the earliest re- 
search down to the present day. This assumption is that, for purposes of 
experimental design and (sometimes) analysis of the data, the proper 
unit of the material to be learned is the individual item (or, in the case 
of paired-associate learning, the pair of items) in the list to be learned. 
Stated in another way, the assumption is that the characteristics of a 
list are determined in a relatively simple way by the characteristics of 
the individual items making up the list. It was on the basis of this as- 
sumption that much of the work on the calibration of materials by Glaze 
[4] and others [5, 20] was undertaken. The assumption is a natural one 
to have made considering the heritage of classical associationistic thought 
and the nature of the materials themselves—the materials being lists com- 
posed of discrete items or pairs of items. 

The matter is not so simple, however, for there is a considerable 
amount of evidence to indicate that, at least under certain circumstances, 
the organization of the list as a whole determines the characteristics of 
the individual item. This is another example of a familiar problem, the 
neglect of possible interaction effects. There are numerous suggestions in 
the published literature that such interaction effects exist. Perhaps the 
most striking demonstration of these effects was obtained by Hall [6]. 
In a preliminary experiment, Hall scaled nonsense syllables with respect 
to difficulty by the method of having subjects learn them. She then con- 
structed two lists of six items each, one list being homogeneous with re- 
spect to difficulty but composed of easy items; the other list also being 
homogeneous with respect to difficulty but composed of difficult items. 
She then ran four groups of subjects. The first group learned the entire 
easy list. The second group learned the entire difficult list. The third 
group, however, learned the easy list from which the last item had been 
deleted and the corresponding item from the difficult list substituted for 
it. Finally the fourth group learned the difficult list from which the last 
item had been deleted and the corresponding item from the easy list
substituted for it. Difficulty of the terminal item in each condition was 
determined by computing the mean number of trials required for that 
item to be given correctly. What was found was that the difficulty of the 
terminal item was determined very largely by the over-all difficulty of the 
list in which it was embedded. The easy terminal item in the difficult 
list was much more difficult to learn than was the same syllable in the 
easy list. Similarly, the difficult item when presented in the easy list be- 
came almost as easy to learn as the original easy item had been. 

The problem of the well-practiced subject. This problem also con- 
cerns itself with the neglect of possible interaction effects. The typical 
rote-learning experiment is designed somewhat as follows: let us assume 
that the experimenter wishes to determine the relative effects of four 
interpolated activities in determining amount of retroactive inhibition. 
He assembles a group of subjects, gives each subject one or more practice 
sessions to familiarize him with general rote-learning procedures, and then 
administers each condition of the experiment to each subject. Usually, 
the order in which each subject is given the various experimental treatments
is determined by a counterbalanced practice order. In its simplest
form, the counterbalanced practice order merely ensures that each condi- 
tion occurs equally often in each order. More elaborate counterbalancing 
procedures demand, in addition, that each condition precede and follow 
each other condition an equal number of times. The inadequacy of such 
a procedure should have been recognized as soon as Ward’s [17] data, 
showing that the curve of learning to learn is not linear, became 
available. 
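
The two levels of counterbalancing just described can be made concrete with a short sketch. It is an illustration only, not a procedure taken from any of the studies cited: the function names are hypothetical, conditions are coded as the integers 0 to n-1, and the second construction (often called a Williams square) is one standard way, assumed here, of making every condition immediately follow every other condition equally often when the number of conditions is even.

```python
def cyclic_square(n):
    """Simplest counterbalancing: each condition occupies each ordinal position once."""
    return [[(start + step) % n for step in range(n)] for start in range(n)]


def williams_square(n):
    """Balanced square for an even number of conditions (assumed construction).

    Each condition occupies each position once, and each ordered pair of
    distinct conditions occurs exactly once as immediate neighbours.
    """
    if n % 2 != 0:
        raise ValueError("this construction assumes an even number of conditions")
    # First order interleaves low and high codes: 0, 1, n-1, 2, n-2, ...
    first = [0]
    low, high = 1, n - 1
    while len(first) < n:
        first.append(low)
        low += 1
        if len(first) < n:
            first.append(high)
            high -= 1
    # Remaining orders are cyclic shifts of the first.
    return [[(c + shift) % n for c in first] for shift in range(n)]


def adjacent_pair_counts(square):
    """Count how often condition a is immediately followed by condition b."""
    counts = {}
    for row in square:
        for a, b in zip(row, row[1:]):
            counts[(a, b)] = counts.get((a, b), 0) + 1
    return counts


if __name__ == "__main__":
    for order in williams_square(4):
        print(order)
```

Neither square meets the objection raised above. If the practice curve is nonlinear, or if order interacts with the treatments, balancing the orders equalizes how often each condition meets each stage of practice but does not remove the effect of practice from the comparison.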

In addition, there is the further difficulty that there may be an inter- 
action effect between the effects of a particular independent variable 
and the order in which it occurs. Moreover, it is frequently impossible 
to determine whether or not such an interaction effect exists. For ex- 
ample, some years ago Lee Thune and I had an opportunity to go over 
the data of an experiment in which it was possible to analyze the inter- 
action of state of practice with the dependent variable which was, in 
this case, retroactive inhibition. The conclusion of the experiment for
the pooled results was quite valid. One of the several conditions did 
produce more retroactive inhibition than any of the others. However, 
this effect was concentrated in the earlier stages of the counterbalanced 
practice order. When that condition was analyzed for the last day of the 
experiment, retroactive facilitation rather than retroactive inhibition was
discovered. Fortunately, there is some indication that rote-learning ex- 
perimenters are beginning to abandon this type of procedure, although it 
would not be difficult to find numerous examples of the use of counter- 
balancing procedures in the contemporary literature. The use of subjects 
in several conditions does have advantages, of course. Within subjects,
variability is probably reduced, particularly if several practice sessions are 
given. That is to say, the subjects in each condition are automatically 
“matched” with respect to ability. Finally, and perhaps most importantly, 
a large total number of subjects is not needed. Nevertheless, in the opin- 
ion of this writer, the use of counterbalancing procedures is basically 
unsound unless it is known that order and the effects of the independent 
variables do not interact. In any event, experimenters should design their 
experiments so that this interaction effect, if it exists, can be determined. 
This goal can be accomplished, of course, by designing the experiment 
so that a sufficient number of replications of each order of counter- 
balancing exists to make such an analysis possible. 
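
The kind of analysis recommended here can be sketched as follows. The sketch assumes that every combination of condition and ordinal position has been observed at least once; the function names are hypothetical and the scores in the demonstration are invented for illustration, not taken from any experiment. It computes the mean score in each cell and then the interaction residual for each cell (cell mean minus condition mean minus position mean plus grand mean). Residuals near zero in every cell are what additivity of order and treatment would look like.

```python
from collections import defaultdict
from statistics import mean


def cell_means(records):
    """Average the scores in each (condition, position) cell.

    records: iterable of (condition, position, score) tuples, where position
    is the ordinal place of the condition in the subject's practice order.
    """
    cells = defaultdict(list)
    for condition, position, score in records:
        cells[(condition, position)].append(score)
    return {cell: mean(scores) for cell, scores in cells.items()}


def interaction_residuals(means):
    """Cell mean minus what condition and position effects alone would give."""
    conditions = sorted({c for c, _ in means})
    positions = sorted({p for _, p in means})
    grand = mean(means.values())
    cond_mean = {c: mean(means[(c, p)] for p in positions) for c in conditions}
    pos_mean = {p: mean(means[(c, p)] for c in conditions) for p in positions}
    return {
        (c, p): means[(c, p)] - cond_mean[c] - pos_mean[p] + grand
        for c in conditions
        for p in positions
    }


if __name__ == "__main__":
    # Invented scores: amount of retroactive inhibition (negative = facilitation).
    invented = [
        ("A", 1, 12.0), ("A", 1, 10.0), ("A", 4, -2.0), ("A", 4, -1.0),
        ("B", 1, 5.0), ("B", 1, 4.0), ("B", 4, 4.0), ("B", 4, 5.0),
    ]
    for cell, residual in sorted(interaction_residuals(cell_means(invented)).items()):
        print(cell, round(residual, 3))
```

This is a descriptive check only, not a significance test. With a single subject per order the residuals cannot be separated from individual differences, which is why replication of each order is needed.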

The problem of standardizing conditions. Earlier, in our discussion 
of the reasons why workers in the field of rote learning have produced 
so little in the way of comprehensive and systematic theory, we said that 
there was an additional reason for this lack of systematic theory. This 
reason is that the workers in this field have not had very many systematic 
data with which to work. 

Of the thousands of studies that have been conducted on rote learn- 
ing, it would be difficult to find more than a handful that have been 
conducted under comparable conditions. The same experimenter will 
often vary his technique of experimentation between successive experi- 
ments, and if the variability of technique within experimenters is con- 
siderable, the variability of technique among experimenters is enormous. 
True, most of the experiments of, let us say, McGeoch tend to resemble 
the other experiments of McGeoch with respect to methodology more 
than they resemble the experiments of Underwood or Hovland. It is also
true that there are isolated bodies of experiments in which the data have 
been gathered under conditions that are as comparable as conditions can 
be made to be. But these are exceptions. Generally, each experimenter 
determines the conditions under which practice in his experiment is to 
be conducted, with little attention being paid to the techniques employed 
by other investigators working in the same area. 

Some of these variations of technique are inevitable and some may 
be of little consequence. Since we have no aptitude tests for the abilities 
involved in rote learning, this must remain an uncontrolled factor, and 
these abilities probably vary from one laboratory to another. Frequently, 
because of practical considerations, an experimenter must use a particular 
piece of equipment rather than one he would prefer to use. It would 
be difficult for an experimenter working in Florida to reproduce the 
temperature and humidity conditions of an experimenter working in 
Colorado. Between investigators there are bound to be differences in 
amount of experience and degree of technical skill. Such differences are 
unavoidable, and it is possible that many such variables have little effect
upon the results of experimentation, although there can be no doubt that, 
in combination, they contribute to the variability of results between ex- 
periments. But most procedural variations between experiments are not 
of this unavoidable type. They result from the whimsy of individual in- 
vestigators and most of them are as unnecessary as they are inexcusable. 
Furthermore, many of these variations in procedures concern variables 
known to be important in determining performance. The most casual 
review of the literature will reveal an astonishing number of methodolog- 
ical variations, a few of which are listed below. 

1. Different experimenters give different instructions to their subjects, 
even when dealing with almost identical learning situations. 

2. Some experimenters give their subjects practice in rote learning 
before the beginning of the experiment; others do not. In the case of the 
former group, there is variation in the amount of pretraining that is 
given. 

3. Some experimenters use a counterbalancing technique; others use 
the “one shot” approach. There are also variations in the techniques 
used for counterbalancing. 

4. Usually, the specific materials learned by the subjects are not
published. More frequently, the reader is told that the words or syllables
were taken from somebody’s collection of materials and that they have a 
certain mean association value. There has been relatively little work done 
in the calibration of rote-learning materials. However, as might be ex- 
pected, there has been a considerable amount of variation in the methods 
of calibration employed by those who have worked in this important 
field. The result of this state of affairs is that only rarely do we find two
experimenters who have used exactly the same set of materials. More 
usually different experimenters, working on the same general problem, 
use lists that differ in length and difficulty as well as in the specific items 
making up the list. 

5. Although the 2-sec rate of presentation has been fairly well estab- 
lished as a standard rate, variations in rate of presentation can be found. 

6. Different experimenters use different intertrial intervals. 

7. Different investigators fill short retention intervals in different 
ways.

8. In the learning of nonsense syllables, some experimenters require 
their subjects to spell the syllables whereas others require them to pro- 
nounce them. 

These are but examples. However, if one takes any two papers on 
the same subject but prepared by different investigators, one is likely to 
find that the experimental procedures have differed in several (if not in 
all) of these ways. Under such circumstances, construction of a precise 
quantitative theory is almost impossible. Even the generality of any lawful
relationship obtained under such circumstances must be indeterminate
and unspecifiable. 

The point is not that a list of nonsense syllables used by Thune is 
better than a list used by Archer, or that one person’s set of instructions 
is necessarily better than the set of instructions used by someone else. 
The point is that gratuitous variation of practically every circumstance 
under which learning occurs, and this without plan or design, has made 
it impossible to compare results from different experiments that should 
be, at least roughly, comparable. 

This situation exists in most other areas of psychology, of course, 
although this fact does not particularly help the situation in the field of 
rote learning. There is no standard T maze, no standard Skinner box, 
no standard jumping stand. Amounts, kinds, and frequencies of rein- 
forcement differ from one animal experiment to another. It is often 
argued that standardization of conditions would restrict the experi- 
menter; that it would let him do only particular kinds of things. The 
truth is that standardization would increase the scope of his activities, 
for it would permit him to rely upon previously obtained experimental 
results with much more confidence than he is able to rely upon such 
results at the present time. The argument is not that an experimenter 
should be prevented from using any variable that he thinks is relevant as 
an independent variable. The argument is that he should do this within 
a standard frame of reference and that he should depart from that frame 
of reference by intention rather than by whimsy. 

I feel that perhaps it is time for those who are primarily interested 
in rote learning, and there are not too many of them, to get together and 
decide, arbitrarily if necessary, upon a set of standard conditions. In the 
case of most of the variables I have listed, it should not be difficult for 
them to reach agreement. The calibration of lists presents the thorniest 
problem. In order to do the job properly, a great amount of very tedious 
and unglamorous work would be required. Still, on the basis of calibra- 
tions of materials that now exist, it would not be difficult to prepare a 
group of lists, each list being designated by a code number. There would 
have to be several lists, each of several lengths and levels of difficulty. 
Moreover, there would have to be pairs of lists varying in length, diffi- 
culty, and similarity. There would have to be paired-associate lists as well 
as serial lists; and in all cases, lists of nonsense syllables, consonant sylla- 
bles, nouns, and adjectives should be constructed. Fortunately, a great 
deal of this work has already been done. What is needed is the arbitrary 
designation of the lists that already exist. Furthermore, experimenters 
should not be given absolute freedom to use any list they choose from 
among the collection of lists. For example, agreement might be reached
that experimenters wishing to work on the serial anticipation learning 
of nonsense syllables would work with a 10-item list taken from the 50 
per cent association value group of lists. If such agreement could be 
reached, and if a report could be published, it would be a considerable 
aid to scientific communication. Furthermore, editors of journals could 
be enjoined, and, I feel, with considerable success, to reject papers that 
did not conform to the standard conditions of experimentation. 

This is not to say, of course, that experimenters should not vary 
length of list, intertrial interval, the instructions to the subjects, and all 
the other variables that might be listed. To require this would be to re- 
duce rote-learning research to the endless repetition of a single experi- 
ment, the experiment of the standard condition. What is implied is that 
if an experimenter wishes to determine the effects of intertrial interval 
upon learning or retention or reminiscence, he should not also use origi- 
nal instructions, a nonstandard rate of presentation, and a list of dubious 
ancestry. It would also be perfectly legitimate to vary two or more vari- 
ables using a more complicated experimental design. Indeed, it would 
be very desirable to do so, for in this way possible interaction effects 
might be discovered. But, always, research should be conducted within 
the frame of reference established by the set of standard experimental 
procedures. In a perfect world, it would be very desirable if the specific 
standard conditions could be established empirically. However, the arbi- 
trary designation of a set of standard conditions would represent a tre- 
mendous scientific advance. There can be no doubt that this is one of 
the most pressing problems of contemporary research in the area of rote 
learning. 


A TYPICAL PROBLEM AND SOME OF THE THEORIES 
RELATING TO IT 


It is clearly impossible to discuss all the problems with which workers 
in the field of rote learning have dealt. Such problems as serial position 
effects, transfer phenomena, remote associations, and retroactive and 
proactive inhibition come immediately to mind. To discuss them all 
would require the writing of a book rather than a paper. In view of 
this fact, I have selected one phenomenon, the effects of distributing 
practice. This includes, of course, the special case of reminiscence. It is
not that distribution of practice is more important than many of the 
other problems that might be considered. Rather, in a discussion of dis- 
tribution of practice and the theories that have been advanced to explain 
the data that have been gathered in this area, we are considering an 
area of study that may be considered to be a typical example. What may
be said of distribution of practice might also be said about most other 
areas of rote-learning research. 

The phenomena relating to distribution of practice are well known 
and there is no need to review them here. It should be sufficient to note 
the general fact that the introduction of rest periods into a series of prac- 
tice sessions tends to increase the efficiency of acquisition and that, in 
general, the longer the rests, the more the improvement in performance. 
Naturally, there must be an optimal length of rest for any given type of 
task. Similarly, it is a fairly general finding that the amount of practice 
between rests is important, shorter practice periods between rests being 
more effective than longer ones. Again, we should expect that there is an 
optimal length of practice period for a given task and for a given length 
of the interpolated rest periods. Thus, in learning a list of 12 nonsense 
syllables, if one trial is given every two years, this pattern of distribution 
would probably not be as effective as massed practice, trial for trial. It 
certainly would not be as effective as massed practice in terms of the 
total time required to learn the list. Our primary concern here, however, 
is not to review the literature on distribution of practice, but, rather, to 
consider the theoretical approaches that have been made in an attempt 
to explain the phenomena of distribution. 

Concerning the classification of theories. A considerable amount 
has been said in recent years about the classification of theories. We have 
classified theories as being either S-R or R-R theories, as being behavioral 
or physiological, molar or molecular, or reinforcement rather than
nonreinforcement in orientation. Everyone is familiar with these distinctions
and there is no need to go into a detailed description of them. We shall, 
however, examine rote-learning theories in the light of these classifications. 
Before doing this, there is one more classificatory distinction that should 
be made because it is applicable to most of the theories that have been 
advanced in the field of rote learning. This is the distinction between 
theories that are vague and those that are precise in terms of prediction. 
Vague theories never die, nor do they usually fade away. This is because 
they are very nearly incapable of being disproved. Consider, for ex- 
ample, the classical perseveration theory as it applies to reminiscence 
and distribution phenomena. The only predictions that can be made on 
the basis of this theory are that performance will be better shortly after 
the end of practice than it was at the end of practice and that distributed 
practice will be more effective, trial for trial, than will be massed practice. 
No other specific predictions are made. Of course, there is a body of 
data concerning a host of the variables upon which distribution effects 
and reminiscence depend. These, however, can be incorporated into the 
theory by verbal fiat, since it is merely necessary to explain any specific 
datum by saying that this is the way in which perseveration operates.
Precise theories are much more vulnerable. If specific predictions are 
made concerning, let us say, the exact time that reminiscence reaches 
its maximum following a specified amount and kind of practice on a 
particular task, then the theory must be very nearly correct. With even 
a slight inaccuracy of prediction, it is only necessary for an experimenter 
to run a sufficient number of subjects to demonstrate that the divergence 
between prediction and data is too large to be accounted for by errors 
of measurement for the theory to be discredited. Thus, however much 
we may yearn for precise, quantitative theories, such theories tend to die 
young. Vague theories march on through the years and are eventually 
abandoned (usually to be resurrected once again) only because people 
are bored with them or because some more attractive hypotheses have 
been advanced. 
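
The vulnerability of a precise theory can be put in rough numerical terms. The sketch below assumes, purely for illustration, a one-sample z test with a known standard deviation and a true discrepancy of fixed size; the function name and the example figures are hypothetical and come from no experiment discussed here. It gives the approximate number of subjects at which a discrepancy of a given size between predicted and obtained means would exceed the chosen criterion.

```python
import math


def subjects_needed(discrepancy, sd, z=1.96):
    """Smallest n at which `discrepancy` is at least z standard errors.

    Assumes the standard error of the mean is sd / sqrt(n), so the
    discrepancy reaches the criterion once n >= (z * sd / discrepancy) ** 2.
    """
    if discrepancy <= 0 or sd <= 0:
        raise ValueError("discrepancy and sd must be positive")
    return math.ceil((z * sd / discrepancy) ** 2)


if __name__ == "__main__":
    # Hypothetical figures: scores with a standard deviation of 10 units.
    for miss in (5.0, 1.0, 0.5):
        print(miss, subjects_needed(miss, sd=10.0))
```

Halving the size of the error in prediction roughly quadruples the number of subjects required, but the number is always finite. A theory that commits itself to an exact value can therefore always be discredited by a sufficiently patient experimenter, whereas a theory that predicts only a direction cannot be caught in this way.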

Of course, it goes without saying that a theory need not be quantita- 
tive in order to be precise. Most precise theories are quantitative, but a 
qualitative theory that makes definitive predictions cannot be classed as 
vague. For instance, a theory that predicts that one individual would 
suffer from schizophrenia during his lifetime whereas another would not 
could scarcely be called a quantitative theory, yet it would not be vague
in its predictions. The accuracy of the theoretical prediction is, of course,
beside the point. The prediction is made, and in terms of the prediction, 
the theory can be tested. 

One further word should be said about rote-learning theory. Prac- 
tically no attempt has been made to set forth physiologically oriented 
theory in this area. To be sure, the perseveration theory that we men- 
tioned a moment ago has the sound of physiological explanation. 
Similarly, Hull et al. [7] have used such concepts as the stimulus trace. 
It is also true that individuals working within the framework of informa- 
tion theory have appealed to a physiological model, but it would prob- 
ably be more correct to say that some physiologists have appealed to an 
information theory model. For example, the work of von Förster [16] 
on memory uses the formal model of quantum theory and relates this, 
somewhat vaguely, to physiological theory. Actually, however, these
theories could exist as behavioral theories without carrying an additional 
cargo of physiological speculation. The reason for this lack of physiologiz- 
ing is, of course, that so little is known of the neurophysiology of learning 
that theories are behavioral out of necessity even though some of them 
use a terminology that smacks of physiology. 

As to the status of rote-learning theories with respect to the classi- 
fications of theories mentioned earlier, it must be noted that almost all 
rote-learning theorizing has been done in the framework of S-R concepts. 
This undoubtedly arises from the fact that the materials used in
rote-learning research lend themselves so well to this type of analysis. The
question as to whether rote-learning theories have tended to be molar or 
molecular is more difficult to answer. In order to answer the question, 
some definition must be given to the terms molar and molecular. It is 
true that most rote-learning theories, in so far as they use any unit of 
analysis at all, tend to use the individual item in the list as the unit. To 
this extent they tend to be molecular. Indeed, such theories have fre- 
quently been criticized as being too atomistic. Some of this criticism may 
be justified. It is true that any given list of nonsense syllables or other 
verbal material is likely to have some sort of organization within itself. 
It is also true that the individual learner organizes the material that is 
presented to him. Both of these factors may be important in determining 
the individual’s response to a particular item. On the other hand, it 
would be unfair to accuse the workers in this field of being blind to these 
problems. The literature concerning serial position effects, for example, 
(and it is a large literature) represents an attempt to study intraserial 
organization. The same thing may be said for much of the work on 
remote associations or transfer of training. But does analysis at the level 
of the individual nonsense syllable constitute atomism? No theories have 
been advanced that considered rote learning in terms of the movements 
made when the individual syllables were pronounced or concerning the 
nerve impulses that give rise to these movements. On the whole, however, 
it must be conceded that rote-learning theory is approximately as 
molecular as our knowledge and experimental techniques will permit it 
to be.

As far as the reinforcement issue is concerned, the issue is confused. 
There have been nonreinforcement theorists who have worked in the 
area of rote learning. However, it is probably safe to say that the majority 
of workers in this area, at least in recent times, have subscribed to some 
kind of reinforcement principle. The difficulty is that almost no work 
has been done in which amount, kind, and delay of reinforcement have 
been the independent variables. Those who subscribe to the reinforce- 
ment position tend to feel that rewards are “built in” to the rote-learning 
situation. This is usually rationalized by an appeal to the somewhat 
vague notion of knowledge of results. Not nearly enough is known about 
knowledge of results and its effects upon learning, and unfortunately, the 
classical rote-learning situation is one in which the systematic manipula- 
tion of this variable is extremely awkward. As a result of this situation, a 
considerable amount of lip service has been paid to reinforcement 
principles, but little theoretical or experimental work has been done on 
this problem. 

From what has already been said, it should be apparent that the 
great majority of rote-learning theories have been delightfully vague and 
that, as a result of this fact, many of them are hoary with age. 
With respect to specific theories of reminiscence and the distribution
of practice, these theories tend to fall into three main groups. Within 
each group except one there are several more or less independent 
hypotheses. 

Work theories. The first group of theories may be termed work 
theories. The general notion is that practice, although serving to 
strengthen habits, also produces some negative state that interferes with 
performance. With rest, this negative condition, always postulated to be 
more transitory than habit, tends to dissipate and performance is im- 
proved. In simple language, this is the fatigue theory. Some years ago, 
however, psychologists decided that fatigue was not a nice word to use, 
and as a result of this decision, a number of more or less synonymous 
terms have been coined. Thus, unfashionable though it may be to talk 
about fatigue, it is perfectly proper to talk about reactive inhibition. Of 
course, the definition of reactive inhibition as given by Hull [8] comes 
about as close as one could wish to being an excellent working definition 
of fatigue considered as an intervening variable. The concept of reactive 
inhibition does have one advantage over the older concept of fatigue. 
It opens the door to quantification. Perhaps it would be more precise 
to say that it unlocks the door in view of the fact that no one has at- 
tempted to apply the concept to rote-learning data in any strict sense. 
Even in the areas for which the concept of I_R was designed, the concept
has proved to be cumbersome and somewhat confused. In the field of 
rote learning, the difficulties are even greater. The principal difficulty 
lies in the extreme difficulty of obtaining a measure of the amount of 
work done in making rote-learning responses. 

Work theories break down into several hypotheses concerning the 
way in which work accomplishes its negative effect. These hypotheses 
vary according to the theoretical predilections of their formulators. In 
few instances do the several hypotheses yield different predictions. This 
is partly because most of the work theories make very vague predictions 
and partly because different hypotheses employ the same independent 
variables to predict the same dependent variables, but use a different 
name for the mediating cause or intervening variable. For example, both 
the fatigue theory and the motivation theory can predict the same 
events. In one case, it is maintained that the individual, having performed
a certain amount of work during practice, becomes fatigued and
that, as a consequence, his efficiency is impaired. The motivation theory 
would hold that the individual, having performed a certain amount of 
work during practice, loses motivation to continue and that this loss of 
motivation, in turn, depresses the efficiency of his performance. In each 
case, presumably, the individual will accumulate fatigue or suffer from 
decreased motivation in proportion to the amount of work done, and 
he will recover from these negative states in proportion to amount of 
rest given. Much the same thing can be said for a stimulus saturation
type of hypothesis. The classical example of such a hypothesis is con- 
tained in Pavlov’s [13] concept of internal inhibition. Internal inhibition 
supposedly builds up through the presentation of stimuli rather than 
through the making of responses. However, since the presentation of the 
conditioned stimulus is often followed by the making of a response, the 
work factor is confounded with the stimulation factor, at least in part. A 
possible crucial experiment would consist of having the subjects par- 
ticipate in rote learning under conditions of massed and distributed 
trials and also under circumstances in which it would be impossible to 
respond (as under the influence of curare). Following a standard num- 
ber of trials of practice, a delayed retention test could be given. 

It will be noted that all the work theories employ two theoretical 
variables in order to explain the phenomena of distribution of practice 
and of reminiscence. One of these variables is positive with respect to 
performance and is usually termed habit strength or excitatory potential 
or something that resembles these terms closely. This variable is con- 
sidered to be relatively permanent in the sense that it does not dissipate 
rapidly in time. The other variable is negative with respect to perform- 
ance. As we have seen, the negative variable may have a number of 
different names. The negative variable is held to be relatively transitory. 
Usually, it is held that the positive and negative variables interact by 
simple subtraction.
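
The common skeleton of the work theories can be written out as a toy model. Everything specific in the sketch below is an assumption made for illustration, since the theories themselves do not supply it: the positive variable grows by a fixed amount per trial and never decays, the negative variable grows by a fixed amount per trial and decays exponentially during rest, and performance is their simple difference. The function and parameter names are hypothetical.

```python
import math


def simulate(trials, rest, habit_gain=1.0, inhibition_gain=0.6, decay_rate=0.5):
    """Performance after each trial-plus-rest cycle under a two-factor model.

    rest: length of the rest following each trial (0.0 means massed practice).
    All functional forms and parameter values are illustrative assumptions.
    """
    habit = 0.0
    inhibition = 0.0
    performance = []
    for _ in range(trials):
        # Practice strengthens the habit and adds to the negative state.
        habit += habit_gain
        inhibition += inhibition_gain
        # Rest dissipates only the transitory negative variable.
        inhibition *= math.exp(-decay_rate * rest)
        # The two variables combine by simple subtraction.
        performance.append(habit - inhibition)
    return performance


if __name__ == "__main__":
    massed = simulate(trials=10, rest=0.0)
    distributed = simulate(trials=10, rest=2.0)
    print("massed:     ", round(massed[-1], 2))
    print("distributed:", round(distributed[-1], 2))
```

With these arbitrary values the model reproduces the general advantage of distributed practice, trial for trial. The result depends entirely on the gains and the decay rate chosen, and nothing in the verbal theories fixes them.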

The weakness of a theory that employs two opposed theoretical 
variables is obvious. Unless the operations that define the independent 
variables are tied to the theoretical variables specifically and in detail 
and unless the theoretical variables are related specifically to the measure- 
ment of the dependent variables, the theoretician can “explain” almost 
any experimental finding by manipulating the supposed values of the 
intervening variables. Under these circumstances, however, he is unable 
to predict his results in advance. 

An analogy might prove helpful in connection with this point. If a 
railroad car is shunted onto a straight level siding, it should be possible 
to predict with some precision how far the car will roll before it comes 
to rest. In order to make this prediction, we would have to know the mass 
of the car, its initial velocity, and certain factors that would enable us 
to compute the amount of rolling friction. Again, we have two theoretical 
variables, momentum and friction. Again, the latter variable detracts 
from the former. Now, if we were in the position of the rote-learning 
theorist, we would not know how to compute momentum nor would we 
be able to estimate the amount of friction in the system. As a result of 
this state of affairs we could explain any result. If the car did not roll as 
far as anticipated, we could explain this by saying that there was high 
friction or low momentum or both. It is apparent that an infinite number
of combinations of numbers representing the two variables could be 
devised, any one of which would “explain” perfectly why the car rolled 
a certain distance and no farther. It is also apparent that, under such 
circumstances, prediction would be haphazard. 
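
The point of the analogy can be put in a few lines of arithmetic. The sketch below is only an illustration under stated assumptions: the function names, the arbitrary integer units, and the value 10 for the observed result are hypothetical, and the only thing taken from the text is that the two variables combine by simple subtraction.

```python
def observed_performance(positive, negative):
    """Combine the two opposed theoretical variables by simple subtraction."""
    return positive - negative


def post_hoc_explanations(observed, negative_values):
    """For each assumed negative value, find the positive value that
    'explains' the observed result exactly (hypothetical helper)."""
    return [(observed + negative, negative) for negative in negative_values]


# Any observed result (here 10 arbitrary units) can be reproduced by as many
# pairs of values as we care to invent; none is favored over the others.
pairs = post_hoc_explanations(10, range(0, 5))
for positive, negative in pairs:
    print(positive, negative, observed_performance(positive, negative))
```

Every pair reproduces the observed value, which is exactly why none of them could have been used to predict it. Only an independent measurement of one of the two variables would reduce the list to a single entry.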

The argument that is being advanced does not apply only to work 
theories of the distribution phenomena or to rote-learning theory in 
general. It has been a favorite device of a great many psychological 
theorists in many areas to set up a pair of opposed theoretical variables 
(without defining their characteristics specifically, of course). Once this 
is done any result can be “explained” after it has been obtained, but 
preexperimental prediction is nearly impossible. Many examples of this 
type of theorizing come immediately to mind. Positive and negative 
valences (combined with barriers), unless they are defined in precise 
detail, can explain almost anything—after it has happened. Many of the
Freudian explanations suffer from the same flaw. These other inade- 
quacies of psychological theory, of course, do not excuse the existence of 
the same flaws in rote-learning theory. 

The differential forgetting theory. In some ways the differential for- 
getting theory resembles the work theories we have already discussed, 
at least in a formal sense. Because different theoretical concepts are 
used, however, this similarity may escape notice. Nevertheless, again 
we are dealing with a theory that postulates two factors, one positive 
and one negative. It is true that, in the case of the differential forgetting 
theory, the two theoretical variables are of the same nature. Briefly, the 
differential forgetting theory holds that, during practice, two sets of 
habits are acquired. One set represents the correct responses. The other 
set represents incorrect responses that presumably interfere with the 
execution of the correct responses. It is further held that the incorrect 
habitual responses are, on the average, weaker than the correct habitual 
responses. If one then assumes that weak habits are forgotten at a faster
rate than strong habits, one could deduce that after a short rest interval, 
a net benefit should accrue to the strong habits. It should be noted that 
this is precisely the same mechanism of prediction employed by the work 
theories. One theoretical variable (the strong correct responses) tends to 
resist the ravages of time. The other variable (the weak incorrect re- 
sponses) tends to dissipate in time. 
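
The deduction can be illustrated numerically. The sketch below is not part of the theory as stated: it assumes exponential forgetting, a subtractive combination of the two habit sets, and purely hypothetical strengths and forgetting rates, chosen only so that the weak habits decay faster than the strong ones.

```python
import math


def net_strength(rest, correct=10.0, incorrect=6.0,
                 slow_rate=0.01, fast_rate=0.2):
    """Net advantage of correct over incorrect habits after a rest interval.

    All parameter values are hypothetical. Exponential decay and simple
    subtraction are assumptions made for the sake of the illustration.
    """
    remaining_correct = correct * math.exp(-slow_rate * rest)
    remaining_incorrect = incorrect * math.exp(-fast_rate * rest)
    return remaining_correct - remaining_incorrect


# Compare no rest, a short rest, and a very long rest (arbitrary time units).
for rest in (0, 5, 300):
    print(rest, round(net_strength(rest), 3))
```

Under these assumptions a short rest raises the net value above its value with no rest, because the weak incorrect habits have largely dissipated while the strong correct habits have hardly changed. A very long rest lowers it again, since the correct habits are eventually forgotten as well.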

It should be emphasized that there are no “pure” measures of the 
strengths of either the correct or the incorrect habits. It has been sug- 
gested that overt errors might prove to be a reliable index of the strength 
of the incorrect habits, but the use of this measure rests upon the
assumption that the number of overt errors (a known quantity) is related
to the total incorrect habits’ strength in a fixed ratio, regardless of the 
stage of practice. Quite apart from the fact that we have no way of
arriving at a reasonable value for the fixed ratio, it is extremely doubtful 
that such an invariant relationship exists. For example, if an individual 
is able to recite a 12-unit list of nonsense syllables perfectly on a single 
trial there are, of course, no overt errors on that trial. It would be 
foolish, however, to conclude that intraserial interference had been re- 
duced to zero. About the best that can be done is to demonstrate that, 
in the learning of whole lists, those lists that are given more practice or 
are learned to a higher performance criterion tend to be forgotten less 
rapidly than those lists that are less well learned. This would, to some 
extent, demonstrate the reasonableness of the theory. On the other hand, 
it should not be forgotten that, in following this procedure, the measure- 
ments are very gross and the results of such experiments contain the 
effects of forgetting of the incorrect, interfering responses as well as the 
forgetting of the correct responses. 

We have talked about the dissipation of strong and weak habits in 
time, and in a sense this was proper since, in distribution experiments, 
time is one of the independent variables. However, if one holds to an
interference theory of forgetting, an interesting prediction can be made. 
If one combines an interference theory of forgetting with the differential 
forgetting theory of distribution effects, one has to conclude that, during 
the rest periods, the subjects learn (informally) material that interferes 
with both the strong, correct habits and the weak, incorrect habits. 
Furthermore, one would have to conclude that the interference affects 
these two sets of habits differentially. If this is the case, it should be 
possible to enhance distribution effects by introducing minute amounts 
of interpolated material designed to cause interference during the rest 
periods. By increasing the amount and similarity of the interpolated 
material, of course, this trend should be reversed and distribution (with 
interpolated learning in the rests) should be less effective than
continuous practice.

The perseveration theories. In contrast to the “two-factor” theories 
we have discussed above, perseveration theories use but a single 
theoretical variable. In all these theories the general notion is that 
practice (or stimulation) sets in motion learning processes that con- 
tinue after formal practice has ended. As is true of work theories, the per- 
severation theories do not differ sharply among themselves with respect 
to their formal characteristics although there may be considerable dif- 
ferences in terminology. Here is the fundamental question: are we to 
consider distribution effects to be a result of the interaction of two 
theoretical variables, or will one theoretical variable suffice to illuminate 
the data?

The clearest example of a perseveration theory is the rehearsal theory. 
At a naive level, what is implied is that the subject employs the rest
intervals in a distribution experiment to rehearse the material to which
he has been exposed. Again, at a naive level the rehearsal theory is
usually differentiated from the classical perseveration theory of Müller
and Pilzecker [11] on the basis of whether or not the subject intended to
rehearse or whether or not the rehearsal was conscious. These distinctions
imply an involvement with the mind-body problem which we cannot
and, indeed, should not, discuss here. Needless to say, the distinctions are
not mutually exclusive. A person may be aware, i.e., be able to verbalize,
that the learning materials have been subvocally rehearsed during the
rest intervals, but he may be completely unaware of a verbalized
intention to rehearse them. Some of the most frequently heard, and hence
most banal, singing commercials may be whistled or subvocally sung
by an individual for hours at a time even though he earnestly desires to
[...]t those particular sequences of tones from his existence forever.
Similarly, it is possible that inarticulate rehearsal occurs, the individual
[...] and rehearsal theory are probably misleading.

At the same time it should be pointed out that psychologists have
spent a great deal of time and have exercised a considerable amount
of ingenuity in order to prevent rehearsal (and, hence, perseveration)
from taking place during the rest intervals of distribution experiments.
These efforts to control the effects of rehearsal and perseveration,
incidentally, may very possibly have introduced more variability into the
data than would have been the case if no such control had been attempted.
It is interesting to note, however, that little has been done to
determine what the effects of rehearsal (considered as an independent
variable) might be. The almost universal assumption has been that
rehearsal during the rest periods would enhance postrest performance. The
results obtained by Rohrer [14], who instructed his subjects to rehearse
during the rest period, certainly do not support this assumption.* If
rehearsal, as a form of perseveration, has any effect upon post-rehearsal
performance, it should be possible to treat rehearsal as an independent
variable and to determine how rehearsal relates to other variables in the
determination of learning performance. Far too little work has been
done on this important topic.

*Some years ago G. R. Grice, J. Kling, and I performed an experiment that
was intended to expand Rohrer’s findings. The data have never been published
owing to the fact that the randomly selected groups did not seem to be reasonably
well matched with respect to rote-learning ability. However, the data suggest
quite strongly that, if subjects are instructed to rehearse during a rest interval
introduced early in practice, rehearsal is beneficial to subsequent performance. If
rehearsal is introduced after a moderate degree of practice, rehearsal appears to
have little effect. Late in practice, the data suggest that rehearsal may cause a
decrement in subsequent performance.

In addition to the classical perseveration theory and the rehearsal 
theory, the stimulus-maturation hypothesis should also be mentioned. This 
theory, proposed by Wheeler [18] and Wheeler and Perkins [19], clearly 
belongs with perseveration theories. According to the stimulus-matura- 
tion hypothesis, learning is a form of maturation or growth, dependent 
upon stimulation, but relatively independent of the rate of stimulation. 
If enough initial practice is given to initiate this growth, presumably the 
growth will continue without additional stimulation. This implies that 
performance will be closely related to the amount of time that has 
elapsed since the beginning of practice but that performance will not be 
closely related to the number of trials. There are obvious limits to this 
concept of perseverating growth, although these limits have not been 
spelled out in the theory. It is apparent that practice can be so widely 
spaced that little or no learning takes place. On the other hand, we 
have ample evidence to support the view that moderate distribution en- 
hances performance. These facts make it necessary to add to the theory 
(if it is to be stated in specific form) a statement as to the amount of 
time the growth process will continue to operate following stimulation. 
If the stimulus-maturation hypothesis is considered in this way, it is at 
once placed on the same basis as other perseveration theories. That is, 
something continues after practice has ended to produce increments in 
performance. This continuing process operates for a certain period of 
time and then stops, unless additional practice is given. Wheeler prefers 
to refer to this continuing process as growth or maturation. Others have 
preferred to speak in terms of the perseveration of the neurophysiological 
processes that underlie learning. Still others have preferred to speak of 
rehearsal. In terms of a behavioral formulation, all are saying the same 
thing while using different words. 


A FINAL WORD 


Except for the Mathematico-deductive Theory of Rote Learning, 
little has been produced in the way of comprehensive systematic theory 
in the field of rote learning. A tremendous quantity of data exists,
however. Most of these data are subsumed under various hypotheses that
were designed to cover only a limited range of phenomena. Within each 
area of rote-learning research (such as transfer, serial phenomena, 
reminiscence, etc.), we usually find several different special hypotheses. 
More frequently than not, these differ with respect to their terminology
to a greater extent than they do with respect to their formal characteristics.
Quantitative theory is virtually nonexistent.
One reason for the lack of quantitative theory is that the independent
variables have never been properly scaled and dimensionalized. Perhaps 
there is no way in which this can be accomplished. Perhaps we shall 
never have units comparable to the meter, gram, and second against 
which our independent variables can be plotted. However, an attempt at 
scaling the important variables could be made. Little has been accom- 
plished in this area. Another reason we have already mentioned: the 
fact that no set of standardized conditions has ever been devised for the 
research that is being done in this area. 

What does the future hold? I feel that it is safe to say that experiments 
in the field of rote learning will continue to appear in our journals. The 
frequency of such experiments will probably decrease relative to the fre- 
quency of experiments in other areas of learning experimentation. In 
addition, I feel that the time is propitious for some people who are in- 
terested in this area to attempt to review the enormous amount of in- 
formation that has been gathered and to attempt to incorporate large 
amounts of these data within a single theoretical framework. Some of this 
work has already been undertaken. Osgood’s [12] attempt to cast data on 
transfer of training into a single system is an example of this type of 
activity. Unfortunately, in this instance, Osgood was hampered by the 
fact that neither of the two independent variables he employed could 
be quantified in any very meaningful way. This, of course, leaves the 
shape of the transfer surface which he drew somewhat indeterminate. 
Nevertheless, further attempts of this sort should be made. Finally, and 
above all, there is a desperate need for standardization of conditions. 
With standardization, quantification of our data and our theories be- 
comes a potentiality. Without standardization, the present state of rela- 
tive chaos is likely to continue for an indefinite period. 

The term ethology, formerly used in a broad sense for the study of
animal behavior and ecology, is now taken to refer primarily to the 
former. Used in this way it should refer to a field of study, and not to any 
particular theories or methods used within that field. In recent years 
it has come to be employed particularly by a group of workers who, 
though differing widely themselves in the problems they tackle and in the 
theoretical interpretations (if any) which they adopt, share certain basic 
orienting attitudes which are perhaps more important as unifying factors 
than any particular theoretical scheme [129, 207, 222]. Partly because 
of the difficulties of international communication during the period when 
the early work was done, these workers happen at the moment to be
mostly Europeans. They have, of course, no exclusive right to the term 
“ethologist”—indeed, similar approaches have been used independently 
with great effect by other workers in America and elsewhere. But it is 
convenient, for the purposes of this volume, to restrict discussion mainly 
to the work of this group, even though this necessitates the use of 
“ethology” in a restricted, and perhaps already outmoded, sense. 

Although this work can be said to have sprung primarily from the 
publication of Lorenz’s Der Kumpan in der Umwelt des Vogels in 1935 
[121], a shortened English version of which was published in 1937 
[122], the seeds had been sown a generation earlier by such workers as 
O. Heinroth in Germany, C. O. Whitman and W. Craig in the United 
States, and J. Huxley and E. Eliot Howard in England. During the pre- 
war and war years the literature was little known in the English-speaking 
world, but in the last decade it has had an increasing influence among 
zoologists and psychologists. At the present time, papers appear
principally in Zeitschrift für Tierpsychologie, Behaviour, British Journal of
Animal Behaviour (now called Animal Behaviour), and a number of 
other biological and psychological journals. Tinbergen [216] has pro- 
vided a recent summary of much of the work discussed here in his book 
The Study of Instinct. Thorpe [209] has provided a link with studies in 
other fields in Learning and Instinct in Animals. 

Since ethologists are now numerous and hold divergent views on 
many topics, it is no easy task to present a summary or analysis of their 
views. Indeed, now that their attitudes and ideas have spread to related
fields also concerned with animal behavior, it was difficult to decide 
which work should be included in the present discussion. Therefore, 
although the author is deeply indebted to those (Professor G. P. Baer- 
ends, Dr. D. S. Lehrman, Dr. P. Marler, Dr. W. H. Thorpe, Dr. N. 
Tinbergen, Dr. W. S. Verplanck) who have read this chapter in draft 
and made innumerable helpful criticisms and suggestions, he cannot 
claim that, in its final form, it represents any opinion but his own. 


Orienting Attitudes 

The three whys. The men whom Lorenz [128] describes as the 
“pioneers” of this approach—Whitman, Heinroth, Huxley, Verwey, and 
others—were for the most part zoologists. This zoological bias has greatly 
influenced the development of the school and has been especially im- 
portant in determining the kind of questions asked. For the zoologist, 
the answer to the question “What causes it to be here?” gives only a 
partial understanding of the structure concerned. Other problems also
are raised—“How does it work and what is its function?” “What is its 
evolutionary origin and history?” and so on. Similarly, when the ethologist 
asks “Why does the animal behave in this way?” he requires at least
three answers—causal,* functional,* and historical.

It is, apparently, still necessary to emphasize that the study of func- 
tion and evolution, as well as causation, does not imply a confusion be- 
tween them—the three questions are distinct. Nor does it imply that all 
behavior is functional in that it contributes to survival or reproduction: 
of course it is not. (However, even apparently nonfunctional behavior 
may be the product of mechanisms or potentialities of biological
significance in other contexts.)

Although these three aspects of behavior study are quite separate, 
advances in one may assist progress in another. For example, knowledge 
of the role of “releasers” (p. 584) in the causation of behavior is essential 
for an understanding of the evolution and function of bird courtship dis- 
play (p. 590). This sort of liaison between the different approaches has 
been particularly fertile because it was an interest in evolution which 
first focused attention on the relatively stereotyped motor coordinations 
known as “Fixed Action Patterns” (p. 573) and promoted research into 
their causation [cf. Lorenz, 128, 129]. 

Nevertheless, the greatest part of the work discussed here was directed 
at problems of causation, and in conformity with the other chapters in 
this book, it is with this that we shall be principally concerned. 

Scope. The work of these ethologists cannot be characterized by the 
kind of phenomena studied—although attention has been concentrated 
mainly on the behavior of insects and lower vertebrates, work has also 
been done on animals ranging from Protozoa to man, and has been 
concerned with both relatively simple reflexes and complicated learned 
behavior. In fact the approach is essentially a broad one, and many of 
these “ethological” studies could equally be classified as psychological,
zoological, taxonomic, ecological, physiological, neurophysiological, 
endocrinological, pharmacological, genetic, or even botanic. The specific 
characteristics of the work thus lie not in its scope but in matters of 
attitude and emphasis. It is the purpose of the following pages to make 
these explicit. 

Levels and aims. The material with which ethologists are concerned 
is usually at the behavior level. If prediction of behavior, given the ante- 
cedent conditions, were the sole aim, there might be no need to analyze 
further than this: reference to underlying mechanisms could be
unnecessary. But even if the complete prediction of behavior were possible,
we should still have advanced only one stage towards its full
understanding: a further stage would be reached when the organization of
behavior could be understood in terms of the physiological organization
which it reflects. Thus the hypotheses of ethologists must be judged not
only at the behavior level, but also in terms of their compatibility with
work at lower ones.

*Ethologists have not been over-sophisticated about the word “cause,” and so
far no confusion has resulted. For those who prefer it, read “variable, of which
behavior is a mathematical function.”

*In asking questions about the function of a type of behavior, it can usually
be understood that the ethologist wants to know about the nature of the selection
pressure to which the animal species has been subjected for individuals to behave
in this way (see p. 577).

Indeed, a theoretical system which by its nature cannot be related 
to physiological data is unlikely to have a wide validity even at the
behavior level. To take a parallel from physics, Toulmin [228] has 
pointed out that it is “a great virtue of a good model that it does suggest 
further questions, taking us beyond the phenomena from which we 
began, and tempts us to formulate hypotheses which turn out to be 
experimentally fertile.” Thus the Greek antenna-model of sight, and the 
models of thermal and gravitational phenomena as the effects of caloric 
and gravitational fluids, prompted unprofitable questions; on the other 
hand “the model of light as a substance in motion is a good model, not 
only because it provides us with an easily intelligible interpretation of the 
diagrams of geometrical optics, but also because it . . . leads us to 
speculate about light-particles or light-waves as the things which 
travel, or are propagated: these speculations have borne fruit” [228]. 
Of course, just as an explanation of shadow-casting in terms of the wave 
theory of light tends to be laborious, prediction of behavior via all the 
physiological intermediaries would be hopelessly cumbersome. But the 
behavior model should be at least compatible with, and—if possible— 
translatable into, the physiological one. For the ethologist, therefore, 
analysis down to a physiological level is a desirable, but not at present 
usually attainable, aim. 

“Physiologizing” is, however, more than a long-term aim. Physiology 
is a valuable source of information for the behavior student. A fertile 
psychology cannot afford to ignore physiological data which may suggest 
useful concepts. Further, psychologists who deny themselves such data 
may be neglecting an important source of evidence on which to base 
their predictions of behavior. Ethologists aim, therefore, at an analysis 
of behavior into units which the physiologist will be able to handle— 
and believe that they can be guided in that analysis by a consideration 
of present physiological knowledge [cf. 69].

Description and classification. Psychologists who try to model their 
approach on that of classical physics fail to recognize that it dealt largely 
with everyday events—falling apples, sticks in water, or melting ice. It
could, therefore, often afford to pass quickly over the descriptive phase 
which is so important in the development of other sciences. In biology, 
by contrast, the work of generations of taxonomists and systematists was 
necessary before much progress could be made, and in the study of
behavior description and classification are equally essential.

Psychologists whose theoretical systems have been based on limited 
experimental techniques have ignored this, concentrating their attention 
on a few responses and neglecting the rest of the behavior of the animal. 
Indeed the laboratory experimenter, in an attempt to reduce variability, 
often not only purposely restricts the behavior of the animal in which 
he is interested, but also limits sharply the aspects which he is willing 
to record.* The initial data of ethology, on the other hand, often refer
to animals living free in nature, whose behavior is not controlled by the 
limitations of experimental surroundings: the behavior must thus first 
be described and classified. Much of the preliminary work in ethology 
consists of this: it forms an essential first stage before analysis can be 
undertaken. 

Experimental techniques. The experiments which follow this descrip- 
tion and classification must be flexible, and the techniques used suited 
to the problem in hand and the stage of analysis reached. Sometimes ap- 
parently crude experiments in the field yield more useful results than 
superficially sophisticated ones in the laboratory, where the meaning 
of the results is obscure. The investigator who overemphasizes the role 
of precise laboratory experiments may limit his own view, for it becomes 
less likely that the animal will do anything which the experimenter does 
not expect. Of course, controlled experiments are essential in the study of 
behavior, but should come after qualitative study. 

Comparative approach. Generalizations about behavior which are to 
be valid for many species must be based on comparative work: gen- 
eralizations which precede comparison can never be more than guesses. 
However, the comparative method must be used cautiously, for ap- 
parently close similarities may be merely parallel evolutionary adaptations 
to a similar environment, and rest on quite different causal bases. It is 
thus necessary to choose for comparison first species believed on other 
grounds to be closely related phylogenetically, and only then more
distantly related ones.

Pursued in this way, a comparative approach does more than permit 
generalization. Even in the initial stage of selective description, knowl- 
edge of the behavior of one species may be helpful in the study of a 
related one. The present writer would certainly never have noticed 
the occasional pivoting movements made by Greenfinches (Chloris 
chloris) if he had not already been familiar with the highly ritualized 
(p. 595) and thus conspicuous pivoting display of the related Goldfinch 
(Carduelis carduelis) [79]. At many other levels complex behavioral
phenomena which have been elaborated after a long evolutionary history
may be more easily understood if first studied in a relatively primitive
and uncomplicated form. Further, comparison is the only method available
for investigating the evolution of behavior—the origin of many display
postures would be quite obscure if it were not possible to observe
them in a less ritualized form in related species [128, 129].

*Just because of this, the “displacement activities” shown by rats in mazes
and puzzle boxes often went unnoticed.

Finally it must be emphasized that the differences revealed by com- 
parative study may be as interesting and important as the similarities: 
both will provide new theoretical insight into problems of causation, 
function, and evolution. Such differences must, however, be considered 
in relation to the biology of the species: for example, measures of “in- 
telligence” obtained by applying the same test to widely differing species 
are meaningless unless this is done [9, 216]. 

Quantitative specificity. Analysis of behavior must start from a 
qualitative basis and by stages become more quantitative: the early 
qualitative work is an essential preliminary to a more precise analysis. 

When generalizations are made, the broader the generalization the 
less quantitative specificity can it have. There is, therefore, a limit to the 
precision which is necessary before principles of broad application can 
be stated. On the other hand, of course, for detailed prediction a high 
degree of specificity is essential. The degree of quantitative precision 
which is desirable in any particular case thus depends on the immediate 
aim. 

Organization. Informal exposition is at present best suited to the 
varied nature of the work discussed here and the tentative status of the 
concepts employed. In some branches of the subject a more formal ap- 
proach may become desirable in the near future. 

Use of models. Three models which have played a part in the 
development of ethology (as here defined, p. 562) must be mentioned. 
Lorenz [e.g., 124, 128] used a hydraulic reservoir model in discussing
problems of motivation: this was an “as if” model manifestly remote
from the mechanisms it was supposed to represent. Lorenz himself
referred to it as a “crude model” which was used merely “to symbolize 
a surprising wealth of facts really encountered in the reactions of animals” 
(see p. 582). Tinbergen [e.g., 215, 216] used a basically similar model 
in his “hierarchical system” of “nervous centres” which are “loaded” with 
“motivational impulses.” Though he recognized the “provisional and 
hypothetical nature” of his scheme, Tinbergen described it as “a graphic 
picture of the nervous mechanisms involved.” These models have been
very useful in synthesizing the products of analysis and in orienting ideas,
but they are unlikely to continue to be fertile in the future. Tinbergen’s 
scheme contains a number of properties which result from its hydraulic 
nature and are misleading when applied to the original: the passage 
from the remote “as if” Lorenzian model to the “close” Tinbergen
one has resulted in a confusion between the properties of the model and 
those of the original [78, 80]. 

Von Holst [e.g., 86, 87, 150], considering the relations between the 
central nervous system and peripheral sense organs, has developed a 
theory of “Reafferenz” which bears similarities to the feedback loops 
of control systems. He claims that his “functional schemata, constructed 
in order to illustrate definite causal relationships, are quite abstract, 
although the consequences they predict are concrete and experimentally 
verifiable.” This scheme is being applied in a number of contexts on 
the boundary between physiology and behavior study. 

Synthesis and analysis. Analysis alone cannot provide an under- 
standing of biological organization—wholes as well as parts must be 
studied [165]. The importance of resynthesizing the products of analysis 
must therefore be stressed. It enables emphasis to be laid on the inter- 
relations between factors, and thus permits the adequacy of the initial 
analysis to be assessed. Further, it provides a means of envisaging the 
behavior of the animal as an organized whole, and thus not only
complements the initial analysis of causation, but also shows up the functional
relations between the parts. 


THE DESCRIPTION AND CLASSIFICATION OF BEHAVIOR— 
SOME BASIC CONCEPTS 


General 


Just as the systematist of organisms has “to break up the almost 
unlimited and confusing diversity of individuals in nature into easily 
recognizable groups, to work out the significant characters of these units, 
and to find constant differences between similar ones” [147], so the 
ethologist must break up the complex stream of behavior into units suit- 
able for study, and classify these into groups for ease of reference. A 
constant awareness of the problems involved in description and
classification is essential, for the future course of research may depend on the
choice of units, and the criteria by which they are recognized become
the bricks from which hypotheses are built. 


Description and Classification into Behavior Types 


The terms “response,” “reaction,” and so on can refer either to a
single unique event occurring over a limited space of time, or to a group 
of such events, classed together as having certain characteristics in 
common. There is a reciprocal relation between the criteria of the be- 
havior classes, and the way the unique events are described in the first 
instance. Either the description will be influenced by the way the behavior
is to be grouped or, since the criteria of the behavior classes must
be drawn from the data available, the nature of the classes will be in- 
fluenced by the description given. In practice, descriptions of behavior 
fall into two main types: 

Physical description. For some purposes, an adequate description can 
be given in terms which refer ultimately to the strength, degree, and 
pattern of muscular contractions (or glandular activity, change in some 
other physiological property, etc.). In practice such “physical descrip- 
tions” are often limited to patterns of limb or body movement, for a 
complete description would be not only impracticable but also un- 
necessarily refined and cumbersome. Terms such as “knee jerk,” “posi- 
tive supporting reaction,” “sleeping posture” are of this type. 

In classifying on the basis of such descriptions, we are grouping 
together similar spatio-temporal patterns of muscular contraction— 
though the classification will depend only on selected characteristics, 
others being ignored. The choice of the characteristics employed should, 
of course, have an objective justification, but they may vary greatly in 
complexity. Thus “crouching” by a bird may be characterized merely 
by a certain degree of bending in the tarsal joint, but in other cases a 
complex spatio-temporal pattern of muscular contractions (e.g., as in 
tail flicks), or simultaneous or sequential occurrence of a number of dif- 
ferent component patterns (e.g., as in the courtship postures of birds), 
may be used. In this latter case the selection of criteria is further com- 
plicated by the disappearance of some components when the response 
is given at low intensity. The usual practice here is to describe low and 
high intensity forms and indicate the intermediate variations—a method 
roughly comparable to that used by taxonomists for clines [94], but often 
leading to the description only of rare extreme cases. 

Description by consequence. A second method for the description 
of behavior may be called “description by consequence.” “Picking up 
nest material,” “pressing down the lever,” or “approaching” do not 
refer to particular patterns of muscular contraction but cover all patterns 
which lead (or could lead, and there is room for ambiguity here) to the 
specified result. Descriptions of this type are normally used when the 
behavior involves orientation to objects in the environment, and when 
the motor patterns, though leading to a constant result, are themselves 
diverse. They thus tend to be used especially for “appetitive” [33, see 
p. 572] behavior.

There are two advantages in descriptions of this type. First, a brief 
description may cover a multitude of motor patterns each of which is 
variable—thus “approaching” covers hopping, walking, flying, sidling, 
etc.: they therefore provide an essential shorthand. Naturally each
behavior-class characterized by consequence could be subdivided into
smaller groups characterized by physical description. The number of
these may be very limited. Thus the Herring Gull (Larus argentatus) 
uses only one method for retrieving eggs which have rolled out of its 
nest—it puts its beak beyond them and tries to maneuver them back 
under its breast. In spite of the inefficiency of this technique, the bird 
never attempts to use its apparently more suitable webbed feet or wings 
[220]. In other cases the number of smaller groups characterized by 
physical description, into which a behavior class characterized by con- 
sequence could be divided, may be large. Even within one method of 
locomotion it would be theoretically possible to specify an unlimited 
number of time units in each of which the pattern of muscular contrac- 
tion was different from that in every other, and to group these in an un- 
limited number of classes. However, few of these classes would have 
objective justification and few would be of any help in analyzing the 
behavior. This is a reflection of the fact that extensive variability of 
motor patterns does not necessarily mean extensive variability of mecha- 
nisms—just as, although touching a point with the finger could involve 
an infinite variety of motor patterns, it involves localization in only three 
dimensions. 

A second advantage of descriptions by consequence is that they often 
call attention to essential features of the behavior which may not appear 
in physical descriptions—such as orientation with respect to the environ- 
ment, or responsiveness to external stimuli. Thus an animal’s behavior 
may change radically (e.g., from one type of appetitive behavior to 
another) with little change in the motor pattern; or, alternatively, a 
radical change in the patterns of muscular contraction (e.g., from flying 
to walking) may be, for some purposes, of little significance. Description 
by consequence is thus often essential for a full understanding of the 
behavior. 

There are, however, also dangers in descriptions of this type. First, 
the manner in which categories of behavior are described may have im- 
portant consequences on subsequent hypotheses—whether, for instance, 
we say “entering the goal box” or “escaping from the alley.” 

Second, as stressed by Meehl and MacCorquodale [148], such descrip- 
tions are susceptible to overinterpretation. The statement “the rat learned 
to press the lever” can be interpreted as pure description, but can also 
be overinterpreted to imply reciprocal relations between the various be- 
havior classes by which the lever could be depressed, such that a decrease 
in frequency of one is followed by a compensatory increase in another. 
This may be so, but does not necessarily follow from the initial descriptive 
statement. As we have seen in the case of the egg-rolling response, the 
number of alternative response classes which can be used in bringing 
about a given consequence may be surprisingly limited.

Whichever type of description is used, it inevitably involves the rejection
of some of the data and a selection (arbitrary, intuitive, or
rational) of the features to be studied—as soon as we move away from 
the initial cinematographic record of the behavior we are rejecting in- 
formation. Just because description and classification involve rejection 
of data and the selection of criteria, there are always the dangers that 
the essential data are rejected, that the units used are irrelevant, too 
small or too large for the task in hand. 

For instance, descriptions of reproductive behavior in terms of “ma- 
ternal behavior,” “sexual behavior,” etc., are inadequate for a proper 
analysis of the stimuli controlling them [216]. Similarly, the understand- 
ing of the apparently “insightful” tool-using behavior of primates is 
greatly facilitated by examination of its components [185]. 

On the other hand, insufficient rejection of data may lead to a hope- 
lessly confusing mass of detail, and divert attention from the essentials 
of the problem. Thus any attempt to describe precisely the patterns of 
muscular contraction involved when a fighting bird strikes its adversary 
with its beak would be hopelessly difficult because of the variations in the 
initial posture and relative positions of the combatants—the only possible 
criteria by which the behavior can be categorized are of the consequence 
type. Similarly, Nissen [162], as a result of his work with chimpanzees, 
has argued that there is a logical necessity for descriptions of this type 
in much complex behavior. 

The two types of description are thus not always alternative. Descrip- 
tion in terms of muscular contraction is unsuitable in some cases because 
of the complexity of the data which it provides, and in others because 
the data cannot be subdivided into classes with an objective validity. In 
other cases, description by consequence is equally unsuitable. Sometimes 
—as in the threat and courtship postures of birds, which involve both a 
relatively stereotyped motor pattern and an orientation with respect to 
the environment—both must be used. 

We must now consider the ways in which these behavior classes can 
be grouped together. 


Classification of Behavior Types 


The classificatory systems used by the systematist of organisms are 
based on common characters—the characters being chosen in such a way 
that they indicate phylogenetic relationships between groups. In classify- 
ing behavior types it is necessary to use systems which lay emphasis on 
causal and functional relations as well as on phylogenetic ones. Char- 
acters derived solely from physical descriptions are sometimes of little use 
for the first two purposes, and other types of criteria must be employed. 
A brief survey of these will serve to introduce some basic concepts.

Four principal systems of classification are used: first, a broad division
into appetitive and consummatory behavior, and then further independent
classifications according to causation, function, and evolutionary
and ontogenetical history. Although any particular pattern of behavior 
can be examined by any of these methods, the classificatory systems them- 
selves must, of course, remain independent. Examples of behavior 
grouped by one set of criteria need not share characteristics relevant to 
one of the others. 

Like its structure, the behavior of each species has undergone a 
greater or lesser degree of evolutionary differentiation. There is thus no 
a priori reason why the categories useful for one species should neces- 
sarily also be useful for another; once again, comparison is necessary 
before generalization is possible. In practice, a degree of compromise is 
often necessary—the most useful categories may be those which are 
applicable over a moderately wide field with a moderate degree of pre- 
cision. Further, categories useful at a given stage of research may later 
become superfluous: many of those discussed below have proven value as 
working tools but may not be fundamental. 


Appetitive Behavior and Consummatory Acts 


The value of moderately precise categories can immediately be seen 
in the division of behavior into “appetitive” and “consummatory”—the 
former refers to the variable initial “searching” phase of a behavior 
sequence, the latter to the final, more or less stereotyped and relatively 
simple activity [Lorenz, e.g., 128; Tinbergen, 216]. This distinction, 
first made by Sherrington [194] and Craig [33], is not absolute but one 
of degree, and many responses do not fit easily into either category [76]. 
Indeed, since the behavior of animals has developed by innumerable 
routes in adaptation to innumerable environments, we must expect any 
dichotomous classification of this sort to mark the extremes of a con- 
tinuum—absolute dichotomies will be exceptional. It must also be noted 
that a form of behavior is labeled as appetitive or consummatory by its 
properties as a piece of behavior: there is no necessary implication that 
all examples of appetitive behavior have similar causal mechanisms [cf.
Lehrman, 115]. Nevertheless, the distinction between appetitive and
consummatory behavior is a useful one. 

Each type of appetitive behavior shown by an animal has three 
characteristics which may be a help in labeling it—the motor pattern(s),
the orientation component, and the stimuli to which the animal is par- 
ticularly responsive while showing the behavior. Thus a feeding Great 
Tit (Parus major) may hop (motor pattern) over the ground (orienta- 
tion component) “looking for” (to use a convenient shorthand without 
teleological implications) beechmast (stimuli to which it is particularly
responsive) [76]. In different kinds of appetitive behavior each of these
characteristics may vary independently of the others—e.g., a nest-build- 
ing Great Tit may hop over the ground “looking for” a piece of moss. 
There are, therefore, grounds for believing that they depend on more or 
less independent mechanisms. 

The nature of these three characteristics has not yet been discussed 
in detail. The “motor pattern” can usually be described fairly easily, but 
if it consists primarily of locomotion it may give no indications of the 
nature of the behavior unless other patterns (e.g., snarling and other 
facial expressions, crest-raising) are superimposed. “Orientation com- 
ponent” usually involves a spatial relation to environmental objects, and 
may be an important characteristic of the behavior. Of the three, “stim- 
uli to which it is particularly responsive” is the most difficult to investi- 
gate but perhaps the most fundamental, for it is by this that most forms 
of appetitive behavior are ultimately categorized.* Thus if a Great Tit,
which had previously been pecking at nest-material, begins to peck at 
food and ignores the nest-material, this is an indication that it is now 
showing a new type of appetitive behavior. Of course, the use of this type 
of characteristic involves observation of the behavior over a considerable 
time span, and “natural experiments” of the type indicated above often 
have to serve instead of precise tests of responsiveness. Further, the cri- 
terion is relative—the animal is “particularly responsive.” Whatever a 
Great Tit is looking for, it will nearly always respond to a flying hawk, 
but it is not necessarily useful to say that Great Tits are nearly always 
showing “predator appetitive behavior.” This example shows that a 
classification of this type, though often valuable, cannot be absolute. 

The same three characteristics can also be identified in many consummatory
acts. Thus Lorenz and Tinbergen [130], in an early paper,
identified distinct motor and orientation components in a response—the 
“egg-rolling” of the Grey Lag Goose (Anser anser)—which is near the
consummatory-act end of the appetitive behavior-consummatory act con- 
tinuum. The two are distinguishable in that the former (the Fixed Action 
Pattern, FAP, or Erbkoordination)† is elicited but not guided by external
stimuli—and, once started, always goes through to completion—whereas 
the orientation or “taxis” component depends continuously on environ- 
mental stimuli. Further, with much consummatory behavior it is possible 
to identify “consummatory stimuli” [e.g., “food in stomach,” “eggs 
against brood patches”—see 16, 78, 207, 209] which are responsible for
the subsequent fall in motivation and may be compared with the “stimuli
to which the animal is particularly responsive” when showing appetitive
behavior. It must be emphasized, however, that the identification of these
characters in some types of consummatory behavior does not mean that
they can be distinguished in all. There has been a tendency to regard the
egg-rolling response as a type of all consummatory behavior, but this is
not justifiable. Often, for instance, the stimuli eliciting the Fixed Action
Pattern are the same as those producing the orientation; and many
consummatory acts, such as swallowing and ejaculation, lack an orientation
component. Similarly, all cases of a fall in response strength are not
due to the perception of new consummatory stimuli—they may be due to
the disappearance of the eliciting stimuli, habituation, and so on.

* It is for this reason that it is often convenient to label appetitive behavior
by the behavior which usually follows it. This is not teleology.

† The terms Instinkthandlung and Instinktbewegung were formerly both used
in this context. The former (= instinctive activity) is now often used to refer to
the combination of FAP and taxis component.

Although the distinction between Fixed Action Pattern and orienta- 
tion component is not absolute, the isolation of FAPs led to important 
advances in ethology, for here is a unit which can be easily recognized 
and studied. Although relatively complicated, consisting usually of a 
temporal pattern of muscular contraction, an FAP cannot be split up 
into successive responses depending on qualitatively different external 
stimuli: in other words, it is not a chain reaction [213]. Since FAPs are 
relatively stereotyped and characteristic of the species, they have provided 
excellent material for phylogenetic studies. Lorenz [e.g., 128] has also 
emphasized their apparent spontaneity (see p. 583) and the way in which 
their performance may act as a reinforcing agent in learning. 

In a number of cases FAPs have been shown to be “inborn.” Since 
the way in which ethologists have used the term “inborn” has caused 
some confusion recently, elaboration is necessary. 

The appearance of any character of behavior depends on a number 
of factors including (a) the genetic constitution of the individual and 
(b) his past environment. The terms “inborn” or “innate,” as Spurway 
[200] has pointed out, should strictly speaking be reserved for differences 
between characters, not for the characters themselves. Thus, in the ideal 
case, differences in behavior between two birds reared in identical en- 
vironments can be said to depend ultimately on inborn differences. 

In practice, ethologists have often used “inborn” to mean not 
learned—i.e., not influenced by conditioning, imitation, or those other 
processes commonly grouped together as learning. This of course is 
etymologically inaccurate, and “not learned” would be a better term. 
Although the behavior differences between two animals brought up in 
identical environments must depend ultimately on inborn differences, the 
behavior itself may be “learned.” A valuable critical discussion of these 
terms is given by Lehrman [115] and Schneirla [188; see also, 70, 136]. 

The terms “inborn” and “not learned” have been applied both to 
motor patterns and to the relations between motor patterns and stimuli.
In the former case “practice” sometimes occurs before the movement 
appears in its final form, but the direction of the “improvement” does 
not depend on external reward: ethologists have sometimes grouped 
such cases together with those in which there is no practice. When “not 
learned” and “inborn” refer to the connection between motor pattern and 
stimulus they do not preclude the possible importance of learning proc- 
esses which do not involve the specific behavior in question. Thus ca- 
naries with no experience of nest-building or nest material, if provided 
with material at ten months, will pick it up within a few seconds and 
carry it to a possible nest site [Hinde, unpublished observation]. This 
does not mean that learning plays no part in the development of (e.g.) 
the sensorimotor skills involved: what is significant is that the canary 
picks up the material and does not (e.g.) court it; and that it carries it 
to the nest site, not to the center of the floor. 

There are obvious disadvantages in defining “not learned” behavior by 
exclusion, especially when there are no clear limits to the excluded group, 
and there are no a priori reasons for thinking that the group thus de- 
fined is homogeneous [cf. 19]. Further, since the ontogeny of behavior is 
invariably complex, such a dichotomous classification inevitably leads to 
difficulties. Nevertheless, the importance of behavior which has not been
influenced by specific learning processes is immediately apparent to any- 
one familiar with animals: the extent to which examples of such be- 
havior share common properties is being elucidated by research. 

Finally it must be emphasized that the real distinction comes not 
between learned and unlearned behavior, but between learning and the 
other factors that influence behavior. In fact the development of be- 
havior always involves a complex interaction of genetic and environ- 
mental factors, into which learning may or may not enter [see also 83, 
222]. 

One point, relating properly to an earlier stage in the discussion, can 
conveniently be made here. Before a classification of behavior can be 
made, the size of the units must be determined. How far is the analysis to 
proceed before the resulting portions are classified? In the case of consummatory
behavior, the isolation of the Fixed Action Pattern provides
a convenient practical answer to this, for the movement is relatively
stereotyped and cannot be analyzed into parts which are elicited by
stimulus situations different from those which elicit the whole.

Subsequently more detailed analyses may, of course, subdivide the 
FAP of consummatory behavior into smaller units, such as “acts”
[180] or “components” [75, 79, 143]. Russell et al. [180] define an
“act” as “a set of observable activities in different effectors (e.g., muscle
tensions), regularly observed in combination (and thus not analyzable
into separately occurring components . . . ), hence recognizably different
from other such acts observed in the same species”: different acts
may be elicited by the same external stimulus situation, and may in fact 
constitute stages in intensity of the same response* (super act, in their
terminology†).

With appetitive behavior the problem is more difficult, for alterna- 
tive motor patterns and orientations may be used. Indeed, few types of 
appetitive behavior have so far been studied in detail by ethologists and 
none is known so well as that shown by a rat in a maze. [But see 140,
233.] However, a sequence of appetitive behavior can often be divided 
into a series of links, each with a single set of consummatory stimuli (i.e., 
in each “link” the animal is responsive to a particular set of stimuli). 
Each such link can legitimately be treated as a unit, in spite of the (lim- 
ited) variability of the motor behavior (which is, of course, itself a major 
problem), because, like the fixed action pattern of consummatory be- 
havior, it has a set of eliciting causal factors specific to it. In other words, 
the sequence of appetitive behavior as a whole is a chain response, each 
link having its own eliciting (and often consummatory) stimuli [76]. 
Often the consummatory stimuli of one link seem to be the eliciting 
stimuli of the next. 

“Reflexes.” The difference between “instinctive” behavior and “re- 
flexes” has often been emphasized in order to underline the importance 
of changes in threshold of some instinctive acts in contrast with reflexes. 
This distinction has, however, caused some confusion because there is 
no agreement between those ethologists [e.g., 216, 207] who restrict the 
term reflex to simple S-R sequences in which changes in threshold are 
insignificant, holding that it becomes valueless if used in a wider sense, 
and other workers who include all S-R sequences as reflexes, to be in- 
vestigated by the methods of “reflexology” [e.g., 99]. 

Further confusion has arisen because some ethologists have contrasted 
“reflex” (in their sense) with “response” (signifying S-R sequences of 
greater complexity in which changes of threshold are important); 
whereas for other workers the “response” forms part of the reflex, and 
cannot be contrasted with it. 

* The terms “response” and “reaction” are used by some ethologists in an extremely
broad sense—in fact almost any change in behavior or responsiveness (i.e.,
motor pattern, orientation, or readiness to respond) may be so described. Thus
Tinbergen [214] states that a reaction “may be a special motor response in one
case, a change in readiness to react with a special motor response in another case,
or even a change in readiness to respond with a whole pattern of functionally
related movements in many other cases.”

† Although the “act,” in the same sense indicated by these authors, may turn
out to be a useful unit of behavior, the employment of this particular term may
lead to confusion. The term “consummatory act,” which has been widely used by
ethologists, usually refers to a “super act” in the terminology of these authors, and
may contain a number of “acts.”

The difference between instinctive and reflex behavior is, as Tinbergen
[216] and Thorpe [207] imply, of course only one of degree.


Causal Classification 


The various types of behavior shown by an animal can be classified 
according to the causal factors on which they depend, activities which 
share causal factors being classified together. Thus all activities whose 
frequency or intensity is significantly increased by male sex hormone 
can be grouped together as male sexual behavior. All activities simi- 
larly influenced by the stimulus situation “rival male” can be described 
as “agonistic behavior” [cf. 189]. This type of classification is essential 
for an understanding of the behavior of the whole animal, and it works 
well in practice. There are, however, some difficulties. Two methods are 
used to assess whether patterns of behavior share causal factors. One is to 
administer the factor in question and see whether the patterns in question 
both increase in frequency. The other is to investigate the temporal cor- 
relations between the appearances of various activities [11, 95]. In either 
case, a positive result does not necessarily mean a direct effect of any 
factor on both patterns [e.g., 3]. Further, the nonoccurrence of a pattern 
after administration of the factor does not mean that the factor is without
influence on the behavior in question—additional factors may be
necessary before the behavior appears.

The term “intention movement,” used for movements which are in- 
complete by comparison with other examples of the “same” (see p. 569) 
response, may be included here, since it may be inferred that the in- 
completeness is due to a deficiency in the “internal” (central or periph- 
eral) or external causal factors, or to the presence of factors for incom- 
patible behavior. This term, however, though extremely valuable in 
particular cases, has never been given a satisfactory general definition 
[38; see also p. 595]. 


Functional Classification 


Activities may be classified according to the functions which they 
serve—e.g., “threat,” “courtship,” “hunting,” and so on.* Two rather
different types of criteria are, however, used here. If interest is centered 
on the evolution of behavior, the statement that a type of behavior has 
a given function implies that the behavior is maintained in the repertoire 
of the species by selection acting through a consequence related to the
function in question (e.g., copulation has a function related to reproduction).
On the other hand, if we are interested primarily in the function
itself, we may argue that a pattern of behavior has a given function
merely because it contributes to a certain biological end. For instance, if
we were interested in the behavior patterns of passerine birds which function
in reducing disease, we might include territorial behavior which, by
spacing the birds out, probably helps to prevent the spread of epidemics.
But the territories of many species are so large that it seems improbable
that a small change in territory size would make any significant difference
to disease, and so selection cannot be acting through disease to maintain
territorial behavior in its present form [81].

* Such terms correspond to words such as “legs,” “eyes,” etc., in morphology.
A structure is classed as a leg because of its function, and there is no implication
that the legs of, e.g., arthropods and vertebrates are related either embryologically,
i.e., causally, or phylogenetically. A similar structure which has a different function
is given a different name, e.g., maxilliped.

In practice, when any one species is being considered, a functional
classification of behavior very often corresponds to a causal one: con- 
sidered from the point of view of evolution, this is not hard to under- 
stand—the mechanisms which underlie adaptive behavior need be far less 
complex if functionally related activities share causal factors than if the 
various causal factors are distributed at random among the functional 
groups. It is for this reason that ethologists often use functional-sounding 
words, e.g., reproductive behavior, parental behavior, when they are 
referring to a category which they believe to be related in terms of causal 
factors. Often the functional classification is a good guide to the causal. 
This is not always the case, however, and in many animals it may be 
quite unjustifiable to talk about a “reproductive instinct” if instinct is 
used in the causal sense, for sexual and parental behavior may depend 
on quite different causal factors [76]. Causal analysis must precede 
causal classification if functional and causal categories are not to be 
confused. Further, as Tinbergen [218, 220, 221] has shown, the same 
functional category, courtship, may rest on quite different causal bases 
in different animals; and great care is necessary in generalizing from
species to species. 

Functional categories are often used to include behavior patterns 
from unrelated species. This is quite justifiable just so long as the categories
are used only in discussions of function. Care is necessary, however,
for almost identical behavior elements may have dissimilar functions
in different species.


Historical Classification 

Two quite distinct systems must be recognized: 
1. Classification according to source. Here patterns of behavior believed
to have a common historical origin are grouped together. This is
much used in the study of the evolution of behavior—especially the
evolution of Fixed Action Patterns. The basic criterion for grouping together
is then similarity in the pattern of muscular contraction, but as
in the systematics of organisms, due allowance must be made for possibilities
of convergence, divergence, etc. [83].

2. Classification according to method of acquisition. Here behavior 
classes are grouped not according to their historical source but according 
to the nature of intervening changes which have occurred—for instance, 
“learned,” “ritualized.” 


Finally, mention must be made of categories of behavior based on 
descriptive criteria other than descriptions of movement patterns. Such 
groupings have no implications about causation, function, or evolution. 
The division of behavior into appetitive and consummatory (see p. 572) 
is a classification of this type. Kühn’s classification of orienting mecha- 
nisms is another—applicable to a wide variety of organisms, it is based 
mainly on descriptive characters, and must not be interpreted as having 
any deeper significance. Many other terms applying to a wide variety of 
species are also of this nature—“territorial behavior,” “migratory behavior,”
and so on. “Intention movement” when used to refer to a movement
which is incomplete by comparison with another, otherwise similar,
movement is also of this type (see p. 577). 


The Independence of the Classificatory Systems 


Although the classificatory systems discussed in the previous section 
are basically independent of each other, possibilities of confusion con- 
stantly arise. Perhaps the best known examples have arisen through mis- 
use of the term “instinct,” which is used sometimes for a functional and 
sometimes for a causal category of behavior, sometimes for a mechanism 
and sometimes for factors controlling that mechanism. As Kortlandt 
[106] has recently pointed out, Lloyd Morgan’s classification of instincts 
involves such a confusion, the top-level instincts being functional cate-
gories, the rest carrying implications of a causal nature.

Indeed because, for reasons we have already seen, functional and 
causal categories often overlap, the temptation to confuse them is often 
severe. This is particularly dangerous when the functional categories are 
nothing more than descriptive generalizations. Reference to “self- 
preservative” as opposed to “reproductive” instincts, for instance, both 
confuses functional and causal categories and implies something causally 
in common between, for example, feeding and preening. 

Another cause of confusion is the assumption that members of a 
descriptive class also share common characteristics of other types. De- 
scriptive categories are often useful tools for research even when they 
carry no references to other characteristics. Kühn’s classification of taxes, 
for instance, is primarily (though not exclusively) descriptive, and to
argue that such “a preliminary classification has heuristic value only if
the members of a given class are thought to be representative of similar
dynamic processes which can be investigated” [115] merely reveals a 
misunderstanding of the classification employed. 


NATURE OF THE SYSTEM 


General 


Ethologists have not all concentrated on a precise analysis of the 
behavior of only one species in one particular situation, but have tackled 
a variety of problems posed by animals as they found them in nature. 
This has required flexibility in the concepts used. The optimum balance 
between rigidity and plasticity of concepts changes with the diversity of 
phenomena studied and with the stage of the analysis reached. Fre- 
quently, some plasticity is still desirable, for an attempt at overrigid for- 
malization would both narrow and stultify research. As the subject de- 
velops, some lines require more precisely defined concepts; in their new 
form these may then become valueless in other contexts where analysis 
has not proceeded so far, but they may nevertheless remain related to 
their forerunners of broader application. 

So far, ethologists have attempted precise quantification at only a 
few points—though the number is now rapidly increasing. For the most 
part only tentative suggestions about “what affects what,” qualified by 
indications of the direction of the effect, have been made. 

The terms “independent,” “intervening,” and “dependent” variables 
have seldom been used in the work discussed in this chapter, but the
variables which have been used can conveniently be discussed under
these headings. 


Dependent Variables 


Although the behavior studied by ethologists varies greatly in com- 
plexity, it is generally at the level of responses of the whole animal. For 
instance, in analyzing the stimuli which elicit the pecking response of 
young Herring Gulls (Larus argentatus) Tinbergen and Perdeck [227] 
used as dependent variable number of pecks per unit of time. 

Studies such as this, where the behavior observed consists of a rela- 
tively stereotyped pattern, differ from experiments involving lever press- 
ing and similar tasks. Although each peck is, of course, different from 
every other peck in at least some respects, these differences lie mainly 
in the orientation components; other features are nearly constant from 
one response to the next. When the response is distinguished in terms of
achievement, e.g., lever pressed down, it need have no such constant
features [cf. 131]. 


Some Recent Trends in Ethology 581 


Of course, this does not imply that all ethologists aiming at some 
degree of quantitative precision have used a relatively stereotyped re- 
sponse—what is measured must depend on the nature of the case. In 
experiments on the orientation of homing birds, for instance, the mean 
deviation of the bird’s course from the homing direction may be used 
[e.g., 146]. Because of the diversity of the behavior studied, it is not 
possible to generalize further about the dependent variables used by 
ethologists: some further examples are given below. 
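
The homing measure mentioned above can be illustrated with a minimal sketch. It assumes that courses are recorded as compass bearings in degrees and that "mean deviation" means the mean unsigned angle between each course and the homing direction; the cited study may have computed it differently, and the function names and bearings below are hypothetical.

```python
def angular_deviation(course_deg, home_deg):
    """Smallest unsigned angle, in degrees, between a course and the homing direction."""
    diff = abs(course_deg - home_deg) % 360.0
    return min(diff, 360.0 - diff)


def mean_deviation(courses_deg, home_deg):
    """Mean unsigned deviation of a set of observed courses from the homing direction."""
    if not courses_deg:
        raise ValueError("at least one observed course is required")
    total = sum(angular_deviation(c, home_deg) for c in courses_deg)
    return total / len(courses_deg)


# Hypothetical bearings for four birds whose home lies due north (0 degrees).
courses = [350.0, 10.0, 40.0, 320.0]
print(mean_deviation(courses, 0.0))
```

Taking the shorter way round the circle keeps a course of 350 degrees from being scored as 350 degrees away from a home bearing of 0.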


Independent Variables 


The following are the principal types of independent variables which 
have been manipulated in ethological studies: 

1. Stimuli 

Verplanck [231] has pointed out that the term “stimulus” has been 
used in at least four senses by psychologists: 


Sense I. For a part, or a change in a part, of the environment. 

Sense II. “Any form of energy which elicits a response”—the kind
of response, organism and prevailing conditions being unspecified. 

Sense III. A class of environmental events which control a specified 
activity of the organism according to specified laws. 

Sense IV. As II, but including hypothetical or inferential classes of 
physical events (usually intra-organismic). 


Ethologists have used stimulus in all these ways. For instance: 


Sense I. “We want to know what stimuli the sense organs of a given 
animal can receive” [216]. 

Sense II. “Motion may be a powerful stimulus.” 

Sense III. The term “sign stimulus” is always used in this way. Thus 
Tinbergen, after discussing a number of examples showing that 
“As a rule, an instinctive reaction responds to only very few stim-
uli, and the greater part of the environment has little or no in-
fluence . . . ” [216, p. 27] says “Russell . . . has called these
essential stimuli ‘sign stimuli’ ” [216, pp. 36, 172]. Thorpe [205]
defines “releaser” [which in his usage is synonymous with sign
stimulus, but see p. 58] as “Any specific feature or complex of
features in a situation eliciting an instinctive activity or mood.”

Sense IV. “That external stimuli sent out by some object can evoke 
definite instincts or activities in an animal, nobody will call in 
question; that internal stimuli are also playing a role in arousing 
these behaviour elements appears from the often observed fact that
an animal which has not performed a certain activity for a long
time, because it did not encounter an object adequate for pro-
ducing a reaction to react to, at the end may show this activity
towards a completely inadequate object” [10].


That a single term may be used for a number of different categories 
in this way may be unfortunate, but it has not in practice led to any 
difficulties. 

Direct quantitative treatment of stimulus variables is difficult, and 
has been possible in only a few cases where size, distance, or some sim- 
ilar property was concerned [e.g., 13, 95, 226]. The very nature of the 
stimuli to which animals respond is often incompatible with precise 
measurement, and qualitative descriptions such as “red belly underneath” 
are necessary. Stimulus situations of this type can at present be evaluated 
only in terms of the behavior they evoke. 

2. Among other independent variables which have been manipulated 
in ethological studies are: (a) previous experience, (b) hormone dosage, 
(c) individual or specific characteristics, (d) age and/or phase of life 
cycle. 


Intervening Variables 


A number of terms used in ethological literature refer to intervening 
variables. In only a few cases have formal definitions of these terms been 
attempted, and their meaning is indicated either by informal exposition 
or by usage. They can be subdivided into two groups: 


1. Terms employed in the hydraulic models previously used by 
Lorenz and Tinbergen. 

2. “Hypothetical constructs” [in the sense of MacCorquodale and
Meehl, 131] describing hypothesized functional parts or aspects 
of the nervous system. The extent of the neurophysiological impli- 
cations carried by such concepts varies; in some cases, they are 
almost absent. 


Terms used in hydraulic models: the problem of spontaneity. The 
“hydraulic” models used by Lorenz and Tinbergen in discussing prob-
lems of motivation were mentioned on p. 567. Lorenz’s model consisted
of a “reservoir” of “reaction specific energy” which flowed away in
action and was subsequently replenished from an unspecified source.
Tinbergen [e.g., 216] has used a basically similar model in speaking of
“motivational impulses” which flow from one “nervous centre” to
another, and are eventually expended in action. (The concept of reaction 
specific energy is similar in some respects to Skinner’s “reflex reserve.”) 

Though these models have never, as Kennedy [99] suggests, formed 
the “central idea” of ethology, they have often been useful in providing 
a “working hypothesis of a type that helps put one’s thoughts in order” 
[216]. Their present status has been discussed elsewhere [78, 80; see also
p. 567]; they belong to an early phase of research, and as analysis pro- 
ceeds, they have become less useful. 

One concept which must be mentioned here is “spontaneity.” This 
term has been used in at least two different ways by ethologists, and 
some confusion has resulted. Lorenz, struck by the extent to which many 
inborn behavior patterns are independent of external stimulation, postu- 
lated “the endogenous generation of motor stimuli” (roughly correspond- 
ing to the accumulation of reaction specific energy in his model). It is 
to this supposed “endogenous generation” that the term “spontaneity” 
sometimes refers. 

Tinbergen, however, uses “spontaneity” in a rather different way— 
namely, to mean “without apparent external stimulus.” He lists three 
types of internal factors which may play a part in the causation of be- 
havior: (a) hormones, (b) internal sensory stimuli, and (c) intrinsic 
central nervous factors. It is to (c) only that Lorenz refers when discuss- 
ing spontaneity. In a 1952 paper [217] Tinbergen distinguishes clearly 
between these two uses of the term. “ ‘Spontaneity’ of the animal as a 
whole, meaning behaviour independent of factors external to the animal, 
is an established fact; ‘spontaneity’ of the central nervous system, a part 
of the animal, has not been proved, though it seems probable.” Tin- 
bergen also emphasizes the important distinction between the initiation 
and the coordination or patterning of movements: of course one may 
depend on external stimuli while the other does not. 

By “intrinsic nervous factors” Tinbergen presumably means changes 
in the central nervous state which occur independently of a change in 
the extraneural states and stimuli, and which lead to a change in re- 
sponsiveness. The existence or importance of such factors cannot be 
discussed here [see 11, 172 for recent reviews]. 

Other hypothetical constructs. Since ethologists are interested not 
only in the prediction and control of behavior, but also in the physiologi-
cal mechanisms involved, a number of ethological terms carry neuro-
physiological implications.

Ethologists differ in the degree of neurophysiological specificity which 
they impute to such constructs. As far as the immediate study of be- 
havior is concerned, they often need have none. But some neurophysio- 
logical implications are necessary if the concepts are to be useful as a 
convenient shorthand for discussing, for instance, the common features 
of the mechanisms mediating similar Fixed Action Patterns in related 
species, or for providing a point of reference for coordination with actual
or projected physiological studies. In practice such “existence postulates”
—to borrow a phrase from MacCorquodale and Meehl [131]—have
sometimes been introduced unnecessarily and surreptitiously, with a con-
sequent reduction in the utility of the concepts [80]. In any case, all these
concepts are highly provisional, and no claim is made except that they 
are useful at the present stage in research. 

Motor coordination mechanism. Any recurring pattern of behavior 
must depend on certain constant features of anatomy or physiology. 
Such features need not be anatomically localized, or even have a constant 
diffuse location—they could involve specific patterns of activity. Thus 
for each “Fixed Action Pattern,” a “motor coordination mechanism” 
can be postulated. This concept has proved useful, especially in compara- 
tive studies, e.g., in discussions of the nature of ritualization. In causal 
contexts it represents an entity which has so far proved rather elusive. 

Innate Releasing Mechanism. IRM corresponds to “das Schema” 
[von Uexküll, 229], “angeborener auslösender Mechanismus” [Lorenz,
e.g., 121]. This construct was first introduced to account for the observa- 
tions that (a) animals will often respond to a particular stimulus situa- 
tion with a specific group of responses, even though they have never been 
confronted with the situation before; and (b) in such cases, the animal 
responds to only a very limited part of the total stimulus situation. Thus 
Tinbergen and Perdeck [227] describe the IRM as a mechanism which 
is “selectively responsive to a special stimulus situation.” 

To some extent, of course, this is true of all behavior. Hick [quoted 
by Broadbent, 25] has shown that even the human nervous system is
simply incapable of dealing with all the information reaching it through 
its sense organs, so that selection must be continuous. The matter is, how- 
ever, raised in a particularly acute form by many animal experiments. 
Thus Lack [112, 113] has shown that a territory-holding Robin (Eri- 
thacus rubecula) will attack or threaten a bunch of red breast feathers 
more readily than a whole stuffed juvenile Robin, which resembles the 
adult in all features except that it lacks the red breast. The red breast is 
more important than all other morphological characters together. Many 
similar cases are cited by Tinbergen [214, 216], and lead to the impor- 
tant concept of “sign stimuli” [178], or essential features of an object 
or situation which elicit a particular pattern of inborn behavior. It seems 
that “unlearned” [see p. 57] behavior is characteristically elicited by
only a few sign stimuli—though later conditioning may, of course, com- 
plicate the releasing situation.” 

The concept of the IRM, however, involves a number of difficulties: 


"The term “releaser” is used primarily for organs which have undergone 
evolutionary adaptation to a signal function. Lorenz uses it for structures used in
intraspecific communication, but Baerends [10] extended its use to any organ or
structure “especially serving to send out sign stimuli,” regardless of whether actor
and reactor belong to the same species. Strictly speaking, therefore, all objects
which are instrumental in releasing responses are not “releasers.” The use of the
term releaser has implications about both function and evolutionary history.
1. The term itself is an unfortunate one because (a) releasing mech-
anisms are not necessarily innate [e.g., 209], (b) stimuli responded to
selectively may not merely “release” (see p. 592), and (c) the mecha- 
nism of selective response to stimuli is unlikely to be a unitary one, but 
complex and diffuse.*° 

2. The usual method of studying the sign stimuli eliciting a re- 
sponse is to vary characteristics of the situation which normally elicits 
the response and record the response strength in each case. If change in 
a particular characteristic produces a change in response strength, then 
that characteristic is relevant to the response. It has, however, often also 
been assumed that there must be a mechanism, peculiar to the response, 
selectively responsive to that feature. This, of course, is not necessarily 
the case: responsiveness to objects which contrast with their backgrounds 
is likely to be greater than that to objects which do not for all responses 
mediated by vision—responsiveness to contrast is not a property of an 
IRM specific to any particular response but of the perceptory mecha- 
nisms used in all responses. Properties of sense organs and perceptory 
mechanisms will influence the effectiveness of all stimulus situations inde- 
pendently of the response being studied. This criticism of the IRM con- 
cept, raised by Lehrman [117], is an important one. In reply, Tinbergen 
suggested that “we apply the term (I)R.M. to those cases where a 
stimulus situation has been shown to be typical of one response and not 
of others.” This, however, has rarely been done in the past. 

3. The concept of the IRM has been closely associated with the 
Lorenz-Tinbergen hydraulic models (p. 582), as a result of which it has 
acquired a number of properties which are incompatible with observa- 
tional data. This has been discussed elsewhere [80, 170]. 
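
The point made under difficulty 2 can be sketched in code. The sketch assumes that response strength has been recorded for several responses, both in the normal situation and with one characteristic altered at a time; the data layout, the 20 per cent threshold, and all names and figures are hypothetical. It sorts each characteristic by how many of the tested responses it affects, which is the kind of evidence Tinbergen's proposed criterion would call for.

```python
def affected_responses(strengths, baseline="normal", min_change=0.2):
    """Map each altered characteristic to the set of responses it changes.

    strengths: {response: {variant: strength}}; every variant other than
    `baseline` is the normal situation with one characteristic altered.
    A response counts as changed when its strength differs from baseline
    by at least `min_change` (as a fraction of the baseline strength).
    """
    affected = {}
    for response, by_variant in strengths.items():
        base = by_variant[baseline]
        for variant, value in by_variant.items():
            if variant == baseline:
                continue
            affected.setdefault(variant, set())
            if abs(value - base) >= min_change * base:
                affected[variant].add(response)
    return affected


def classify(affected, n_responses):
    """Label each characteristic by how many of the tested responses it affects."""
    labels = {}
    for variant, responses in affected.items():
        if not responses:
            labels[variant] = "no effect shown"
        elif len(responses) == 1:
            labels[variant] = "typical of one response"
        elif len(responses) == n_responses:
            labels[variant] = "affects all responses tested"
        else:
            labels[variant] = "affects several responses"
    return labels


# Hypothetical response strengths for two responses to the same object.
strengths = {
    "attack":    {"normal": 10.0, "no_red": 2.0, "low_contrast": 5.0, "no_tail": 10.0},
    "courtship": {"normal": 8.0,  "no_red": 8.0, "low_contrast": 4.0, "no_tail": 8.0},
}
labels = classify(affected_responses(strengths), n_responses=len(strengths))
print(labels)
```

In this invented example only the red colouring would qualify under the criterion; lowered contrast depresses both responses and so points to the perceptory mechanisms rather than to a mechanism peculiar to either response.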

Finally, it must be emphasized that the concept of the IRM is bas- 
ically a descriptive one. Reference to IRMs in unrelated species merely 
implies the presence of descriptive similarities in their behavior and not 
in the underlying mechanisms.

Drive. This has been used in at least three ways by ethologists: 

Drive I. Refers to a central nervous state caused by stimuli, hormones, 
etc., and itself causing the animal to behave in a particular way [e.g., 
216, 218]. 

Drive II. Refers to the internal causal factors leading to a given 
behavior—that is, to all causal factors other than those received through 
exteroceptors [157].

Drive III. Refers to all the causal factors influencing behavior,
whether internal or external. Thus Thorpe [205] defines drive as “the
complex of internal and external states and stimuli leading to a given
behaviour.” [Tinbergen, 222, uses “motivation” in this way.] One term
has thus come to be used in three different senses because of the different
interests of the authors concerned and the different stages which their
analyses have reached. Very little confusion has resulted so far.

*° Baerends [11] emphasizes that the releasing mechanism may include every-
thing between sense organ and motor coordinating mechanism and that the use
of the term IRM does not imply that the mechanism is either localized or unitary.

It will be noted that, in all three usages, the concept of drive as used 
hitherto involves no distinction between exciting and directing factors. 
All drives are regarded as specific to a more or less limited group of 
responses. Thus Tinbergen [218] writes that the “drive can be of dif- 
ferent kinds, each giving rise to a different behaviour pattern.” Similarly, 
in Tinbergen’s hierarchical scheme of nervous centers (p. 582) each 
causal factor (stimulus, hormone, etc.) can influence only a limited 
group of responses. It is possible that ethologists will require a concept 
closer to the learning theorists’ “general drive” to deal with phenomena 
such as displacement activities (see p. 593) and irrelevant drive phenom- 
ena [78]. Neurophysiological data also seem in favor of a move towards 
a “general drive” concept, e.g., the work on the reticular activating 
system of mammals [2] as well as experiments on lower vertebrates [23, 
170]. 

There are, however, other difficulties associated with all three usages 
of “drive,” and in practice the concept has often carried hidden and 
misleading “existence postulates” originating from the Lorenz-Tinbergen 
hydraulic models [80]. At present there is no agreement amongst etholo- 
gists as to how “drive” should be used. Other terms have been suggested 
for particular contexts, where they have rendered the use of “drive” un-
necessary. Thus “Specific Action Potential” (SAP) has been suggested as
a direct measure of responsiveness to a given stimulus [78]. Since the 
intensity of a response may be influenced in a number of ways, e.g.,
“motivational” factors, learning processes, presence of compatible or in- 
compatible response tendencies, etc., SAP depends on many variables. 
Specific Action Potential was originally suggested to replace Lorenz’s
“Reaction Specific Energy” [205]. Since the concept of RSE is obsolete,
SAP is unlikely to be of much use in that context and, as used here, has 
no connection with RSE or any similar construct. 

“Tendency” has been used in the preliminary analyses of agonistic
and courtship behavior (see p. 590) where causal factors for two or
more incompatible types of behavior are simultaneously present. An ani-
mal can be said to have a strong tendency to behave in a particular way,
although the SAP for that type of behavior may be low because of
another incompatible response tendency [79].

Baerends [9] uses “Trieb” to mean potential “Drang” (= “Stimmung” or
mood). “Drang ist aktivierte Trieb, Trieb ist potentielle Drang.” This use of Trieb
does not correspond to the present use of the English word “Drive” by ethologists,
though in some contexts it approaches the first of the three uses given above. It is
often synonymous with Tinbergen’s use of “Instinct.”

Mood. Another descriptive term which, in some senses, comes close to 
drive. 

Tinbergen [213] describes “mood” (“Stimmung”) as a word “which 
does not pretend to be more than a convenient descriptive term to denote 
the ‘psychological condition’ of the animal”; and later [214] as “a readi-
ness to respond with one group of functionally related motor patterns.”
Lorenz [128] defines it as the animal’s “present internal state of specific 
readiness for certain activities.” Thorpe’s [205] definition refers to a 
“state” of “readiness for action.” Baerends [9] emphasizes that “Stim- 
mung” refers to the state of activity of a group of mechanisms. 

Recently this term has been avoided by ethologists because its sub- 
jective associations in everyday speech have given rise to frequent mis- 
understandings. As it has often been used in practice to refer particularly 
to the animal’s responsiveness to stimuli (e.g., “in nest building mood” 
meant the animal was particularly likely to respond to nest-material if 
it encountered any), it may come to be useful when the relationship 
between susceptibility to stimuli and strength of response once released 
has been worked out in more detail. 

Construction of function forms. It has already been pointed out that 
ethologists, engaged in studying the diverse patterns of behavior shown 
by widely differing species, have not yet attempted to specify precise rela- 
tions between variables in even one case, let alone to make generalizations 
applicable to many species. 

Mensurational and quantificational procedures. No mensurational 
or quantificational procedures are presupposed. 

In the case of independent variables, it is accepted that the great 
majority of the stimuli which are important in nature cannot (at pres- 
ent) be measured against a physical scale; stimuli differing in physically 
measurable dimensions have been used in only a few studies (see p. 582). 

On the dependent variable side, the method of measurement must be 
adapted to suit the particular problem in hand. Three types of procedure 
can be recognized:

1. Qualitative judgments of the “more intense, less intense” type. 
These are often sufficient for the initial analysis of complex types of be-
havior into simpler types (see p. 590).

2. Quantal intensity [e.g., 125, 128, 180]. Many responses involve 
more and more “acts” (see p. 576) as the internal or external causal 
factors are increased. For instance, with increasing intensity, the mob- 
bing response given by Chaffinches to owls involves slight movements 
of the head, followed by lateral movements of the body and vertical 
movements of the tail; then these become more intense, the bird begins 
to call “chink, chink,” and finally the rate of calling increases [77]. It
is often possible to produce an arbitrary scale of increasing intensity of 
the response according to the components which are present. Although 
it is, of course, not possible to ensure that the various steps on the scale 
are of equal value, such a method is often sufficiently precise and reliable 
for many purposes. 

3. Sometimes some features of the response can be measured against 
a physical scale. At least five varieties of measure may be used—quanti- 
tative intensity, frequency (i.e., ratio of occurrence to number of trials), 
latency, number of occurrences in a fixed period, and duration in a fixed 
period. 
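
The quantal scale described under procedure 2 can be sketched as follows. This is a minimal illustration only: the component labels are hypothetical names for the stages of the Chaffinch mobbing response listed above, and the scale is treated as purely ordinal, since the steps cannot be assumed to be of equal value.

```python
# Components of the mobbing response in order of increasing intensity.
# The labels are illustrative names for the stages described in the text.
MOBBING_SCALE = [
    "head_movements",
    "body_and_tail_movements",
    "intensified_movements",
    "calling",
    "rapid_calling",
]


def quantal_intensity(observed, scale=MOBBING_SCALE):
    """Return the 1-based rank of the highest scale component observed, or 0 if none."""
    score = 0
    for rank, component in enumerate(scale, start=1):
        if component in observed:
            score = rank
    return score


# A bird showing head, body, and tail movements but not yet calling.
print(quantal_intensity({"head_movements", "body_and_tail_movements"}))
```

Because the ranks are arbitrary, scores obtained in this way are suited to comparisons of order (more intense, less intense) rather than to arithmetic on the differences between them.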

Russell et al. [180] point out that measures of intensity differ from 
the others in that there are as many separate measures as there are oc- 
currences of the act. The remaining measures (“tendency” measures) 
can all be reduced in principle to frequencies, and are related to the 
threshold of occurrence. They therefore suggest that only two behavior- 
ally measurable (dependent) variables are related to each act—Intensity 
and Tendency. Measures of frequency, duration, etc., are all to be func- 
tionally related to “Tendency.” These functional relations are varied, “so 
that it would be reckless at present to pay too much attention to, for 
instance, the types of curves we can fit to the relation between any of 
these measures and a given controlled variable, but quite reasonable to 
use any of them in the attempt to determine whether change in a given 
controlled variable has any influence on the Tendency of a particular 
Act.” Unfortunately these authors offer no experimental evidence in
support of their hypotheses. 
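
The "tendency" measures listed under procedure 3 can be computed from simple observation records, as in the following sketch. The record formats are assumptions made for illustration: each trial is stored as a latency in seconds, or None when the act did not occur, and each bout of the act as a (start, end) pair in seconds from the beginning of the observation period.

```python
def frequency(latencies):
    """Ratio of occurrences to number of trials; None marks a trial with no response."""
    if not latencies:
        raise ValueError("at least one trial is required")
    return sum(lat is not None for lat in latencies) / len(latencies)


def mean_latency(latencies):
    """Mean latency over the trials on which the act occurred, or None if it never did."""
    observed = [lat for lat in latencies if lat is not None]
    return sum(observed) / len(observed) if observed else None


def count_and_duration(bouts, period):
    """Number of bouts and total time spent in the act within a fixed period.

    Bouts that overlap the end of the period are counted, but only the part
    falling inside the period contributes to the duration.
    """
    count = 0
    total = 0.0
    for start, end in bouts:
        if start < period and end > 0.0:
            count += 1
            total += min(end, period) - max(start, 0.0)
    return count, total


# Hypothetical records: four trials, and three bouts watched for 60 seconds.
latencies = [2.0, None, 4.0, 6.0]
bouts = [(5.0, 10.0), (50.0, 70.0), (80.0, 90.0)]
print(frequency(latencies), mean_latency(latencies), count_and_duration(bouts, 60.0))
```

Intensity is left out deliberately: as the passage notes, it yields one value per occurrence of the act and is not reducible to a frequency.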

As yet few attempts to investigate the nature of the relations between 
different measures of the same response have been made outside the 
field of learning [21, 77]. More such studies are needed (a) in order 
that adequate measures of response strength can be gained, (b) because 
it is necessary to know if the various measures are affected differently or 
differentially by the independent variables, and (c) because such infor-
mation will give some indication of the complexity of the underlying
mechanisms.


ORGANIZATION OF ETHOLOGY


General


So far, for reasons we have already seen, no ethologist has attempted 
to build up a formal system. Research has rather been directed towards 
the elucidation of principles, usually stated informally, with clusters of 
research projects indicating the extent of their validity. Such principles
are embodied in informal statements, thus: 

“Dependence on only one or a few sign stimuli seems to be char- 
acteristic of innate responses” [216]. 

“The various sign stimuli required for the release of an instinctive 
activity co-operate according to the rule of heterogeneous summation”
[216; see p. 592]. 

“In the breeding season most birds defend territories from other in- 
dividuals of the same species and sex.” 

All such statements require a qualification defining their scope—thus 
the statement about bird territories is true for most male passerine birds 
but not, e.g., for the Emperor Penguin (Aptenodytes forsteri) [202]. The 
degree of generality of any such statement is, of course, a function of the 
quantitative specificity with which it is stated—if principles are to have 
any general application, only a limited degree of precision is desirable. 
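
The rule of heterogeneous summation quoted above lends itself to a short sketch. It assumes the simplest reading of the rule, namely that each sign stimulus adds a fixed amount to response strength regardless of which others are present; the stimulus names and weights are hypothetical and are not taken from any study.

```python
def summed_response(features_present, contributions):
    """Predicted response strength as the sum of the separate contributions
    of the sign stimuli present (a purely additive reading of the rule)."""
    return sum(contributions[feature] for feature in features_present)


# Hypothetical contributions of three sign stimuli to one response.
contributions = {"red_patch": 3.0, "movement": 2.0, "shape": 1.0}

complete = summed_response({"red_patch", "movement", "shape"}, contributions)
colour_only = summed_response({"red_patch"}, contributions)
motion_only = summed_response({"movement"}, contributions)
print(complete, colour_only + motion_only)
```

On this reading a dummy carrying only some of the sign stimuli should evoke a correspondingly weaker response, and the effects of two partial dummies should add up to that of a dummy carrying both features. Whether a given response behaves in this way is an empirical matter.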

The framework of ethological ideas can thus include both purely 
qualitative analyses of broad categories of behavior (e.g., the analysis 
of courtship behavior, see p. 590), where the results are likely to have 
a fairly general validity, and more detailed analyses of particular prob- 
lems, where the precise quantitative results are relevant only to the prob- 
lem in hand (though of course they may point the way to less precise 
but broader generalizations). Both types of approach are necessary for 
an understanding of animal behavior. 

History of ethology in mediating research. The scope and trends of 
ethological research can be seen only by considering the principal general 
topics with which it has been concerned. This has been attempted in 
the following paragraphs, but it must be noted that the selection of 
topics is necessarily an individual one—to some extent it reflects the 
personal interests of the present writer, and other ethologists might not 
agree that the studies chosen represent the growing points of current 
research. Further, those topics lying near (or within) the borders of 
physiology have not been considered. This involves the omission of many 
studies which have influenced ethologists (e.g., the work of von Holst, 
Gray, Lissmann, Weiss, and many others) as well as those in which 
ethological ideas have played an incidental role. Selection has also been 
in favor of European rather than American work, in order to include 
especially references with which the readers of this volume may not be 
familiar: this has involved omission of many writers (e.g., Aronson, 
Beach, Carpenter, Evans, Nice, Noble, Schneirla, and others), whose 
work is extremely pertinent to the subjects discussed. Further, except for 
a few classical studies, recent work has been included in preference to 
prewar work: in some cases, recent reviews have enabled previous de-
tailed studies to be omitted from the bibliography. The literature cited
is thus intended to be representative but by no means comprehensive.

Finally, references to some of the principles which are emerging 
are intended to show the direction research is taking, and not as formal 
statements. 

Studies of the whole or a large part of the behavior of a species. 
General studies of the behavior of a species under natural or near- 
natural conditions are essential both for the more detailed study of 
limited portions of its behavior repertoire and, of course, for under- 
standing the significance and relation of these parts with respect to the 
whole. Many such studies have been made by zoologists and naturalists
who would not call themselves ethologists. The following is a selection 
of studies in which an ethological approach (see p. 562) has been 
especially evident: ** arthropods [9, 34, 35, 52, 61, 96, 97], fish [12, 13, 
57, 95, 151, 153, 164, 190, 191, 236], amphibia and reptiles [42, 45, 
46, 107], birds, passerines [4, 8, 38, 63, 66, 74, 90, 112, 120, 126, 143, 
161, 211], other groups [30, 71, 72, 73, 91, 101, 102, 105, 114, 127, 
137, 138, 158, 197, 220, 230, 234, 235], and mammals [36, 41, 43, 
44, 47, 118, 163, 166, 184, 192].

Qualitative analyses of relatively complex types of behavior (e.g.,
aggression, courtship) into simpler components. This includes a number 
of studies in many of which it has been shown that: 

1. When behaving agonistically, the animal is under conflicting 
tendencies (see p. 586) to attack and flee from its rival. Sometimes one or 
another of these tendencies overrides the other and the animal actually 
attacks or flees. When the two tendencies are approximately equal, the 
animal may either alternate rapidly between the one and the other, show 
an ambivalent (“threat”) posture, or a displacement activity (see 
below). Where a species has a number of different threat postures, each 
is probably correlated with different strengths or relative strengths of 
the two tendencies. The following references deal with analyses of this 
type in fish [12, 95, 153, 155, 190, 191, 223] and birds [4, 74, 75, 79, 
143, 144, 152, 158, 159, 218, 220, 221]. 

2. Courtship. Similarly, courtship occurs when the animal is under 
the influence of conflicting tendencies, though here three are primarily 
involved—to attack, flee from, and behave sexually towards the mate. 
The nature of courtship “displays” depends on the strengths and relative 
strengths of the associated tendencies.

In both threat and courtship displays, particular components of the
displays (e.g., crest-raising, wing-drooping in birds) are often correlated
with the strength of one or other of the component tendencies. The
following studies deal with fish [12, 13, 151, 153, 154, 155, 164, 190,
191, 223] and birds [74, 75, 79, 93, 126, 143, 152, 158, 159, 161, 218,
220].

** Latin names of species and groups have been omitted from this section in the
interests of brevity, but may be found in the list of references.

3. Various complex displays to predators similarly depend on con- 
flicting tendencies [5, 7, 77, 143, 195]. 

This type of investigation thus results in an analysis of the behavior 
into “displays,” “postures,” etc., and of these into “components,” and a 
subsequent separation of the components into groups, each associated 
with a particular tendency (e.g., attacking, fleeing, etc.) and thus 
presumably depending on more or less distinct sets of causal factors 
[79, 221]. The more detailed investigation of these causal factors 
represents a further step in the analysis. The evidence at present avail- 
able indicates that the conclusions which are emerging in this field are 
widely applicable among fishes, birds, and, probably, mammals. The
courtship of other groups may, of course, rest on quite different bases 
[14, 171, 237]. 
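
The relation described in item 1 above, between the relative strengths of the attack and flee tendencies and the behavior actually shown, can be put as a short Python sketch. It is an illustration only: the numerical strengths, the function name `agonistic_outcome`, and the `margin` within which the two tendencies count as approximately equal are hypothetical and are not taken from the studies cited.

```python
def agonistic_outcome(attack: float, flee: float, margin: float = 0.2) -> str:
    """Classify the expected behavior from two tendency strengths.

    Strengths are in arbitrary units. `margin` is a hypothetical relative
    difference below which the tendencies count as approximately equal.
    """
    if attack < 0 or flee < 0:
        raise ValueError("tendency strengths must be non-negative")
    stronger = max(attack, flee)
    if stronger == 0:
        return "no agonistic behavior"
    if abs(attack - flee) / stronger <= margin:
        # Near balance: rapid alternation, an ambivalent (threat) posture,
        # or a displacement activity may be shown.
        return "conflict behavior"
    return "attack" if attack > flee else "flee"


if __name__ == "__main__":
    for pair in [(1.0, 0.2), (0.2, 1.0), (1.0, 0.9)]:
        print(pair, agonistic_outcome(*pair))
```

A species with several threat postures would need a finer classification than the single "conflict behavior" label used here, since each posture is probably correlated with different absolute or relative strengths of the two tendencies.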

Nature of the stimulus situation evoking each response. Of the total 
stimulus situation present at the time, only a very small proportion is 
relevant in the elicitation of a particular “unlearned” response. Such
stimuli are termed “sign stimuli.” For example, Tinbergen found that 
male Sticklebacks (Gasterosteus aculeatus) would attack very crude 
models which had the character “red belly underneath” but usually 
ignored models which, although closely similar to real sticklebacks, lacked 
this color. “The fish reacted essentially to the red and neglected the 
other characteristics” [164, 216]. Such sign stimuli are usually configura- 
tional, and the response to them not learned (p. 574). One consequence 
of this is that different responses normally given to the same object 
(e.g., courtship responses to the female) may be elicited by different 
characters of that object. (As defined in this way, the statement that a 
character is a sign stimulus for a particular response has not neces- 
sarily meant that it is specific to that response. Thus a color character 
which involves contrast with the normal background is likely to be 
effective in many different contexts—see p. 584.) The following refer- 
ences contain analyses of the stimulus situations eliciting particular re- 
sponses: parental behavior in Ammophila (a hunting wasp) [9], court- 
ship of Lepidoptera and Odonata [26, 133, 134, 135, 199, 226], hunting 
and courtship of spiders [40, 169], feeding in Honey-bees [59], fright 
responses of Minnows [60], schooling of fish [98], aggression and court- 
ship of Sticklebacks [164, 216], parental behavior of Sticklebacks [95, 
193], various responses of Cichlids [12, 190, 191], fighting of Fighting 
Fish [119], courtship of various Syngnathid fish [57], egg-rolling of Grey 
Lag Goose and other species [130, 168], response of young birds to flying
predators [216], begging of young passerine birds [225], fighting of
the European Robin [112], pecking of young Herring Gulls [227], hunting
in owls [174], response of passerines to owls [68, 77], fighting of
Chaffinches [142], auditory social signals of domestic hen [29], mating 
of rat [17]. 

Mode of action of the stimulus in evoking a response. 

1. Changes in threshold. Changes in internal factors (e.g., hormones) 
may affect the responsiveness to a particular external stimulus: such 
changes in responsiveness may occur even though the strength of the 
response once elicited is unaffected [21, 170]. Baerends [13] has shown 
that in Lebistes reticulatus the color pattern of the male can be used as 
an index of the internal state; and he has used this to demonstrate the 
effect of changes in the internal state on responsiveness to a constant 
stimulus [see also van Iersel, 95]. In exceptional cases practically no 
external stimulus may be necessary to elicit the response [Leerlaufreak- 
tionen, Vacuum Activities, Overflow Activities, 6, 128, 216]. On the 
other hand, extra-strong stimuli (supernormal stimuli) may be effective 
when the animal is unresponsive to a normally adequate one [135, 
216, 227]. 

2. Diverse nature of stimulus effects. In earlier ethological studies 
emphasis was laid on the function of the stimulus in releasing a response 
which was already “ready.” Later work has also underlined the follow- 
ing factors: 

a. The stimulating or motivating effect of the stimulus. The 
validity of the distinction between releasing and motivating effects is, 
however, still uncertain [77, 95]. 

b. Some stimuli (consummatory stimuli) serve to “switch off” 
the behavior in progress at the moment [16, 78, 207: see p. 573]. 

c. Some stimuli function mainly in the orientation of the re- 
sponse: homing of hunting wasps [22, 210, 224], hunting in Ammophila 
[9], orientation of ants [27, 232, 233], hunting in mantids [149], foraging 
of bees [61], orientation of fishes (demonstrating influence of internal 
factors) [85], various responses of fishes [12, 13, 95, 216], egg-rolling of 
goose [130], begging of young passerine birds [225], homing of birds 
[65, 84, 108, 109, 110, 111, 146, 183]. 

3. Principle of Heterogeneous Summation (Reizsummenregel). If
some of the sign stimuli which normally evoke a response are absent, 
the response may be reduced quantitatively but not qualitatively. 
The intensity of the response depends on how many of the sign stimuli 
are present but not on which (except in so far as the sign stimuli are of 
differing effectiveness). It is sometimes possible to construct models 
which combine sign stimuli in such a way that they are even more 
effective than the normal object [13, 55, 119, 135, 190, 191, 214, 216, 
220, 227]. 
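
The summation rule in item 3, together with the threshold changes noted in item 1, can be illustrated by a minimal Python sketch. It assumes, purely for illustration, that each sign stimulus has a fixed numerical effectiveness and that the contributions simply add. The weights, the stimulus names other than the red belly, and the function names are hypothetical.

```python
# Hypothetical effectiveness of each sign stimulus, in arbitrary units.
WEIGHTS = {"red_belly": 3.0, "stimulus_b": 2.0, "stimulus_c": 1.0}


def response_intensity(present, weights=WEIGHTS):
    """Sum the effectiveness of the sign stimuli that are present."""
    return sum(weights[name] for name in set(present))


def is_elicited(present, threshold, weights=WEIGHTS):
    """Report whether the summed stimulation reaches the threshold.

    `threshold` stands in for the internal state: a lower value
    represents a more responsive animal.
    """
    return response_intensity(present, weights) >= threshold


if __name__ == "__main__":
    # Different stimuli of the same total effectiveness give the same
    # intensity: the sum matters, not which stimuli supply it.
    print(response_intensity(["red_belly"]))
    print(response_intensity(["stimulus_b", "stimulus_c"]))
    # A weak stimulus is adequate only when the threshold is low.
    print(is_elicited(["stimulus_c"], threshold=0.5))
    print(is_elicited(["stimulus_c"], threshold=2.0))
```

On this reading a supernormal model is simply one whose summed effectiveness exceeds that of the normal object, which is why it may still be effective when the threshold has risen.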

Analysis of the response. Many (but not all) forms of appetitive and
consummatory behavior can be analyzed into a Fixed Action Pattern and 
an orientation component (see p. 573). The following references illustrate 
this: cichlid fishes [12, 190, 191], egg-rolling of goose [130], begging of 
young passerines [225]. 

Analysis of internal factors. So far only a few ethological studies in 
which internal factors were manipulated directly have been made by 
European ethologists, who have relied on the results of work by others 
(e.g., Beach and his pupils) in this field. The following, however, may 
be noted: gonadotrophins in sexual behavior of Xenopus laevis [179], 
parental behavior of doves [116], factors influencing the change from
one phase of the reproductive cycle to another in the Three-Spined 
Stickleback [95]. 

Investigation of the factors affecting one particular response. In a 
number of other studies attempts to analyze the factors affecting one 
particular response have been made. In some of these the effects of re- 
peated stimulation, in others the effects of varying “internal” factors, and 
in others the effects of varied external stimuli as well were studied. Par- 
ticular attention has been paid to the processes underlying the changes 
in responsiveness when a stimulus is presented continuously or repeti- 
tively: these processes can be characterized by (1) the degree of per- 
manence of the waning, and (2) the degree of specificity of the waning 
to the particular stimulus used. Among the responses investigated 
from this point of view have been the following: hunting of Ammophila 
[9], various patterns in Gryllus [92], hunting and sexual behavior of 
spiders [40, 169], fanning of the Three-spined Stickleback [95, 193], 
courtship of Guppy [13], wiping, equilibrating, hunting, etc., responses 
of Amphibians [23, 51, 58], breathing in newts [201], egg-laying of dove 
[32], parental behavior of dove [116], mobbing response of Chaffinches 
to predators [77], gobbling of Turkey [186], begging of young passerines 
[170]. [See also reviews in 129, 172.]
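
The two characteristics of waning named above, permanence and specificity, can be given a rough numerical form. The following Python sketch is one possible way of doing so and is not drawn from any of the studies cited: the function name `waning_indices`, its arguments, and both index definitions are assumptions made for illustration.

```python
def waning_indices(baseline, waned, after_rest, other_baseline, other_after):
    """Return (permanence, specificity) for one waning experiment.

    baseline        response to the test stimulus before repeated presentation
    waned           response to it at the end of repeated presentation
    after_rest      response to it after a rest interval
    other_baseline  response to a second stimulus before the experiment
    other_after     response to the second stimulus just after the waning
    """
    if baseline <= 0 or other_baseline <= 0:
        raise ValueError("baseline responses must be positive")
    decrement = baseline - waned
    if decrement <= 0:
        raise ValueError("the response did not wane")
    # Permanence: share of the decrement still present after the rest.
    permanence = (baseline - after_rest) / decrement
    # Specificity: 1 when the second stimulus is unaffected, 0 when its
    # response has waned by the same proportion as the test stimulus.
    proportional_loss = decrement / baseline
    transferred_loss = (other_baseline - other_after) / other_baseline
    specificity = 1.0 - transferred_loss / proportional_loss
    return permanence, specificity


if __name__ == "__main__":
    print(waning_indices(100, 20, 60, 100, 80))
```

With the example figures, half of the decrement persists after the rest, and the second stimulus loses only a quarter of the proportion lost by the test stimulus, so the waning is largely, though not wholly, specific.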

Displacement activities. Sometimes when an animal is behaving in 
one way (e.g., aggressively) it suddenly shows behavior normally char- 
acteristic of a quite different context (e.g., feeding). Thus fighting Great 
Tits may suddenly start to peck at a bud or to pick up leaves and throw 
them over their shoulders—activities normally used in feeding. Such 
cases are grouped under the term “displacement activities” [104, 212, 
216, 218]. The concept of displacement activities thus has reference 
primarily to apparent irrelevance, and thus often to the known historical 
origin of the behavior pattern in question—a pattern developed (phylo- 
genetically or ontogenetically) in one context appears in another. Such 
activities sometimes occur when the animal has tendencies to behave in 
two incompatible ways, or is in the presence of difficult or insoluble
problems, etc. It is, therefore, sometimes said that they are activities
“activated by a strange drive” or some similar phrase [6, 157, 218].
Tinbergen [218], however, makes it quite plain that this is only a theory
about causation, not a matter of fact. “Displacement activities” are only
one of a number of types of behavior which appear in “conflict” situations
[e.g., ambivalent behavior, 216; redirection activities, 16; threat
and courtship postures, see references on p. 591; etc.]. Further, although
the category of displacement activities has been a useful one at the stage
of research reached, it certainly contains very heterogeneous phenomena.
It is now clear that the causal basis of many displacement activities is 
less different from that of the same motor patterns when appearing in 
their “normal” context than was formerly supposed [3, 156]. In other 
cases displacement activities may necessitate a radical revision of the 
ethologists’ concept of drive (p. 585). Indeed, the implied dichotomy between
“displacement” and “nondisplacement” behavior (or, as they are
sometimes called, allochthonous and autochthonous behavior) can be a
source of confusion: the distinction is not a clear one.

Ontogeny of behavior—learning. 

1. The various patterns of behavior shown by an animal are not all 
equally easily modified by learning. Thus the homing behavior of the 
wasp Philanthus is easily modifiable, but the hunting behavior is not 
[216]. The extent to which the different parts of the behavior are 
modifiable is characteristic of the species and, presumably, an adaptive 
feature [216 and references cited, 220]. 

Further, when any particular response becomes modified by condi- 
tioning, the features of the environment which are selected are often 
characteristic of the species. Thus various species of Hymenoptera use 
visual landmarks in orientation, but the sort of landmarks which are 
selected varies from species to species [216]. 

2. The tendency for particular behavior patterns to be modified 
rather than others, and the tendency to learn some aspects of the situation 
rather than others, are particularly revealed in the phenomena of “im- 
printing.” The classic example is the Grey Lag gosling which, on hatch- 
ing from the egg, will follow the first moving object (within wide limits 
of shape, size, etc.) that it sees, and will thenceforward direct towards 
that object all the activities which are normally directed to the parent. 
Lorenz [121] claimed that imprinting is absolutely different from other 
types of learning—on the grounds that it (a) is irreversible, (b) is 
limited to a brief period of the life cycle, (c) sometimes occurs before 
the specific behavior patterns in which it will play a role have developed, 
(d) involves a learning of species rather than individual characteristics— 
but this absolute distinction cannot be maintained. It is, however,
particularly interesting in that learning occurs without any conventional
reward, and involves a modification to “consummatory stimuli” [28, 29,
54, 71, 82, 121, 122, 123, 129, 160, 161, 173, 175, 198]. 

The acquisition of song by some passerine birds seems to have some 
features in common with imprinting [167, 208, 209]. 

3. Other types of learning in animals have been reviewed by Thorpe 
[204, 206, 209]. The detailed comparisons which he makes between 
animals of different phyletic levels give a perspective to the study of 
learning which cannot be achieved by studies, however detailed, of a 
single species. 

4. Complex types of learning, such as “counting” by birds and
mammals, have also been studied [102, 103, 209].

5. Changes in responsiveness to a constant stimulus (see p. 593). 

6. The development of behavior and the relation between matura- 
tion and learning has been studied by a number of writers [e.g., 20, 28, 
41, 43, 44, 47, 89, 105, 106, 118, 161, 181, 182, 192, 208]. 

Evolution of behavior. 

1. Evolution of social releasers. The integration of social behavior 
depends on the elicitation of specific responses in one individual (the 
“reactor”) by sensory patterns provided by the structure or behavior 
of another (the “actor”). Communication of this sort has been facilitated
by evolutionary changes in both the structure and behavior of
the actor, making them more improbable and striking, and in the 
responsiveness (i.e., in the “IRM”) of the reactor to these signals of 
the actor [122, 129, 145, 156, 214, 216, etc.]. 

So far, three sources of “displays” serving social communication have 
been recognized—“intention movements” (p. 577), “displacement activi- 
ties” (p. 593) (including autonomic responses), and redirection activities 
[158]: all have become modified in evolution for a signal function 
[39, 145, 156, 218, 221]. In addition, courtship displays have sometimes 
been evolved by the further modification of threat postures. The evolu- 
tionary changes involved include the development of conspicuous struc- 
tures, changes in the threshold, intensity, and form of the movements, 
and also changes in the relations between the components of the move- 
ments (“ritualization”). In some cases “emancipation” from the causal 
factors which primitively controlled the movement may also have 
occurred. (Similar changes may, of course, occur in movements not 
serving social communication.) The changes are such as to make the 
movement more conspicuous, more species-specific, or (in the case of
certain “submissive,” etc., displays) more different from displays of
opposite significance [10, 39, 79, 83, 121, 122, 123, 125, 127, 128, 143,
152, 164, 214, 216, 220, 221].

Nonvisual communication has also been studied by ethologists [28, 
29, 53, 67, 141, 144, 145, 181, 182, 208, 216].

2. The evolution of behavior as revealed by species differences. The
comparative study of closely related forms enables deductions to be
drawn about both the evolution of behavior and the phylogenetic relationship
between species. Although complex behavior systems (e.g.,
“social nesting”) have sometimes been used in such studies, they are 
often less certainly homologous between species, and their taxonomic 
utility is less easily assessed than that of simpler patterns. Fixed Action 
Patterns, especially those which are used in social communication be- 
tween species, have been most valuable [83, 121, 127, 129, 221]: 
Arthropods [1, 14, 26, 34, 35, 52, 53, 96, 97, 187, 199], fish [12, 57, 
155, 190, 191], birds [4, 8, 30, 64, 71, 72, 73, 74, 79, 120, 127, 128, 
158, 168, 196, 235], mammals [166]. 

Functional aspects of behavior. These are discussed in many 
ethological papers but see particularly [7, 37, 81, 139, 144, 176, 177, 
219]. 

Synthesis and links with neurophysiology. Complete understanding of 
behavior cannot be obtained merely by analysis into component variables. 
It is necessary (a) to resynthesize the products of the analysis in order to 
check its thoroughness, and to provide a picture of the behavior of the 
whole animal, and (b) to push the analysis still further to physiological 
levels. 

Tinbergen [213, 215, 216] and Baerends [9, 12] have indicated one 
way in which such a resynthesis can be performed. The various causal 
factors (independent variables) affecting the behavior of an animal can 
be arranged in a series of interlinked hierarchical schemes. Thus some 
factors (e.g., male sex hormone) influence many different activities, 
whereas others (e.g., the situation “female laying eggs in nest” for the 
male stickleback) influence only one. Such a scheme can be described 
as a “hierarchy of causal factors.” 
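
The idea that some factors influence many activities and others only one can be made concrete with a small tree structure. The Python sketch below is an illustration under assumptions, not the scheme of Tinbergen or Baerends: the class name `Factor`, the method `influenced`, and the activity names are hypothetical placeholders, and only the two factor names come from the text.

```python
from dataclasses import dataclass, field


@dataclass
class Factor:
    """A causal factor at one level of a hierarchical scheme."""
    name: str
    activities: list = field(default_factory=list)  # activities affected directly
    subfactors: list = field(default_factory=list)  # lower-level factors

    def influenced(self):
        """Return every activity affected by this factor or any factor below it."""
        found = set(self.activities)
        for sub in self.subfactors:
            found |= sub.influenced()
        return found


# Placeholder activities; the text specifies only the two factors.
egg_laying = Factor("female laying eggs in nest", activities=["activity_c"])
hormone = Factor(
    "male sex hormone",
    activities=["activity_a", "activity_b"],
    subfactors=[egg_laying],
)

if __name__ == "__main__":
    print(sorted(hormone.influenced()))
    print(sorted(egg_laying.influenced()))
```

Stating the level of a factor in such a tree makes explicit how wide a generalization about it is meant to be, since a statement true of a low-level factor need not hold for the factors above it.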

This concept of a hierarchy of causal factors, besides providing 
a means of synthesis and a way of looking at the whole behavior of the 
animal, is important in a number of other ways. For instance, as Tin- 
bergen has pointed out [213], much useless discussion and argument as 
to the validity of generalizations can be avoided if the level in such a 
hierarchical scheme which is being discussed is specified. 

It also provides an opportunity for a further link-up with physiological 
work. In a bold hypothetical scheme, Tinbergen has gone further than 
this to give a graphic representation of a hierarchical system of causally 
related nervous centers, each with its own IRM and each governing a
particular type of behavior. This model can be of great value in “helping
us to put our thoughts in order.” The general hierarchical arrangement is
supported in some respects by both ethological and neurophysiological
data [76, 216] though evidence is now accruing that the scheme as
propounded requires rather fundamental modification [78, 80] (see p.
582).

A rather different type of scheme is given by Kortlandt [106]. This makes use
of the concept of “goal” but unfortunately fails to define it adequately. See Brit.
J. Anim. Behav., 1957, 5, 116-118.


SCOPE AND RANGE OF APPLICATION OF ETHOLOGY 


The work discussed in the preceding sections is not part of a self- 
contained system. Much of it consists of attempts to establish bridge- 
heads for liaison both with workers in peripheral fields and also with 
investigators of the same phenomena at a different level, such as physiolo- 
gists and biochemists. Its success must be judged by the extent to which 
these bridgeheads are secured—by the usefulness of ethological principles 
to both the psychologist running a rat in a maze and the physiologist 
pushing electrodes into its brain. It has already been emphasized that 
the aims and methods used by European ethologists are similar to those 
used by many American workers; and in some cases very similar con- 
clusions have been reached independently. Links with S-R theory, ex- 
pectancy theory [e.g., 209], information theory, and many other branches 
of psychology as well as with physiology and zoology are becoming in- 
creasingly apparent. 


ETHOLOGICAL METHODS BELIEVED TO BE VALUABLE IN 
CONTEXTS OUTSIDE THE SCOPE OF ETHOLOGY 


These have been mentioned in the preceding discussion and can be 
summarized here:


1. The necessity for observation, description, and classification before 
analysis 

2. The validity of the functional and evolutionary as well as the 
causal study of behavior, and the cross-fertility of these separate ap- 
proaches 

3. The importance of qualitative studies as a necessary preliminary 
to quantitative work 

4. The importance of designing experiments with a view to their 
biological relevance for the species concerned 

5. The role of a comparative approach 


a. In facilitating observation 
b. In elucidating the evolution of behavior 
c. In taxonomy
d. In providing a basis for generalization
e. In leading to an understanding of function 


6. The importance of synthesis as well as analysis 


NOTE ON TERMINOLOGY 


Although no definitions of the terms used by ethologists are given in 
this paper, page references to discussions of a few of the more important 
concepts are given below. Some of them have been defined by Thorpe 
[205] and, more recently, Verplanck (in press) has provided a glossary 
of terms for behavior students on both sides of the Atlantic. 


Act, p. 576
Allochthonous behavior, p. 594
Appetitive behavior, p. 572
Autochthonous behavior, p. 594
Center, p. 582
Component, p. 590
Consummatory act, p. 572
Consummatory stimuli, p. 573
Displacement activity, p. 593
Display, p. 590
Drive, p. 585
Emancipation, p. 595
Fixed Action Pattern (Erbkoordination), p. 573
Heterogeneous Summation (Reizsummenregel), p. 592
Imprinting, p. 594
Inborn, p. 574-5
Innate, p. 574-5
Innate Releasing Mechanism (Schema, angeborener auslösender Mechanismus), p. 584
Intention movement, p. 577
Mood (Stimmung), p. 587
Motivational impulses, p. 582
Motor coordination mechanism, p. 584
Orientation component, p. 573
Overflow activity, p. 592
Reaction, p. 568, 576
Reaction specific energy, p. 582
Reafferenz, p. 568
Redirection activity, p. 595
Reflex, p. 576
Releaser, p. 584
Response, p. 576
Ritualization, p. 595
Sign stimulus, p. 584, 591
Specific Action Potential, p. 586
Supernormal stimulus, p. 592
Tendency, p. 586
Vacuum activity (Leerlaufreaktion), p. 592


REFERENCES


1. Adriaanse, M. S. C. A. Ammophila campestris Latr. and Ammophila 
adriaanse Wilke. Behaviour, 1947, 1, 1-35. 


2. Adrian, E. D., Bremer, F., & Jasper, H. H. Brain mechanisms and 
consciousness. Sym. Council int. Organisation med. Sci. Oxford: Blackwell,
1954.


3. Andrew, R. J. Some remarks on conflict situations, with special 
reference to Emberiza species. Brit. J. Anim. Behav., 1956, 4, 41-45.

4. Andrew, R. J. A comparative study of certain behaviour patterns in
the Yellowhammer, Corn Bunting and related species. Unpublished doctoral
dissertation, Cambridge Univ., 1955.

5. Armstrong, E. A. Diversionary display. 1. Connotation and terminology.
2. The nature and origin of distraction display. Ibis, 1949, 91,
88-97, 179-188. 

6. Armstrong, E. A. The nature and function of displacement activities. 
Sym. Soc. exp. Biol., 1950, 4, 361-384. 

7. Armstrong, E. A. The ecology of distraction display. Brit. J. Anim. 
Behav., 1954, 2, 121-135. 

8. Armstrong, E. A. The wren. London: Collins, 1955. 

9. Baerends, G. P. Fortpflanzungsverhalten und Orientierung der Grabwespe
Ammophila campestris. Tijdschr. Entomol., 1941, 84, 68-275.

10. Baerends, G. P. Specializations in organs and movements with a re- 
leasing function. Sym. Soc. exp. Biol., 1950, 4, 337-360. 

11. Baerends, G. P. The ethological analysis of fish behaviour. In M. E. 
Brown (Ed.), The physiology of fishes. Vol. 2. New York: Academic Press, 
1957. 

12. Baerends, G. P., & Baerends, J. M. An introduction to the ethology 
of Cichlid fishes. Behaviour, 1950, Supplement No. 1, 1-242. 

13. Baerends, G. P., Brouwer, R., & Waterbolk, H. Tj. On the ethology
of Lebistes reticulatus Peters. 1. Analysis of the male courtship pattern. Be- 
haviour, 1955, 8, 249-335. 

14. Bastock, M. A gene mutation which changes a behavior pattern. 
Evoln., 1957, 10, 421-439. 

15. Bastock, M., & Manning, A. The courtship of Drosophila melano- 
gaster. Behaviour, 1955, 8, 85-111. 

16. Bastock, M., Morris, D., & Moynihan, M. Some comments on con- 
flict and thwarting in animals. Behaviour, 1954, 6, 66-84. 

17. Beach, F. A. Analysis of the stimuli adequate to elicit mating be- 
haviour in the sexually inexperienced male rat. J. comp. Psychol., 1942, 33, 
163-207. 

18. Beach, F. A. Effects of testosterone propionate upon the copulatory
behaviour of sexually inexperienced male rats. J. comp. Psychol., 1942, 33,
247.

19. Beach, F. A. The de-scent of instinct. Presidential address, Eastern 
Psychol. Ass., 1952. 

20. Beach, F. A. In B. Schaffner (Ed.), Transactions of the First Conference
on Group Processes. New York: Macy Foundation, 1955.

21. Beach, F. A., & Holz-Tucker, A. M. Effects of different con- 
centrations of androgen upon sexual behaviour in castrated male rats. J. 
comp. physiol. Psychol., 1949, 42, 433-453.

22. Beusekom, G. van. Some experiments on the optical orientation in 
Philanthus triangulum Fabr. Behaviour, 1948, 1, 195-225. 

23. Birukow, G. Ermüdung und Umstimmung bei Gleichgewichtsreaktionen
der Amphibien. Verh. dtsch. Zool. Ges. Wilhelmshaven: 1951.
Pp. 144-150.

24. Birukow, G. Studien über statisch-optisch ausgelöste Kompensationsbewegungen
und Körperhaltung bei Amphibien. Z. vergl. Physiol., 1952, 34,
448-472.

25. Broadbent, D. Pavlovian conditioning and vigilance tasks. Medical 
Research Council, 1952, A.P.U. 175/52. 

26. Buchholtz, C. Eine vergleichende Ethologie der orientalischen 
Calopterygiden (Odonata) als Beitrag zu ihrer systematischen Deutung. Z.
Tierpsychol., 1955, 12, 364-386. 

27. Carthy, J. D. The orientation of two allied species of British ant. 
I and II. Behaviour, 1951, 3, 275-303; 304-318. 

28. Collias, N. E. The development of social behaviour in birds. Auk,
1952, 69, 127-159.

29. Collias, N., & Joos, M. The spectrographic analysis of sound signals
of the domestic fowl. Behaviour, 1953, 5, 175-189.

30. Craig, W. The expressions of emotion in the pigeon. I. The blond
ring-dove (Turtur risorius). J. comp. Neurol. Psychol., 1909, 19, 29-80.

31. Craig, W. The expressions of emotion in the pigeon. II. The
mourning dove (Zenaidura macroura Linn.). Auk, 1911, 28, 398-407.

32. Craig, W. Oviposition induced by the male in pigeons. J. Morphol.,
1911, 22, 299-305.

33. Craig, W. Appetites and aversions as constituents of instincts. Biol. 
Bull., 1918, 34, 91-107. 

34. Crane, J. Comparative biology of Salticid spiders at Rancho Grande, 
Venezuela. Zoologica, 1948-50, 33-35. 

35. Crane, J. A comparative study of innate defensive behaviour in 
Trinidad Mantids. Zoologica, 1952, 37, 259-293.

36. Crowcroft, P. Notes on the behaviour of shrews. Behaviour, 1955, 8,
3-80.

37. Cullen, E. Adaptations in the Kittiwake to cliff-nesting. Ibis, 1957,
99, 275-302.

38. Daanje, A. Über das Verhalten des Haussperlings. Ardea, 1941, 30.

39. Daanje, A. On the locomotory movements of birds and the in- 
tention movements derived from them. Behaviour, 1950, 3, 48-98. 

40. Drees, O. Untersuchungen über die angeborenen Verhaltensweisen
bei Springspinnen. Z. Tierpsychol., 1952, 9, 169-207. 

41. Eibl-Eibesfeldt, I. Über die Jugendentwicklung des Verhaltens eines
männlichen Dachses (Meles meles L.). Z. Tierpsychol., 1950, 7, 327-

42. Eibl-Eibesfeldt, I. Ein Beitrag zur Paarungsbiologie der Erdkröte.
Behaviour, 1950, 2, 217-237.

43. Eibl-Eibesfeldt, I. Beobachtungen zur Fortpflanzungsbiologie und
Jugendentwicklung des Eichhörnchens (Sciurus vulgaris L.). Z. Tierpsychol., 
1951, 8, 370-400. 

44. Eibl-Eibesfeldt, I. Gefangenschaftsbeobachtungen an der persischen
Wüstenmaus (Meriones persicus persicus Blanford). Z. Tierpsychol., 1951,
8, 400-423.

45. Eibl-Eibesfeldt, I. Nahrungserwerb und Beuteschema der Erdkröte 
(Bufo bufo L.). Behaviour, 1952, 4, 1-36. 

46. Eibl-Eibesfeldt, I. Vergleichende Verhaltensstudien an Anuren. I. 
Zur Paarungsbiologie des Laubfrosches Hyla arborea L. Z. Tierpsychol.,
1952, 9, 383-395. 

47. Eibl-Eibesfeldt, I. Zur Ethologie des Hamsters (Cricetus cricetus).
Z. Tierpsychol., 1953, 10, 204-254. 

48. Eibl-Eibesfeldt, I. Der Kommentkampf der Meerechse. Z. Tier- 
psychol., 1955, 12, 49-62. 

49. Eibl-Eibesfeldt, I. Ethologische Studien am Galápagos Seelöwen. Z. 
Tierpsychol., 1955, 12, 286-303. 

50. Eibl-Eibesfeldt, I. Zur Biologie des Iltis (Putorius putorius). Verh. 
dtsch. Zool. Ges. Erlangen: 1955. Pp. 304-323. 

51. Eikmanns, K-H. Verhaltensphysiologische Untersuchungen über den 
Beutefang und das Bewegungssehen der Erdkröte. Z. Tierpsychol., 1955, 
12, 229-253. 

52. Evans, H. E. An ethological study of the Digger Wasp Bembecinus 
neglectus, with a review of the ethology of the genus. Behaviour, 1955, 7, 
287-304. 

53. Faber, A. Laut- und Gebärdensprache bei Insekten. Vol. 1. Orthop- 
tera. Stuttgart: 1953. 

54. Fabricius, E. Zur Ethologie junger Anatiden. Acta. Zool. Fenn., 
1951, 68, 1-178. 

55. Fabricius, E. Aquarium observations on the spawning behaviour of
the Char, Salmo alpinus. Inst. Freshwater Research, Report No. 34.
Drottningholm, 1953.

56. Fabricius, E. Aquarium observations on the spawning behaviour of 
the Burbot, Lota vulgaris L. Inst. Freshwater Research, Report No. 35. 
Drottningholm, 1954. 

57. Fiedler, K. Vergleichende Verhaltensstudien an Seenadeln, Schlangen
und Seepferdchen (Syngnathidae). Z. Tierpsychol., 1954, 11, 358-6.

58. Franzisket, L. Untersuchungen zur Spezifität und Kumulierung der
Erregungsfähigkeit und zur Wirkung einer Ermüdung in der Afferenz bei
Wischbewegungen des Rückenmarkfrosches. Z. vergl. Physiol., 1953, 34,
5-538.

59. Free, J. B. A study of the stimuli which release the food-begging
and offering responses of worker honey-bees. Brit. J. Anim. Behav., 1956, 4,
101.

60. Frisch, K. von. Über einen Schreckstoff der Fischhaut und seine
biologische Bedeutung. Z. vergl. Physiol., 1941, 29, 46-145. 

61. Frisch, K. von. An account of the life and senses of the honey bee. 
London: Methuen, 1954. 

62. Goethe, F. Beobachtungen bei der Aufzucht junger Silbermöwen. 
Z. Tierpsychol., 1955, 12, 402-434.

63. Goodwin, D. Some aspects of the behaviour of the Jay (Garrulus 
glandarius). Ibis, 1951, 93, 414-442; 602-625.

64. Goodwin, D. A comparative study of the voice and some aspects of 
behaviour in two old-world Jays. Behaviour, 1952, 4, 293-316. 

65. Griffin, D. R. Bird navigation. Biol. Rev., 1952, 359-400. 

66. Haartman, L. von. Der Trauerfliegenschnäpper. I. Ortstreue und 
Rassenbildung. II. Populationsprobleme. III. Die Nahrungsbiologie. Acta 
Zool. Fenn., 1949, 56, 1-104; 1951, 67, 1-60; 1954, 83, 1-92.

67. Haartman, L. von, & Löhrl, H. Die Lautäusserungen des Trauer-
und Halsbandfliegenschnäppers. Orn. Fenn., 1950, 4, 86-97. 

68. Hartley, P. H. T. An experimental analysis of interspecific recogni- 
tion. Sym. Soc. exp. Biol., 1950, 4, 313-336. 

69. Hebb, D. O. Drives and the C.N.S. (Conceptual Nervous System).
Psych. Rev., 1955, 62, 243-254.

70. Hebb, D. O. Heredity and environment in mammalian behaviour.
Brit. J. Anim. Behav., 1953, 1, 43-47.

71. Heinroth, O. Beiträge zur Biologie, namentlich Ethologie und Psychologie
der Anatiden. Verh. V. Int. Ornithol. Kongr. Berlin: 1911. Pp.
589-702.

72. Heinroth, O., & Heinroth, K. Verhaltensweisen der Felsentaube
(Haustaube) Columba livia livia L. Z. Tierpsychol., 1949, 6, 153-201.

73. Heinroth, O., & Heinroth, M. Die Vögel Mitteleuropas. Berlin:
Bermühler, 1928.

74. Hinde, R. A. The behaviour of the Great Tit (Parus major) and
some other related species. Behaviour, 1952. Supplement No. 2. Pp. 1-201. 

75. Hinde, R. A. The conflict between drives in the courtship and 
copulation of the Chaffinch (Fringilla coelebs). Behaviour, 1953, 5, 1-31. 

76. Hinde, R. A. Appetitive behaviour, consummatory act, and the 
hierarchical organisation of behaviour—with special reference to the Great 
Tit. Behaviour, 1953, 5, 191-224. 

77. Hinde, R.° A. Factors governing the changes in strength of a par- 
tially inborn response, as shown by the mobbing behaviour of the Chaffinch 
(Fringilla coelebs). I. The nature of the response, and an examination of its 
course. II. The waning of the response. Proc. roy. Soc. B., 1954, 142B, 306- 
331, 331-358, and in preparation. 

78. Hinde, R. A. Changes in responsiveness to a constant stimulus. Brit. 
J. Anim. Behav., 1954, 2, 41-55. 

79. Hinde, R. A. A comparative study of the behaviour of certain 
finches. Ibis, 1955, 97, 706-745; 98, 1-23. 

80. Hinde, R. A. Ethological models and the concept of drive. Brit. I. 
Philos. Sci., 1956, 6, 321-331. 

81. Hinde, R. A. The biological significance of the territories of birds. 
Ibis, 1956, 98, 340-369, and other Papers in this number. 

82. Hinde, R. A., Thorpe, W. H., & Vince, M. A. The following re- 
sponse of young moorhens and coots. Behaviour, 1956, 9, 214-242. 

83. Hinde, R. A., & Tinbergen, N. The comparative study of species- 
characteristic behaviour. In Evolution and Behavior. New Haven, Conn.: 
Yale Univer. Press, in press. 


Some Recent Trends in Ethology 603 


84. Hoffmann, K. Versuche zu der im Richtungsfinden der Vögel ent- 
haltenen Zeitschätzung. Z. Tierpsychol., 1954, 11, 453-475. 

85. Holst, E. von. Quantitative Messung von Stimmungen im Verhalten 
der Fische. Sym. Soc. exp. Biol., 1950, 4, 143-172. 

86. Holst, E. von. Relations between central nervous system and periph- 
eral sense organs. Brit. J. Anim. Behav., 1954, 2, 89-9. 

87. Holst, E. von, & Mittelstaedt, H. Das Reafferenzprincip. Natur- 
wissenschaften, 1950, 37, 464. 

88. Holzapfel, M. Uber Bewegungsstereotypen bei gehaltenen Säugern. 
I-IV. Z. Tierpsychol, 1938, 2, 46-60, 60-72; Zoologische Garten, 1939, 10, 
184-193; Z. Tierpsychol., 1939, 3, 151-160. 

89. Holzapfel, M. Die Beziehung zwischen den Trieben junger und 
erwachsener Tiere. Schweizerische Z. Psychol., 1949, 8, 32-60. 

90. Howard, H. E. The nature of a bird’s world. Cambridge: Cambridge 
Univer. Press, 1935. 

91. Howard, H. E. A waterhen’s worlds. Cambridge: Cambridge Univer. 
Press, 1940. 

92. Hubert, F. Sitz und Bedeutung nervöser Zentren für Instinkthand- 
lungen beim Männchen von Gryllus campestris L. Z. Tierpsychol., 1955, 
12, 12-48. 

93. Huxley, J. S. The courtship habits of the great crested grebe 
(Podiceps cristatus). Proc. Zool. Soc. Lond., 1914, 2, 491-562. 

94. Huxley, J. S. Evolution: the modern synthesis. London: Allen and 
Unwin, 1942. 

95. Iersel, J. A. A. van. An analysis of the parental behaviour of the 
male Three-spined Stickleback. Behaviour, 1953. Supplement No. 3. 

96. Jacobs, W. Vergleichende Verhaltensttudien an Feldheuschrecken. 
Z. Tierpsychol., 1950, 7, 169-216. 

97. Jacobs, W. Verhaltensbiologische Studien an Feldheuschrecken. Z. 
Tierpsychol. Supplement, 1953. ; 

98. Keenleyside, M. H. A. Some aspects of the schooling behaviour of 
fish. Behaviour, 1955, 8, 183-249. 

99. Kennedy, J. S. Is modern ethology objective? Brit. J. Anim. Behav., 
1954, 2, 12-29. 

100. Koenig, L. Beiträge zur einem Aktionssystem des Bienenfressers 
(Merops apiaster L.). Z. Tierpsychol., 1951, 8, 169-210. t 

101. Koenig, L. Beobachtungen am afrikanischen Blauwengenspint 
(Merops superciliosus chrysocercus) im freier Wildbahn und Gefangen- 
schaft. Z. Tierpsychol., 1953, 10, 180-204. : 

102. Köhler, ©. “Zahl”-versuche an einem Kohlraben und Vergleichs- 
versuche an Menschen. Z. Tierpsychol., 1943, 5, 575-712. 

103. Köhler, ©. The ability of birds to count. Bull. Anim. Behav., 1951, 
9, 41-45. 

104. Kortlandt, A. Wechselwirkung zwischen Instinkten. Arch. Neerl. 
Zoöl., 1940, 4, 443-520. 

105. Kortlandt, A. Eine Übersicht der angeborenen Verhaltensweisen des 


604 R. A. HINDE 


Mitteleuropäischen Kormorans (Phalacrocorax carbo sinensis). Arch. Neérl. 
Zoöl., 1940, 4, 401-442. 

106. Kortlandt, A. Aspects and prospects of the concept of instinct 
(vicissitudes of the hierarchy theory). Arch. Neérl. Zoöl., 1955, 11, 155- 
284. 

107. Kramer, G. Beobachtungen über Paarungsbiologie und soziales 
Verhalten von Mauereidechsen. Z. Morph. Ökol. Tiere, 1937, 32, 752-783. 

108. Kramer, G. Experiments on bird orientation. Ibis, 1952, 94, 265- 
285. 

109. Kramer, G. Wird die Sonnenhöhe bei Heimfindorientierung ver- 
wertet? J. Ornithol., 1953, 94, 201-219. 

110. Kramer, G. Die Sonnenorientierung der Vögel. Verh. dtsch. Zool. 
Ges. Freiburg: 1952. Pp. 72-84. 

111. Kramer, G. Ein weiterer Versuch, die Orientierung von Brieftauben 
durch jahreszeitliche Änderung der Sonnenhöhe zu beeinflussen. J. Ornithol, 
1955, 96, 173-185. 

112. Lack, D. The behaviour of the robin. I & II. Proc. Zool. Soc. 
Lond., 1939, A, 109, 169-178. 

113. Lack, D. The life of the robin. London: H. F. & G. Witherby, 
1943, 

114. Laven, H. Beiträge zur Biologie des Sandregenpfeifers (Charadrius 
hiaticula). J. Ornithol., 1940, 88, 183-288, s 

115. Lehrman, D. S. A critique of Konrad Lorenz’s theory of instinctive 
behaviour. Quart. Rev. Biol., 1953, 28, 337-363. 

116. Lehrman, D. S. The physiological basis of parental feeding be- 
Reo in the Ring Dove (Streptopelia risoria). Behaviour, 1955, 7, 241- 

117. Lehrman, D. S. In B. Schaffner (Ed.), Transactions of the First 
Conference on Group Processes. New York: Macy Foundation, 1955. 

118. Lindemann, W. Uber die Jugendentwicklung beim Luchs (Lynx 
l. lynx Kerr) und bei der Wildkatze (Felis s. silvestris Schreb). Behaviour, 
1955, 8, 1-45. 

119. Lissmann H. W. Die Umwelt des Kampffisches (Betta splendens 
Regen). Z. vergl. Physiol., 1932, 18, 65-111. 

Giese Beiträge zur Ethologie sozialer Corviden. J. Ornithol, 


121. Lorenz, K. Der Kumpan in der U: It des Vögels. J. Ornithol., 
1935, 83, 137-213; 289-413. eg ee 
peer Lorenz, K. The companion on the bird’s world. Auk, 1937, 54 


oe) ree K. Uber den Begriff der Instinkthandlung. Folia Biotheot» 
124. Lorenz, K. Über die Bildung des Instinktbegriffs. Naturwissen- 
schaften, 1937, 25, 289-300; 307-318; 324-331. 


ee wo Verhaltensforschung. Zool. Anz. Suppl. 


Some Recent Trends in Ethology 605 


126. Lorenz, K. Die Paarbildung beim Kohlrabe. Z. Tierpsychol., 1939, 
3, 278-292. 

127. Lorenz, K. Vergleichende Bewegungsstudien an Anatiden. J. Orni- 
thol., 1941, 89. Sonderheft 194-294. 

128. Lorenz, K. The comparative method in studying innate behaviour 
patterns. Sym. Soc. exp. Biol., 1950, 4, 221-268. 

129. Lorenz, K. In B. Schaffner (Ed.), Transactions of the First Con- 
ference on Group Processes. New York: Macy Foundation, 1955. 

130. Lorenz, K., & Tinbergen, N. Taxis und Instinkthandlung in der 
Eirollbewesung der Graugans. I. Z. Tierpsychol., 1938, 2, 1-29. 

131. MacCorquodale, K., & Meehl, P. E. On a distinction between 
hypothetical constructs and intervening variables. Psych. Rev., 1948, 55, 
95-107. 

132. MacCorquodale, K., & Meehl, P. E. Edward C. Tolman. In Estes, 
W. K., et al., Modern learning theory. New York: Appleton-Century-Crofts, 
1954. 

133. Magnus, D. Beobachtungen zur Balz und Eiablage des Kaiser- 
mantels Argynnis paphia L. Z. Tierpsychol., 1950, 7, 435449. 

134. Magnus, D. Experimentelle Untersuchungen am Kaisermantel zu 
Analyse optischer Auslösungsreize. Deutscher Entomologentag in Hamburg, 
1953, 58-75. 

135. Magnus, D. Zur Problem der “überoptimalen” Schlüsselreize. Verh. 
dtsch. Zool. Ges. Tubingen: 1954. 

136. Maier, N. R. F., & Schnierla, T. C, Principles of animal psychology. 
New York and London: MeGraw-Hill, 1935. 

137. Makkink, G. F. An attempt at an ethogram of the European avocet 
(Recurvirostra avosetta L.) with ethological and psychological remarks. 
Ardea, 1936, 25, 1-60. 

138. Makkink, G. F. Contribution to the knowledge of the behaviour of 
the Oystercatcher (Haematopus ostralegus). Ardea, 1942, 31, 23-74. 

i 139. Manning, A. The effect of honeyguides. Behaviour, 1956, 9, 114- 
39. 

140. Manning, A. Some aspects of the foraging behaviour of bumble- 
bees. Behaviour, 1956, 9, 164-201. 

141. Marler, P. Characteristics of some animal calls. Nature, 1955, 

142. Marler, P. Studies of fighting in Chaffinches. I, II, III. Brit. 
J. Anim. Behav., 1955, 3, 111-117; 137-146; 1956, 4, 23-30; 1957, 5, 29-37. 

143. Marler, P. The behaviour of the Chaffinch. Behaviour, 1956. Sup- 
plement No. 5. 

144. Marler, P. The voice of the Chaffinch and its function as a lan- 
guage. Ibis, 1956, 98, 231-261. 

_ 145. Marler, P. Specific distinctiveness in the communication signals of 
birds. Behaviour, 11, 13-39. 

146. Matthews, G. V. T. Bird navigation. Cambridge: Cambridge 

Univer, Press, 1955. 


606 R. A. HINDE 


147. Mayr, E. Systematics and the origin of species. New York: Colum- 
bia Univer. Press, 1942. 

148. Meehl, P. E., & MacCorquodale, K. Some methodological com- 
ments concerning expectancy theory. Psych. Rev., 1951, 58, 230-233. 

149. Mittelstaedt, H. Uber die Beutefangmechanismus der Mantiden. 
Verh. dtsch. Zool. Ges. Freiburg: 1950. Pp. 102-106. 

150. Mittelstaedt, H., & Holst, E. von. Reafferenzprincip und Opto- 
motorik. Zool. Anzeiger, 1953, 151, 253-257. 

151. Morris, D. Homosexuality in the Ten-spined Stickleback (Pygosteus 
pungitius L.). Behaviour, 1952, 4, 233-262. 

152. Morris, D. The reproductive behaviour of the Zebra Finch (Poe- 
phila guttata) with special reference to pseudofemale behaviour and dis- 
placement activities. Behaviour, 1954, 6, 271-322. 

153. Morris, D. The reproductive behaviour of the River Bullhead 
(Cottus gobio L.) with special reference to the fanning activity. Behaviour, 
1954, 7, 1-32. 

154. Morris, D. The causation of pseudofemale and pseudomale be- 
haviour: a further comment. Behaviour, 1955, 8, 46-56. 

155. Morris, D. The reproductive behaviour of the Ten-spined Stickle- 
back (Pygosteus pungitius L.). Behaviour, 1958. Supplement No. 6. 

156. Morris, D. The feather postures of birds and the problem of the 
origin of social signals. Behaviour, 1956, 9, 75-113. 

157. Moynihan, M. Some displacement activities of the Black-headed 
Gull. Behaviour, 1953, 5, 58-80. 

158. Moynihan, M. Some aspects of reproductive behaviour in the 
Black-headed Gull (Larus ridibundus L.) and related species, Behaviour, 
1955. Supplement No. 4, 1-201. 

159. Moynihan, M., & Hall, M. F. Hostile, sexual and other social be- 
haviour patterns of the Spice Finch (Lonchura punctulata) in captivity. 
Behaviour, 1954, 7, 23-76. 

160. Nice, M. M. Some experiences in imprinting ducklings. Condor, 
1953, 55, 33-37. 


161. Nicolai, J. Zur Biologie und Ethologie des Gimpels. Z. Tierpsychol., 
1956, 13, 93-133. 

162. Nissen, H. W. Description of the learned response in discrimination 
behaviour. Psych. Rev., 1950, 57, 132. (See also Psych. Rev., 1952, 59, 161- 
167.) 

163. Nolte, A. Freilandbeobachtungen über das Verhalten von Macaca 
radiata in Südindien. Z. Tierpsychol., 1955, 12, 77-87. 

f 164. Pelkwijk, J. J. ter, & Tinbergen, N. Eine reizbiologische Analyse 
einiger Verhaltensweisen von Gasterosteus aculeatus L. Z. Tierpsychol., 
1937, 1, 193-204. 

_ 165. Picken, L. E. R. The study of minute biological structures and their 
significance in the organisation of cells. School Sci. Rev., 1955, 129, 262- 
268; 130, 332-338; 131, 30-37. 


166. Pilters, H. Untersuchungen über angeborene Verhaltensweisen bei 


Some Recent Trends in Ethology 607 


Tylopoden unter besonderer Berucksichtigung der neuweltlichen Formen. 
Z. Tierpsychol., 1954, 11, 213-303. 

167. Poulsen, H. Inheritance and learning in the song of the Chaffinch. 
Behaviour, 1951, 3, 216-228. 

168. Poulsen, H. A study of incubation responses and some other be- 
haviour patterns in birds. Vidensk. Medd. fra Dansk naturh. Foren., 1953, 
115, 1-131. 

169. Precht, H. Uber das angeborene Verhalten vom Tieren: Versuche 
an Springspinnen. Z. Tierpsychol., 1952, 9, 207-230. 

170. Prechtl, H. F. R. Zur Physiologie der angeborenen auslésenden 
Mechanismen. Behaviour, 1953, 5, 32-50. 

171. Prechtl, H. F. R. Zur Paarungsbiologie einiger Molcharten. Z 
Tierpsychol., 1951, 8, 337-348. 

172. Prechtl, H. F. R. Neurophysiologische Mechanismen des form- 
starren Verhaltens. Behaviour, 1956, 9, 243-319. 

173, Raber, H. Analyse des Balzverhaltens eines domestizierten Trut- 
hahns. Behaviour, 1948, 1, 237-266. 

174. Raber, H. Das verhalten gefangener Waldohreulen (Asio otus 
otus) und Waldkäuze (Strix aluco aluco). Behaviour, 1949, 2, 1-95. 
a 175. Ramsay, A. ©. Familial recognition in domestic birds. Auk, 1951, 

, 1-16. 

176. Ruiter, L. de. Some experiments on the camouflage of stick 
caterpillars. Behaviour, 1952, 4, 222-233. 

177. Ruiter, L. de. Countershading in caterpillars. Arch. Néerl. Zool., 
1955, 11, 285-341. 

178. Russell, E. S. Perceptual and sensory signs in instinctive behaviour. 
Proc. Linn. Soc. Lond., 1943, 154, 195-216. 

179. Russell, W. M. S. Experimental studies on the reproductive be- 
haviour of Xenopus laevis L. I. The control mechanisms for clasping and 
Fe aping and the specificity of hormone action. Behaviour, 1954, 7, 113- 

8. 


180. Russell, W. M. S., Mead, A. P., & Hayes, J. S. A basis for the 
Quantitative study of the structure of behaviour. Behaviour, 1954, 6, 153- 


181. Sauer, F. Die Entwicklung der Lautäusserungen vom Ei ab- 
schalldichtgehaltener Dorngrasmücken (Sylvia c. communis Latham) im 
vergleich mit später isolierten und mit wildleben Artgenossen. Z. Tierpsy- 
chol., 1954, 11, 10-93. 

182. Sauer, F. Uber das Verhalten junger Gartengrasmücken Sylvia 
borin. J. Ornithol., 1956, 97, 156-187. r 

183. Sauer, F., & Sauer, E. Zur Frage der nächtlichen Zugorientierung 
von Grasmücken. Rev. Suisse Zool., 1955, 62, 250-259. 

A 184. Schenkel, R. Ausdruck—Studien an Wölfen. Behaviour, 1947, 1, 
1-130, 

185. Schiller, P. H. Innate constituents of complex responses in primates. 

Psych, Rev., 1952, 59, 177-191. 


608 R. A. HINDE 


186. Schleidt, M. Untersuchungen über die Auslösung des Kollerns beim 
Truthahn (Meleagris gallapagos). Z. Tierpsychol., 1952, 11, 417-435. 

187. Schmidt, R. S. Termite nests—important ethological material. Be- 
haviour, 1955, 8, 344-357. 

188. Schneirla, T. C. A consideration of some conceptual trends in com- 
parative psychology. Psych. Bull., 1952, 49, 559-597. 

189. Scott, J. P., & Fredericson, E. The cause of fighting in mice and 
rats. Physiol. Zool., 1951, 24, 273-309. 

190. Seitz, A. Die Paarbildung bei einigen Cichliden. Z. Tierpsychol., 
1940, 4, 40-84; 1941, 5, 74-101. 

191. Seitz, A. Vergleichende Verhaltensstudien an Buntbarschen. Z. 
Tierpsychol., 1949, 6, 202-235. 

192. Seitz, A. Untersuchungen über angeborene Verhaltensweisen bei 
Caniden, Z. Tierpsychol., 1955, 12, 463-489, 

193. Sevenster, P. A quantitative study of “displacement” fanning in 
the Three-spined Stickleback as compared with parental fanning, in 
preparation. 

194. Sherrington, C. S. The integrative action of the nervous system. 
New York: Charles Scribners’ Sons, 1906. 

195. Simmons, K. E. L. The nature of the predator-reactions of breed- 
ing birds. Behaviour, 1952, 4, 161-172. 

196. Simmons, K. E. L. Some aspects of the aggressive behaviour of 
three closely related plovers (Charadrius). Ibis, 1953, 95, 115-127. 

197. Simmons, K. E. L. Studies on Great Crested Grebes. Avic. Mag., 
1951, 61. 

198. Spalding, D. A. Instinct, with original observations on young 
animals. MacMillan’s Magazine, 1873, 27, 282-293 (reprinted Brit. J. Anim. 
Behav., 1952, 2, 2-11). 

199. Spieth, H. T. Mating behaviour and sexual isolation in the 
Drosophila virilis species group. Behaviour, 1950, 3, 105-145. 

200. Spurway, H. Territory and evolution in Sticklebacks. Penguin new 
biology, 1953, 4, 33-43. 

201. Spurway, H., & Haldane, J. B. S. The comparative ethology of 
vertebrate breathing. I. Breathing in newts, with a general survey. Be- 
haviour, 1954, 6, 8-34. 

202, Stonehouse, B. The Emperor Penguin (Aptenodytes forsteri). 
Falkland Islands Dependencies Survey, Scientific Reports, No. 6, 1953. 

203. Stride, G. O. On the courtship behaviour of Hypolimnas misip pus 
L. (Lepidoptera, Nymphalidae). Brit. J. Anim Behav., 1956, 4, 52-68. 

204. Thorpe, W. H. Types of learning in insects and other arthropods. 
Brit. J. Psychol., 1943-44, 33, 220-234; 34, 20-31; 66-76. 

205. Thorpe, W. H. The definition of some terms used in animal be- 
haviour studies. Bull. Anim. Behav., 1951, 9, 34-40. 
eee Thorpe, W. H. The learning abilities of birds. Ibis, 1951, 93, 1-52; 

296. 

207. Thorpe, W. H. Some concepts of modern ethology Nature, 1954, 
174, 101-105. 


Some Recent Trends in Ethology 609 


208. Thorpe, W. H. The process of song-learning in the Chaffinch as 
studied by means of the sound-spectrograph. Nature, 1954, 173, 465-469. 

209. Thorpe, W. H. Learning and instinct in animals. London: 
Methuen, 1956. 

210. Tinbergen, N. Uber die Orientierung des Bienenwolfes (Philanthus 
triangulum Fabr.). Z. vergl. Physiol., 1932, 18, 305-335; 1935, 21, 699-716. 

211. Tinbergen, N. The behaviour of the Snow Bunting in spring. Trans. 
Linn. Soc. N.Y., 1939, 5, 1-94. 

212. Tinbergen, N. Die Ubersprungbewegung. Z. Tierpsychol., 1940, 4, 
1-10. 

213. Tinbergen, N. An objectivistic study of the innate behaviour of 
animals. Biblioth. Biotheor., 1942, 1, 39-98. 

214. Tinbergen, N. Social releasers and the experimental method re- 
quired for their study. Wilson Bull., 1948, 60, 6-52. 

215. Tinbergen, N. The hierarchial organization of the nervous mecha- 
nisms underlying instinctive behaviour. Sym. Soc. exp. Biol., 1950. Pp. 305- 
312. 

216. Tinbergen, N. The study of instinct. London: Oxford Univer. Press, 
1951. 

217. Tinbergen, N. Recent advances in the study of bird behaviour. 
Proc. Xth int. Ornithol. Congr. Uppsala, 1950, 360-374. ’ 

218. Tinbergen, N. Derived activites: their causation, biological sig- 
ee origin and emancipation during evolution. Quart. Rev. Biol., 1952, 

‚1-32. A 

219. Tinbergen, N. Social behaviour in animals. London: Methuen, 
1953, 

220. Tinbergen, N. The herring gull’s world. London: Collins, 1953. 

221. Tinbergen, N. The origin and evolution of courtship and threat 
display. In A. C. Hardy, J. S. Huxley, & E. B. Ford, (Eds.), Evolution as 
@ process. London: Allen & Unwin, 1954. ; 

222. Tinbergen, N. In B. Schaffner (Ed.), Transactions of the First 
Conference on Group Processes. New York: Macy Foundation, 1955. A 

223. Tinbergen, N., & van Iersel, J. A. A. “Displacement reactions” in 
the Three-spined Stickleback. Behaviour, 1947, 1, 56-63. : 

224. Tinbergen, N., & Kruyt, W. Uber die Orientierung des Bienen- 
wolfes, III. Zeits. vergl. Physiol., 1938, 25, 292-334. i 

225. Tinbergen, N., & Kuenen, D. J. Über die auslösenden und die 
tichtunggebenden Reizsituation der Sperrbewegung von jungen Drosseln. 
2. Tierpsychol., 1939, 3, 37-60. i 

226. Tinbergen, N., Meeuse, B. B. D., Boerema, L. K., & Varossieau, 
W. W. Die Balz des Samtfalters Eumenis (= Satyrus) semele (L.) Z. Tier- 
psychol., 1942, 5, 182-226. PEY X 

227. Tinbergen, N., & Perdeck, A. C. On the stimulus situation releasing 
the begging response in the newly-hatched Herring Gull chick (Larus a. 
argentatus Pont.) . Behaviour, 1950, 3, 1-38. s 
= 228. Toulmin, S. E. The philosophy of science. London: Hutchinson, 

53, 


608 R. A. HINDE 


186. Schleidt, M. Untersuchungen über die Auslösung des Kollerns beim 
Truthahn (Meleagris gallapagos). Z. Tierpsychol., 1952, 11, 417-435, 

187. Schmidt, R. S. Termite nests—important ethological material. Be- 
haviour, 1955, 8, 344-357. 

188. Schneirla, T. C. A consideration of some conceptual trends in com- 
parative psychology. Psych. Bull., 1952, 49, 559-597. 

189. Scott, J. P., & Fredericson, E. The cause of fighting in mice and 
rats. Physiol. Zool., 1951, 24, 273-309. 

190. Seitz, A. Die Paarbildung bei einigen Cichliden. Z. Tierpsychol, 
1940, 4, 40-84; 1941, 5, 74-101. 

191. Seitz, A. Vergleichende Verhaltensstudien an Buntbarschen. Z. 
Tierpsychol., 1949, 6, 202-235. i 

192. Seitz, A. Untersuchungen über angeborene Verhaltensweisen bei 
Caniden. Z. Tierpsychol., 1955, 12, 463—489. >. 

193. Sevenster, P. A quantitative study of “displacement” fanning In 
the Three-spined Stickleback as compared with parental fanning, in 
preparation. 

194. Sherrington, C. S. The integrative action of the nervous system. 
New York: Charles Scribners’ Sons, 1906. 

195. Simmons, K. E. L. The nature of the predator-reactions of breed- 
ing birds. Behaviour, 1952, 4, 161-172. y 

196. Simmons, K. E. L. Some aspects of the aggressive behaviour of 
three closely related plovers (Charadrius). Ibis, 1953, 95, 115-127. 

A Or Somon K. E. L. Studies on Great Crested Grebes. Avic. Mags 
51, 61. 

198. Spalding, D. A. Instinct, with original observations on young 
animals. MacMillan’s Magazine, 1873, 27, 282-293 (reprinted Brit. J. Anim. 
Behav., 1952, 2, 2-11). ; 

199. Spieth, H. T. Mating behaviour and sexual isolation in the 
Drosophila virilis species group. Behaviour, 1950, 3, 105-145. x 

200. Spurway, H. Territory and evolution in Sticklebacks. Penguin neu 
biology, 1953, 4, 33-43. f 

201. Spurway, H., & Haldane, J. B. S. The comparative ethology © 
vertebrate breathing, I. Breathing in newts, with a general survey. 2 
haviour, 1954, 6, 8-34. k N 

202. Stonehouse, B. The Emperor Penguin (Aptenodytes forsteri): 
Falkland Islands Dependencies Survey, Scientific Reports, No. 6; 122 5 

Spa: Stride, G. O. On the courtship behaviour of Hypolimnas mith 
L. (Lepidoptera, Nymphalidae). Brit. J. Anim Behav., 1956, 4 ds 
B „204. Thorpe, W. H. Types of learning in insects and other arthropO 

rit. J. Psychol., 1943-44, 33, 220-234; 34, 20-31; 66-76. «nal be 

205. Thorpe, W. H. The definition of some terms used in anima 
haviour studies. Bull. Anim. Behav., 1951, 9, 34-40. 


206. Thorpe, W. H. The learning abilities of birds. Ibis, 1951, 93, 1 
252-296. 054 
19% 


207. Thorpe, 
174, 101-105, 


5% 


W. H. Some concepts of modern ethology Nature, 


Some Recent Trends in Ethology 609 


208. Thorpe, W. H. The process of song-learning in the Chaffinch as 
studied by means of the sound-spectrograph. Nature, 1954, 173, 465-469. 

209. Thorpe, W. H. Learning and instinct in animals. London: 
Methuen, 1956. 

210. Tinbergen, N. Uber die Orientierung des Bienenwolfes (Philanthus 
triangulum Fabr.). Z. vergl. Physiol., 1932, 18, 305-335; 1935, 21, 699-716. 

211. Tinbergen, N. The behaviour of the Snow Bunting in spring. Trans. 
Linn. Soc. N.Y., 1939, 5, 1-94. 

212. Tinbergen, N. Die Ubersprungbewegung. Z. Tierpsychol., 1940, 4, 
1-10. 

213. Tinbergen, N. An objectivistic study of the innate behaviour of 
animals. Biblioth. Biotheor., 1942, 1, 39-98. 

214. Tinbergen, N. Social releasers and the experimental method re- 
quired for their study. Wilson Bull., 1948, 60, 6-52. 

215. Tinbergen, N. The hierarchial organization of the nervous mecha- 
nisms underlying instinctive behaviour. Sym. Soc. exp. Biol., 1950. Pp. 305- 
312. 

216. Tinbergen, N. The study of instinct. London: Oxford Univer. Press, 
1951. 

217. Tinbergen, N. Recent advances in the study of bird behaviour. 
Proc. Xth int. Ornithol. Congr. Uppsala, 1950, 360-374. 

218. Tinbergen, N. Derived activites: their causation, biological sig- 
ee, origin and emancipation during evolution. Quart. Rev. Biol., 1952, 
‚1-32. A 

219. Tinbergen, N. Social behaviour in animals. London: Methuen, 
1953. 

220. Tinbergen, N. The herring gull’s world. London: Collins, 1953. 

221. Tinbergen, N. The origin and evolution of courtship and threat 
display. In A. C. Hardy, J. S. Huxley, & E. B. Ford, (Eds.), Evolution as 
a process. London: Allen & Unwin, 1954. d 

222. Tinbergen, N. In B. Schaffner (Ed.), Transactions of the First 
Conference on Group Processes. New York: Macy Foundation, 1955. ; 

223. Tinbergen, N., & van Iersel, J. A. A. “Displacement reactions” in 
the Three-spined Stickleback. Behaviour, 1947, 1, 56-63. A 

224. Tinbergen, N., & Kruyt, W. Über die Orientierung des Bienen- 
wolfes. ITI. Zeits. vergl. Physiol., 1938, 25, 292-334. i 

225. Tinbergen, N., & Kuenen, D. J. Über die auslösenden und die 
richtunggebenden Reizsituation der Sperrbewegung von jungen Drosseln. 
Z. Tierpsychol., 1939, 3, 37-60. 

226. Tinbergen, N., Meeuse, B. B. D., Boerema, L. K., & Varossieau, 
W. W. Die Balz des Samtfalters Eumenis (= Satyrus) semele (L.) Z. Tier- 
Psychol., 1942, 5, 182-226. we A 

227. Tinbergen, N., & Perdeck, A. C. On the stimulus situation releasing 
the begging response in the newly-hatched Herring Gull chick (Larus a. 
argentatus Pont.) . Behaviour, 1950, 3, 1-38. A F 
Toulmin, S. E. The philosophy of science. London: Hutchinson, 


610 R. A. HINDE 


229. Uexküll, J. von. Umwelt und Innenwelt der Tiere. Berlin: 
Springer, 1909. 

230. Verwey, J. Die Paarungsbiologie des Fischreihers (Ardea cinerea 
L.). Verh. VI int. Ornithol. Kong., Kopenhagen, 1926, 390-413. 

231. Verplanck, W. S. Burrhus F. Skinner. In Estes, W. K., et al., 
Modern learning theory. New York: Appleton-Century-Crofts, 1954. 

232. Vowles, D. M. The orientation of ants. I and II. J. exp. Biol., 1955, 
31, 341-355; 356-375. 

233. Vowles, D. M. The foraging of ants. Brit. J. Anim. Behav., 1955, 
3, 1-13. 

234. Weidmann, U. Some reproductive activities of the Common Gull, 
Larus canus L. Ardea, 1955, 43, 85-131. 

235. Whitman, C. ©. The behavior of pigeons. Publications, Carnegie 
Inst. Wash., 1919, 257, 1-161. 

236. Wickler, W. Das Fortpflanzungsverhalten des Keilflickbarbe Ras- 
bara heteromorpha. Z. Tierpsychol., 1955, 12, 220-228. 

237. Zippelius, H. M. Die Paarungsbiologie einiger Orthopteren-arten. 
Z. Tierpsychol., 1949, 6, 372-390. 


INFORMATION THEORY


F. C. FRICK 
Lincoln Laboratory, Massachusetts Institute of Technology 


Introduction

The Basic Theory

The Application of Information Theory in Behavioral Studies {7-9+}

References

INTRODUCTION 


Information theory is a formal—as opposed to substantive—theory. 
It could be regarded as simply an extension of correlation theory. It is 
not a model of behavior, not even communicative behavior, but rather 
a tool that may be used in the construction of such models. 

In barest outline, the theory consists of a measure, defined for any 
“message,” where messages are considered to be the outputs of a Markoff 
process. The theory justifies the particular measure chosen in that it 
permits us to establish equivalences among messages (the “coding 
theorem”) and in that it permits us to establish a limit to the degree 
of association, or correlation, that may be achieved between messages 
(the “channel theorem”). It states nothing about the meaning or value 
of a message, nor how or why messages are correlated or transmitted. 
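
The measure itself is defined later in the chapter. As a minimal sketch, assuming it is the familiar Shannon quantity H = -sum(p log2 p) in bits, the following shows how such a measure might be computed for a source with known symbol probabilities. The function names (`entropy_bits`, `empirical_distribution`) are hypothetical, and the frequency estimate treats symbols as independent, so it ignores the sequential dependencies that a Markoff process may have.

```python
import math
from collections import Counter


def entropy_bits(probabilities):
    """Shannon entropy H = -sum(p * log2 p), in bits.

    Zero-probability outcomes are skipped, since they contribute nothing.
    """
    return -sum(p * math.log2(p) for p in probabilities if p > 0)


def empirical_distribution(message):
    """Relative frequency of each symbol in an observed message.

    Assumption: symbols are treated as independent draws (order ignored).
    """
    counts = Counter(message)
    total = len(message)
    return {symbol: n / total for symbol, n in counts.items()}


# Four equally likely alternatives versus four unequally likely ones.
uniform = [0.25, 0.25, 0.25, 0.25]
skewed = [0.5, 0.25, 0.125, 0.125]

h_uniform = entropy_bits(uniform)
h_skewed = entropy_bits(skewed)

print(f"uniform source: {h_uniform:.2f} bits per symbol")
print(f"skewed source:  {h_skewed:.2f} bits per symbol")
```

On this reading, the skewed source carries less than the two bits per symbol that a fixed-length code for four alternatives would spend, which is the kind of equivalence between messages that the coding theorem makes precise.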

Strictly speaking, the structure of the theory is laid out in the mathe- 
matical proofs of the central theorems, and because of its formal nature, 
it cannot readily be warped into an analytic rubric of the sort that might 
be used in the exposition of substantive theories. The “independent- 
intervening-dependent variable” schema is not only inappropriate to the 
analysis of formal theory but may be misleading. Such a treatment
inevitably focuses attention on the observational basis for the theory and
is, in fact, specifically designed to expose the translations between ex- 
perimental data and the theoretical model. Although a formal theory 
often develops out of experimental research or the examination of some 
class of physical phenomena, such a theory is no more validated by
experiment than Cartesian coordinates are validated by the data which are
plotted on them. In short, formal theories do not have empirical pre- 
dictive value, nor do they suggest hypotheses other than the original 
hypothesis that this particular organization of the data may be interest- 
ing, i.e., may lead to substantive theory. 

Why then should information theory be included in a volume devoted 
to the analytic exposition of psychological theory? One reason, of course, 
is that the distinction between formal and substantive theory is not 
always obvious. Information theory, in particular, has suffered from 
those (not only psychologists) who have interpreted it as a substantive 
model and have, for instance, been able to confound thermodynamics 
and the statistical structure of language [1]. Of more importance, how- 
ever, is the fact that formal theories very often have an effect outside 
their immediate province. To borrow an expression from Bush and 
Mosteller [2], they serve as models for experiments rather than models 
of the organism, and in this way they have a profound, though indirect, 
influence on the behavioral model which may be developed from the 
experimental data. 

This is to say more than simply that the use of the information meas- 
ure entails certain assumptions about the data, or that it places certain 
constraints on experimental design. Clearly it does this, even as the use 
of coordinate theory requires ratio scales of measurement for the variables 
considered. And the assumptions that are involved in the use of informa- 
tion measure are pointed out as clearly as possible in the exposition of the 
theory which follows. In addition, however, the information-theoretic
approach reflects an attitude, or opinion, about the subject matter to 
which it is directed. This viewpoint is, perhaps, not immediately apparent 
from an exposition of the formal model and is worth summarizing here. 

Information theory is one item in an entire context of related theories 
devoted to the description and analysis of time series, or stochastic proc- 
esses. An informational analysis is appropriate, therefore, only where it is 
believed that the process under investigation is intrinsically statistical. 
Historically, it was the prior recognition of communication as a statistical 
process in this sense that led to the development of information theory. 
As a belief, with respect to psychological experimentation, it is important 
because it necessitates a consideration of the temporal sequence of events 
and the sampling space, or context, in which behavior occurs. It leads 
us away from the immediate stimulus and observed response and
demands that we consider the set of alternative possible stimuli and
responses. The usual S-R functional R = f(S) is interpreted as a statistical
distribution function and the algorithm of differential equations is no 
longer appropriate to the description of the data. 
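
To illustrate what it could mean to read R = f(S) as a distribution rather than a deterministic function, here is a small sketch that estimates the conditional probability of each response given each stimulus from a list of trials. The trial data, the stimulus and response labels, and the function name `conditional_distribution` are invented for the illustration.

```python
from collections import Counter, defaultdict


def conditional_distribution(pairs):
    """Estimate P(response | stimulus) from observed (stimulus, response) pairs."""
    counts = defaultdict(Counter)
    for stimulus, response in pairs:
        counts[stimulus][response] += 1
    return {
        stimulus: {
            response: n / sum(response_counts.values())
            for response, n in response_counts.items()
        }
        for stimulus, response_counts in counts.items()
    }


# Hypothetical trials: the same stimulus does not always yield the same response.
trials = [
    ("tone", "press"), ("tone", "press"), ("tone", "press"), ("tone", "wait"),
    ("light", "wait"), ("light", "wait"), ("light", "wait"), ("light", "press"),
]

p_r_given_s = conditional_distribution(trials)
for stimulus, distribution in p_r_given_s.items():
    print(stimulus, distribution)
```

Each stimulus is associated with a set of alternative responses and their relative frequencies, so the description of the data is a table of probabilities over the whole set of alternatives, not a single-valued curve.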

It is possible, of course, to regard the variables of behavior as random,
in the statistical sense, without adopting the particular formalism of in- 
formation theory. This, for instance, is the point of view adopted by 
Estes, Bush and Mosteller, Cronbach, and others who are in no sense 
information theorists. However, the existence of information theory as 
a ready-made formal model of considerable generality has stimulated this 
sort of approach to a number of physical and psychological phenomena. 
It has, in fact, been argued [see 11] that this indirect contribution may 
be the most important consequence of the theoretical development. 

Aside from the assumption of a statistical process, the viewpoint of 
information theory reflects a somewhat more subtle bias. Formal theories 
are often motivated by the study of some class of physical events, and 
information theory, in particular, grew directly out of the investigation of 
electronic communication systems. As is essential to the formulation of 
any theory, the development proceeded by the abstraction of some prop- 
erty of the process under consideration. In the present case, what was 
considered to be the significant feature of the process was the difficulty 
of communicating. Thus, the coding theorem establishes an equivalence 
between messages in terms of the length of a standard message. The 
channel theorem sets limits on the degree of correlation that can be 
achieved between messages (the input and output of the channel) as a 
function of the channel characteristics. In short, the formalism of the 
theory is directed at a determination of the efficiency of communication, 
and the application of information theory to psychological data implies 
an interest in the efficiency, rather than the structure, of the process 
under study. The theory is, in this sense, normative. 

A normative approach to behavioral processes tends to discount the 
usual schema for categorizing behavior. For example, the distinctions 
that are often made among various types of learning—motor, verbal, 
concept formation, conditioning, etc.—do not seem particularly relevant 
in a context that examines how closely the subject approaches the maxi- 
mum possible correlation between one random variable, the stimulus 
process, and another random variable, the response process. The interest 
is in how the subject selects and codes the available stimuli, how effec- 
tively he processes his inputs, rather than the particular form in which 
the input is presented. Here, too, however, it should be emphasized that 
such a viewpoint does not necessarily imply adoption of the formalism of 
information theory. Most of the normative studies of behavior, investiga- 
tions of decision making, the strategies of concept attainment, and the 
efficiency of various training procedures have not, in fact, employed
information theory.

The point to be made is that the use of information theory entails a 
systematic position: the viewpoint that behavior is stochastic and that
the efficiency of the behavioral process is a significant aspect for study.
The converse is not true, and we may adopt the viewpoint without the 
verbal or mathematical trappings of information theory. 

Unfortunately it is still too early to evaluate the effects on psychology 
of information theory itself, or the conceptual position which it repre- 
sents. The present author’s biases in this respect should, however, be 
stated. Essentially they are: (1) the viewpoint represented by the theory 
is fruitful and important, and (2) the mathematical formalism does not 
lend itself to uncritical, mechanical application. Even if the data are 
believed to meet the mathematical requirements, the computation of 
information transmission, say, does not in itself insure any deeper under- 
standing of the process under study. This is, of course, characteristic of 
formal theory in the sense that we have been using the term; it is, how- 
ever, a point that may be disguised by the computational complexities 
involved. 


THE BASIC THEORY 


The insight that led to the development of information theory was 
the realization that all the processes which might be said to convey in- 
formation are basically selection processes. Speech, for example, can be 
regarded as a sequence of selections from a number of possible choices— 
the phonemes, letters, or words of the language. Alternatively, we need 
information only when we are faced with a choice of some sort. If I know 
the road to Boston, I do not need a route sign at the intersection. In the 
same sense, a completely determinate process would not be regarded as 
an information source. For as it moves through its appointed course, I 
can predict its motions in advance, and I receive no new information 
when it actually carries out its function. 

Thus, information and ignorance, choice, prediction, and uncertainty 
are all intimately related. On the other hand, complete ignorance or 
indeterminance also precludes information transmission. A lecture in 
German is not informative to a listener who does not understand Ger- 
man. There must be some degree of agreement, some sort of common 
language established between the information source and the receiver. 
Put somewhat more precisely: information processes are selection proc- 
esses, but these selections must be made from a specific set of alternatives, 
and if the sequence of selections is to convey information, the possible 
choices must be known to the receiver. To say “yes” or “no” to someone 
who has not asked a question is not informative in any usual sense of the 
word. It is this restriction on the set of alternatives that makes it possible 
to speak precisely and quantitatively about information. 

Within these bounds of complete knowledge and complete ignorance,
it seems intuitively reasonable to speak of degrees of uncertainty. The
wider the choice, the larger the set of alternatives open to us, the more
uncertain we are as to how to proceed—the more information we require 
in order to make our decision. Thus, color alone will distinguish an apple 
from an orange, but we will have to know more than just color if we 
are to distinguish among apples, oranges, grapefruit, and lemons. 

In much the same way, the informational value of a choice seems also 
to depend upon the likelihood of that choice. If we feel certain of what 
a man is going to say, it is not very informative when he says it. If a 
process is highly predictable, we need only a few observations to estab- 
lish its present state quite accurately. Our uncertainties are intimately 
tied up with probability estimates and if we are to fit our intuitive notions 
regarding information, we must consider not only the range of choices 
available but the probabilities associated with each. This reflects the sec- 
ond basic insight behind the development of information theory. We may 
summarize the position by stating two premises: (1) Information is
associated with a selection process. (2) Such a process is basically
statistical in the sense that it involves probability considerations.

From these premises, it has been possible to develop a measure of 
amount of information and a set of limit theorems which justifies the 
particular measure chosen. The ultimate justification of the theory will 
lie in the extent to which it gives precision to our intuitive notions re- 
garding information and information transmission and opens the way to 
a clearer understanding of the basic processes involved.

Accordingly, we will start our discussion of information theory by 
defining a unit of information which we will state to be the amount of 
information associated with a selection between two equally likely
alternatives. A binary choice is clearly the minimum condition for any
information transmission at all, and the maximum uncertainty, and hence
the maximum information associated with such a situation, would seem
to exist when the alternatives are equally likely. This amount of
information we will term one “bit,” and if we use the symbols H(N) to
stand for the amount of information associated with a choice among N
equally likely alternatives we can write:

H(2) = 1 bit


Consistent with our intuitions, we shall also assume that n independent
selections of this type shall constitute n bits of information.

Our first problem will then be to extend this definition to selections
among any finite number of alternatives. There are several ways in which
we could proceed with such an extension but we shall, in general, follow
Fano’s development [8, 9]. This is mathematically less elegant than
Shannon’s [32], but it exhibits the important concept of “coding” in a
clearer fashion.

We shall begin by considering a set of N elements, where N is some 
power of 2. It is very easy to demonstrate that any element of this set can 
be encoded into a sequence of independent selections from a set of two 
equally likely choices. That is, we can set up a one-to-one correspondence 
between each element of our original set and a set of binary numbers. 
Each binary digit will carry one bit of information according to our 
definition. The number of such selections required will equal the infor- 
mational value in bits associated with each element in our original set. 
This coding procedure, by which we establish a correspondence between 
a given set of elements and a set of known informational value, is, as we 
shall see, basic to the development and understanding of information 
theory. 

To illustrate the procedure, let N = 8, and arrange these eight
alternatives in a row, the order of arrangement being arbitrary. We can
now divide this set into two equal subsets, assigning the number 0 to one
and the number 1 to the other. We now further divide each subset into
two equal subsets, again assigning binary digits as before. If we now
halve these subsets and assign binary digits we will have associated with
each alternative in our original set, a unique three-digit binary number
as shown in Fig. 1.

Fig. 1. Simple binary coding procedure (alternatives 1 to 8; 1st, 2nd,
and 3rd division).

Thus, we see that a selection from N alternatives can be indicated by a
sequence of selections from a smaller number of alternatives. In general,
where $N = 2^n$, the number n of binary digits required so as to uniquely
represent each element of N will be $\log_2 N$. Accordingly, we may write:

$$H(N) = \log_2 N$$


which is the amount of information, in bits, associated with the selection 
of one out of a set of N equally probable alternatives. 
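The halving procedure just described can be sketched in a few lines of Python. This is only an illustration under the text's own assumption that the number of alternatives is a power of 2; the function name `halving_code` is ours and is not part of the original development.

```python
from math import log2

def halving_code(alternatives):
    """Assign binary digits by repeated halving (illustrative helper).

    Assumes the number of alternatives is a power of 2, as in the text.
    """
    items = list(alternatives)
    codes = {a: "" for a in items}

    def split(group):
        if len(group) == 1:
            return
        half = len(group) // 2
        for a in group[:half]:
            codes[a] += "0"   # first subset receives the digit 0
        for a in group[half:]:
            codes[a] += "1"   # second subset receives the digit 1
        split(group[:half])
        split(group[half:])

    split(items)
    return codes

codes = halving_code(range(1, 9))
for alternative, code in codes.items():
    print(alternative, code)
print("digits per alternative:", len(codes[1]), "log2(8):", log2(8))
```

Each of the eight alternatives receives a distinct three-digit binary number, in agreement with H(8) = 3 bits.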

Strictly speaking, however, this result will hold only if N is a power 
of 2, in which case H is an integer. What Shannon has been able to do 
is to develop another information function with no such restriction on N. 
In order to accomplish this, however, we must restrict our definition of 
information as applying only to the average of a large number of
selections, and in so doing the basically statistical nature of
information becomes apparent. As we are seldom, if ever, interested in any
single, or unique, selection from a set of choices this restriction will
not bother us.
It does, however, put us in the realm of probability theory, and any 
application of information theory must be made with an awareness of 
the basic assumptions underlying such a probability model. 

Consider, for example, the case where N = 3. We encode into binary
selections as before:

    N    Binary code
    1    11
    2    10
    3    01

It is clear that alternative 3 requires only the first digit to specify it
uniquely. Intuitively, the number of bits associated with N lies
somewhere between 1 and 2. At this stage, however, it is not clear what
meaning can be given to a fraction of a selection.

Accordingly, let us consider long sequences made up of n successive
selections from some set of N alternatives. A sample of written English,
where the alternatives consist of the letters of the alphabet, would be an
example. To simplify our description at this point, we will, however,
assume that, unlike English sequences, the selections are independent and
equally likely, such sequences as might be generated by successive rolls
of a die.

The set of sequences thus formed we will call an ensemble of messages,
and the number of different messages in the ensemble will be:

$$M = N^n$$

We can encode this number into binary digits, but unless N is a power of
2, the length of the binary sequence required will, as before, depend on
the particular message selected. However, it now becomes meaningful to
ask how much information is associated with each symbol in the message,
on the average. This can be shown to equal $\log_2 N$ as follows:

Let B(M) equal the number of binary, equally likely selections required
to represent each message. Then

$$B(M) = \log_2 M + d$$

where d is a number, smaller than unity, required to make B(M) an
integer. If N is a power of 2, d = 0. In general, however, d will depend
on the particular message or sequence selected. The average amount of
information per selection, H(N), will equal the amount of information
in the message, divided by the number of selections in the message:

$$H(N) = \lim_{n \to \infty} \frac{1}{n} \left( \log_2 N^n + d \right)$$

Since N is a constant and d is smaller than unity, this equation yields:

$$H(N) = \log_2 N$$


In words, the amount of information associated with each selection 
of a sequence of selections from N equally likely alternatives will, on 
the average, equal $\log_2 N$. Alternatively, independent selections from N
equally likely alternatives can be represented by sequences of binary
digits, the average length of which will, for sufficiently long messages,
equal $\log_2 N$.
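The approach to the limit can be checked numerically. The sketch below makes a simplifying assumption that the text does not: every message of length n receives a binary number of the same length, so that B(M) is the smallest integer not less than $\log_2 M$ for all messages alike. The function name `digits_per_selection` is hypothetical.

```python
from math import log2

def digits_per_selection(N, n):
    """Binary digits per selection when each of the N**n messages of
    length n is given a fixed-length binary number (an assumption)."""
    M = N ** n                   # number of messages in the ensemble
    B = (M - 1).bit_length()     # smallest integer B with 2**B >= M
    return B / n

for n in (1, 2, 5, 10, 100, 1000):
    print(n, round(digits_per_selection(3, n), 4))
print("log2(3) =", round(log2(3), 4))
```

Because d is smaller than unity, the figure for messages of length n can exceed $\log_2 N$ by less than 1/n, which is why the printed values settle toward $\log_2 3$.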

It remains for us now to remove the remaining restrictions on the 
measure we have defined. We shall first consider the case where the 
selections, though independent, are not equally likely. In order to deal 
with this case, it will clearly be necessary to introduce probability dis- 
tributions over the set of alternatives. 

We begin by defining a source S which makes selections (with
replacement) from a finite set of alternatives. For the time being, we shall
assume that these selections are independent; i.e., the present selection 
is in no way influenced by previous selections. Such a source will gen- 
erate a sequence of symbols representing the selections made, and it will 
be necessary to assume that the series of symbols thus generated is 
ergodic. This is to say that the statistical parameters characterizing the 
selective process do not vary with time. If we let p(i) be the average 
frequency with which the symbol i turns up, this average will be ap- 
proached to any small error in samples taken anywhere in the series—if 
the samples are sufficiently long or sufficiently numerous. Alternatively, 
we can say that the source is statistically stationary. 

Assuming such a source S we will now introduce some necessary
notation. Let k be the number of mutually exclusive alternatives from
which the source makes its selections. Then p(i), where i = 1, 2, ..., k,
is the probability that the symbol i will be selected and p(ij)
the probability that the symbols i and j are selected in that order. Such a
source will generate an ordered sequence of symbols $i_1 i_2 \cdots i_n$,
with a probability of occurrence $p(i_1 i_2 \cdots i_n)$. Our assumption
of independence in selection means that for every n and every possible
sequence $i_1 i_2 \cdots i_n$

$$p(i_1 i_2 \cdots i_n) = p(i_1)\,p(i_2) \cdots p(i_n)$$

The source is thus completely specified by the set of probabilities
$p(1), p(2), \ldots, p(k)$ over the k selections available to S. What is
required is a function that will assign a number to such a probability
distribution in a fashion analogous to the way in which we have previously
assigned an information measure.


A plausible way of accomplishing this task is simply to extend our
preceding definition in the following manner. Consider any event of
probability p = 1/n; we may treat this event as one among n equally
likely alternatives, and the amount of information associated with its
selection is

$$\log_2 n = \log_2 \frac{1}{p} = -\log_2 p$$


If we assume that probability is a continuous function over the set 
of alternatives, we may extend this definition to irrational p values, by 
using an approximation by rationals. Thus we will assign to the selection
of symbol i, which occurs with probability p(i), an information value
of $-\log_2 p(i)$ bits. Clearly, alternatives of differing probability will
have different information values with less information attributed to 
higher probability alternatives, thus fitting our intuitive notion that the 
unexpected event is somehow more informative than the relatively cer- 
tain occurrence. 

Each selection generated by the source S will, in general, have a 
different informational value. As before, we consider long sequences of 
selections (messages) and under these conditions the average, or
expected, value of the function $-\log_2 p$ becomes meaningful. This value
we can obtain by summing over the amount of information associated
with each selection, weighted by the probability that the selection will
occur. Accordingly, we define the average amount of information
associated with any selection to be

$$H = -\sum_{i=1}^{k} p(i) \log_2 p(i)$$

which is the expected value of the function $-\log_2 p$.
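As a minimal sketch of this definition, the two functions below compute the information value of a single selection and the average H over a distribution. The names `information_value` and `entropy`, and the sample distributions, are ours; alternatives of zero probability are skipped, on the usual convention that they contribute nothing to the sum.

```python
from math import log2

def information_value(p):
    """Information, in bits, assigned to a selection of probability p."""
    return -log2(p)

def entropy(probs):
    """Average information H = -sum p(i) log2 p(i), in bits."""
    return sum(p * information_value(p) for p in probs if p > 0)

print(information_value(1 / 8))      # one of eight equally likely alternatives
print(entropy([0.5, 0.5]))           # the binary, equally likely case
print(entropy([0.25] * 4))           # four equally likely alternatives
print(entropy([0.9, 0.1]))           # an unequal binary choice (hypothetical)
```

For equally likely alternatives the function reduces to $\log_2 N$, so that the earlier measure appears as a special case of H.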

It remains for us to extend this definition to the most general case,
where no assumption is made regarding the independence of the source
selections. Here, we consider the selection of one symbol, i, from the set
S = 1, 2, . . . , k followed by a second selection j. Where before we
assumed that p(ij) = p(i)p(j), we will now assume that in general,
$p(ij) \neq p(i)p(j)$.

In a sense, this case is subsumed under our assumption of an
ergodic source. The statistical properties of such a source are completely
defined by the set of p(i), and as the selections can be selections of
single symbols, or groups of symbols (for example, single letters, or words,
or texts of any length), the set of p(i), in this general sense, already takes
into account such correlations as may exist between the more elementary
symbols composing the selection i. Furthermore, any correlations that do
exist must fade out over sufficiently long messages, as they cannot have
any influence on the averages of shorter messages, picked at random. In
short, the ergodic assumption permits us, with only a notational difference
in the p(i), to extend our definition of H to those cases where the
source selections are not independent.

Such highhanded treatment, however, serves only to indicate the 
breadth of the assumption of statistical homogeneity. Because most 
sources of interest are not independent, it is worth examining this case in 
somewhat more detail. 

As noted above, we are concerned with sequences exhibiting internal
dependencies. This is to say that the current choice will be affected by
the immediately preceding choice, or choices. Such a situation is most
easily described in terms of conditional probabilities: the probability of
a particular choice j, given the immediately preceding choice i. The joint
probability, p(ij), can be written:

$$p(ij) = p(i)\,p_i(j)$$

In words, the probability of a sequence ij is equal to the probability of
the occurrence of i times the conditional probability of j, given i. If
selections are independent

$$p_i(j) = p(j) \quad \text{and} \quad p(ij) = p(i)\,p(j)$$

Following any given selection i, the situation will differ from the
independent case only in that the information value will depend on the
conditional probabilities $p_i(j)$ rather than the unconditional p(j).
Accordingly, we can write:


$$H_i(y) = -\sum_{j=1}^{k} p_i(j) \log_2 p_i(j)$$

where $H_i(y)$ is the amount of information associated with the selection
made whenever the system is in the state of having just selected i. We can
expect to be in state i with a probability p(i), so in a long sequence of
choices we will have $p(i) H_i(y)$ bits of information associated with the
state i. If, as before, we take the expected value over all the states of the
system we get:

$$H = \sum_{i=1}^{k} p(i) H_i(y) = -\sum_{i=1}^{k} \sum_{j=1}^{k} p(i)\,p_i(j) \log_2 p_i(j)$$

which is the average amount of information per symbol of the sequence.
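The computation may be illustrated for a source whose present choice depends only on the immediately preceding one. The two-symbol source below is entirely hypothetical: the conditional probabilities in `P` and the state probabilities in `p` are chosen for illustration, with `p` selected so that it is reproduced by one further selection.

```python
from math import log2

def state_entropy(row):
    """H_i(y) for one state: row[j] is the conditional probability p_i(j)."""
    return sum(-q * log2(q) for q in row if q > 0)

def source_entropy(p, P):
    """H = sum over states i of p(i) * H_i(y), in bits per symbol."""
    return sum(p_i * state_entropy(row) for p_i, row in zip(p, P))

# Hypothetical conditional probabilities p_i(j): rows are the preceding
# choice i, columns the present choice j.
P = [[0.9, 0.1],
     [0.5, 0.5]]
# State probabilities p(i), chosen so that they satisfy p = pP.
p = [5 / 6, 1 / 6]

H_dependent = source_entropy(p, P)
H_independent = state_entropy(p)   # same p(i), but selections independent
print(round(H_dependent, 4), round(H_independent, 4))
```

In this example the dependent source yields less information per symbol than an independent source with the same p(i), since the preceding choice already reduces our uncertainty about the present one.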

It should be noted that this definition continues to hold where de- 
pendencies extend over more than one symbol. In this case, the summa- 
tion must simply be extended to cover all possible sequences of length 
n preceding the choice j, hence, the earlier reference to i as the “state” 
of the system, rather than the immediately preceding choice. 




At this point, it is probably worth looking back over what we have 
accomplished. We have defined, on an intuitive basis, a unit of informa- 
tion, the “bit,” and we have indicated how any set of alternatives can be 
mapped into sequences of one-bit selections. In addition, we have
defined a measure H for any set of events whose probabilities
$p_1, p_2, \ldots, p_k$ are known. This measure is, in fact, simply the
mean logarithmic probability of the events or symbols considered. The
justification for terming this measure an “amount of information” is given
in what has
been called Shannon’s Fundamental Coding Theorem, in which it is 
stated that messages of an ensemble having an average information value 
H can be encoded into not less than H binary, equally likely symbols. 
Furthermore, this number can be approached as closely as desired by 
encoding together increasingly long sequences of messages. 

We shall not give the proof of this theorem. However, its plausibility 
is evident from the following coding procedure, developed independently 
by Shannon [32] and Fano [8]. The procedure parallels the scheme used 
before. In this case, however, we cannot partition the ensemble of
messages into equal sets, as the messages are of unequal probability, and our
selections of sets and subsets would no longer be selections between
equally likely alternatives; i.e., the selections would no longer meet our
definition of a bit. The trick now is to make the probability of each
subset as nearly equal as possible, rather than to equate the number of
messages in each.

To accomplish this, we consider all messages of length n and arrange 
them in order of decreasing probability. We divide this collection into 
two sets of as nearly equal probability as possible and assign binary 
digits as before. In a similar fashion we divide the two sets into equally 
probable subsets and continue the process until each subset contains only 
one message. This is, of course, an approximation procedure, the success 
of which will depend on how closely we can equalize the probabilities 
between subsets. It should be intuitively clear, however, that as we
consider longer messages our accuracy can be improved as the probability of
each message approaches zero.

It is perhaps surprising, and of considerable importance in practical
applications of information theory, how good an approximation we can
obtain with very short messages. The following example illustrates this
point, as well as the coding procedure.

Consider a source which generates messages by independent selections
among three alternatives, A, B, C, with probabilities: p(A) = .5,
p(B) = .3, p(C) = .2. By our definition of H, the average information
associated with each selection will equal:

$$-(.5 \log_2 .5 + .3 \log_2 .3 + .2 \log_2 .2) = 1.486 \text{ bits}$$

If we consider messages of length two, we can arrange them, in order
of decreasing probability, p(i), and encode as follows:

    Message   p(i)   Recoded message   p(i) × (no. of binary digits)
    AA        .25    00                .50
    AB        .15    010               .45
    BA        .15    011               .45
    AC        .10    100               .30
    CA        .10    101               .30
    BB        .09    1100              .36
    BC        .06    1101              .24
    CB        .06    1110              .24
    CC        .04    1111              .16
                                       3.00 (Av. bits per message)


In the recoded version, the average information per selection = 1.5
bits, contrasted with the 1.486 bits computed by the equation for H. If
we recode into messages of length three, the average information per
selection works out to be 1.496 bits.²
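The coding procedure for this example can be sketched as follows. This is one possible reading of the procedure, not a definitive implementation: at each step the ordered list is cut at the point that makes the two subsets most nearly equal in probability, and where two cuts are equally good the earlier one is taken (an assumption of ours). Exact fractions are used so that such comparisons are not disturbed by rounding; the function name `shannon_fano` is ours.

```python
from fractions import Fraction
from itertools import product
from math import prod

def shannon_fano(items):
    """items: list of (message, probability) in order of decreasing
    probability. Returns a dict mapping each message to a binary code."""
    if len(items) == 1:
        return {items[0][0]: ""}
    total = sum(p for _, p in items)
    best_k, best_diff, running = 1, None, 0
    for k in range(1, len(items)):
        running += items[k - 1][1]
        diff = abs(running - (total - running))
        if best_diff is None or diff < best_diff:   # earliest cut wins ties
            best_k, best_diff = k, diff
    codes = {}
    for digit, part in (("0", items[:best_k]), ("1", items[best_k:])):
        for message, code in shannon_fano(part).items():
            codes[message] = digit + code
    return codes

p = {"A": Fraction(5, 10), "B": Fraction(3, 10), "C": Fraction(2, 10)}
n = 2   # message length
messages = [("".join(m), prod(p[s] for s in m))
            for m in product("ABC", repeat=n)]
messages.sort(key=lambda item: -item[1])   # stable: ties keep listing order

codes = shannon_fano(messages)
avg_len = sum(prob * len(codes[msg]) for msg, prob in messages)
for msg, prob in messages:
    print(msg, float(prob), codes[msg])
print("bits per message:", float(avg_len), "per selection:", float(avg_len / n))
```

With n = 2 the sketch yields an average of 3 binary digits per message, or 1.5 per selection. Setting n = 3 permits a comparison with the figure quoted for messages of length three, though the result may depend on how equally good cuts are resolved.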

The coding theorem essentially states that all message ensembles hav- 
ing the same information measure are equivalent for coding purposes. 
The importance of this result will be apparent when we consider the 
problem of information transmission and the applications of information 
theory. 

In the preceding discussion, we have gone to some length to present 
the concept of selective information as defined by information theory. 
We have done this primarily to emphasize the difference between the 
technical usage of the term and our ordinary conversational usage. “In- 
formation” as used in information theory is a measure of the size of a 
message set. It establishes equivalences among message sets with respect 
to coding, but it says nothing about the value, importance, or conse- 
quences of the message. 

The central result of the theory has been a set of limiting theorems 
regarding the transmission of information, and the concept of informa- 
tion transmission has been applied extensively to psychological problems. 
Before taking this up, however, we will state two theorems concerning H. 
These establish intuitively reasonable properties of the measure, and their 
proofs may be found elsewhere [32] by those interested in proofs. 

² This coding is not always so well behaved and may approach the limiting
value $-\sum p(i) \log_2 p(i)$ in an oscillatory fashion.

1. $H \geq 0$ always and H = 0 only if all p(i) except one are zero.

This simply says that the measure is nonnegative and that no
information is involved when the selection is certain.

2. The maximum value of H is $\log_2 N$ and this is achieved only
when all the p(i) are equal.

According to this theorem, the amount of information associated with 
a sequence of independent binary selections is a maximum when the 
selections are equally likely. Conversely, it will always be possible to 
represent a sequence of binary, not equally likely selections with a se- 
quence which will on the average be shorter. It follows that no binary 
representation of a message can be made with a number of selections 
smaller than the amount of information associated with that message. 
Consequently, the average amount of information associated with a 
message equals the minimum number of independent, binary, equally 
likely selections (bits) required to specify that message. By much the 
same reasoning it is possible to show that a larger number of selections 
will be required, if the selections are not independent. 
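The two theorems are easily checked on particular cases. The sketch below does no more than that; the three distributions over N = 3 alternatives are hypothetical examples, and a check of this kind is of course no substitute for the proofs.

```python
from math import log2

def entropy(probs):
    """H = -sum p(i) log2 p(i), in bits; zero-probability terms are skipped."""
    return sum(-p * log2(p) for p in probs if p > 0)

N = 3
examples = {
    "certain":        [1.0, 0.0, 0.0],      # all p(i) but one are zero
    "unequal":        [0.5, 0.3, 0.2],
    "equally likely": [1 / 3, 1 / 3, 1 / 3],
}
for name, probs in examples.items():
    print(name, round(entropy(probs), 4))
print("log2(N) =", round(log2(N), 4))
```

The certain selection gives H = 0, the equally likely case reaches $\log_2 N$, and the unequal case falls between the two.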

We turn now to a consideration of information transmission. This 
could be treated as simply an extension of the concept of coding which 
we have already developed; however, there is considerable heuristic 
value in considering information transmission within the framework of 
a physical communication system. Such an approach also ties in with the 
historical development of the theory and permits us to introduce in a 
natural way such terms as “equivocation,” “noise,” and “channel capac- 
ity” which have become so closely associated with information theory. 

Basically, a communication system consists of three parts: a message
source of the type that we have already defined, a destination, and a
channel, which is a fixed transmission system that establishes a connection
between the source and the destination. The physical realization of such a
system will
ordinarily be somewhat more complicated. In general, the decisions 
made by the source will have to be encoded into a form that is suitable 
to the channel. A communication system will thus often include a trans- 
mitter which “matches” the channel to the source, for example, changing 
sound pressure into electrical current. A receiver may be necessary to 
perform a similar matching between the channel and the destination. In 
many cases, these will be inverse operations where we can think of the 
transmitter as encoding a message and the receiver as decoding it.

In any physical system, we must also consider the possibility of 
“noise.” The message received at the destination will seldom, if ever, be 
a perfect representation of the message generated by the source. The 
received message will thus be a function of the source message plus 
something else. It is as if there were a second source operating through 
the same channel, and we have so represented it in Fig. 2. The
characteristic of this second source is that it introduces messages into the
channel which neither the original source nor the destination can predict
in detail.

It should be noted that this is a restrictive definition of noise. For
instance, the hum in a radio set is not noise in this sense. It is a fixed
distortion that we can, in principle, build a network to filter out. The 
case in which we are interested here is that in which the message does 
not always undergo the same change in transmission. Under these condi- 
tions noise can be thought of as a statistical process analogous to the 
statistical process which generates messages. 

Using this abstract model of a communication system as a basis, we 
can begin to clarify the concept of information transmission. We are 
essentially concerned with the relations between a number of information 
sources. Viewed from the destination we have a sequence of selections or 
symbols, generated by the channel,³ which is regarded as a fixed
transmission system with a fixed set of transmission symbols among which the
selections are made. These selections are related through the channel to
the selections made by the message source, and the amount of information
transmitted clearly depends on the correspondence between the
selections made by the message source and the sequence of symbols
received at the destination.

Fig. 2. Generalized representation of a communication system.

The possible correspondences that might exist are schematized in 
Fig. 3. In I, we have represented the simple coding operation. There 
is a one-to-one correspondence between the input symbols (x) and 
the output (y). In communication terms this would correspond to 
transmission through a noiseless channel and the conditional probabilities,
$p_{x_1}(y_1), p_{x_2}(y_2), \ldots$, are all equal to 1.

³ Strictly, we should speak of the input to the transmitter and the output from
the receiver. This, however, tends to complicate description and it should be
obvious that the formal model is not affected by what we choose as input or output.
Abstractly, we are simply dealing with the relations between two random variables,
whatever their source.

II and III represent, respectively, one-many and many-one relations.
In II it is not possible to predict the output, given the input. If we
assume that the input selections are equally likely we can compute
H(x) = 1 and H(y) = 2 from the conditional probability matrix shown
and we see that, in effect, information has been added by the channel.
This corresponds to “noise” as we defined it above and justifies our
representation of noise as a separate information source.
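The figures H(x) = 1 and H(y) = 2 can be reproduced with a small one-many channel. The matrix below is our own assumption: two inputs, each leading with probability one-half to one of two outputs of its own, which is the simplest arrangement consistent with the values quoted.

```python
from math import log2

def entropy(probs):
    """H = -sum p log2 p, in bits; zero-probability terms are skipped."""
    return sum(-p * log2(p) for p in probs if p > 0)

p_x = [0.5, 0.5]                      # equally likely input selections
# Assumed conditional probabilities p_x(y): rows are inputs, columns outputs.
channel = [[0.5, 0.5, 0.0, 0.0],
           [0.0, 0.0, 0.5, 0.5]]

# Probability of each output: sum over inputs of p(x) * p_x(y).
p_y = [sum(p_x[i] * channel[i][j] for i in range(len(p_x)))
       for j in range(len(channel[0]))]

H_x = entropy(p_x)
H_y = entropy(p_y)
print(H_x, H_y)
```

The additional bit at the output does not come from the message source; it is contributed by the channel itself.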

III is the transpose of II. Here it is possible to predict the output,
given the input. The output, however, is equivocal in that knowing the
output alone does not permit us to specify the input. In communication
theory terms a many-one coding of this sort corresponds to
“equivocation.”

Fig. 3. Schema of the possible correspondences between the inputs (x)
to a channel and the outputs (y). The right-hand figure in each case
represents the conditional probability matrix; cell entries are the
probability of y, given x, $p_x(y)$. The conditional probabilities
indicated for Cases II, III, and IV are arbitrary.

In most physical systems, both noise and equivocation are present as
represented by IV. Here again the channel is completely defined by a 
set of conditional probabilities specifying the relation between the mes- 
sage generated by the source and the sequence of transmission symbols. 

Perhaps the strongest result of information theory has been the 
development of a limit theorem for information transmission in the 
general case of a noisy channel as represented in Fig. 3, IV. Before 
stating this fundamental theorem it will, however, be necessary to state 
several theorems regarding the information associated with joint dis- 
tributions. 

We now consider two sources, as defined before. These sources will 
generate sequences of symbols, x and y respectively. We let p(i) be the 
probability of x=i and p(j) be the probability of y = j. For each 
source we have an information measure H(x) and H(y). In an anal- 
ogous fashion we can define an information measure for the joint 
distribution, 


$$H(xy) = -\sum_{i,j} p(ij) \log_2 p(ij)$$


noting that the distribution p(ij) differs only in notation from any of 
the sources that we have considered before. 
Furthermore, it can be shown that: 


$$H(xy) \le H(x) + H(y)$$

with H(xy) = H(x) + H(y) only if the sequences x and y are independent.


Just as before we can define informational measures for the condi- 
tional probability distributions 


$$H_x(y) = -\sum_{i,j} p(ij) \log_2 p_i(j)$$

and

$$H_y(x) = -\sum_{i,j} p(ij) \log_2 p_j(i)$$


H_x(y) is the amount of information added in transmission, that is, the amount
of information associated with the noise source. Alternatively, it can be
interpreted as a measure of the average uncertainty we have regarding
the output, when we know the input. Similarly, H_y(x) is a measure of
equivocation and can be regarded as the average uncertainty of the
input, knowing the output.

Using the definition of a conditional probability p_i(j) = p(ij)/p(i)
and a little algebra, it is easy to show the following relations between 
these measures: 


$$H(x) + H(y) \ge H(xy) = H(x) + H_x(y) = H(y) + H_y(x)$$

They are schematized in Fig. 4, using the Euler circle diagram
suggested by Miller [23]. 
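
The "little algebra" behind the middle equality can be sketched as follows. This is only an illustrative derivation in the notation already introduced; it writes p(ij) = p(i) p_i(j) and uses the fact that summing p(ij) over j gives p(i).

```latex
\begin{aligned}
H(xy) &= -\sum_{i,j} p(ij)\,\log_2 p(ij) \\
      &= -\sum_{i,j} p(ij)\,\log_2 p(i) \;-\; \sum_{i,j} p(ij)\,\log_2 p_i(j) \\
      &= -\sum_{i} p(i)\,\log_2 p(i) \;+\; H_x(y) \\
      &= H(x) + H_x(y).
\end{aligned}
```

Exchanging the roles of i and j gives H(xy) = H(y) + H_y(x) in the same way, and the inequality on the left then amounts to the statement that H_x(y) cannot exceed H(y).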

From this diagram it is also apparent that we may compute another
quantity T which is the amount of information that is common to x
and y. If we think of x as the input and y as the output from the
channel, T will measure the amount of information associated with the
input that appears in the output, i.e., the amount of information transmitted.
This quantity can be defined:


$$T = H(x) + H(y) - H(xy) = H(x) - H_y(x) = H(y) - H_x(y)$$


a set of equalities that is easily checked by reference to the diagram. Also,
it can be seen that if x and y are independent, H(xy) = H(x) + H(y)
and T = 0. There is, as we should expect, no information transmission.
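
As a concrete check on these relations, the following minimal Python sketch computes all six measures from a table of joint probabilities p(ij). The function names and the two small tables are illustrative assumptions, not taken from the text; the "noisy" table is a hypothetical one-many channel with two equally likely inputs, each spread evenly over two outputs of its own.

```python
from math import log2


def entropy(probs):
    """Information measure in bits; zero-probability cells contribute nothing."""
    return -sum(p * log2(p) for p in probs if p > 0)


def measures(joint):
    """joint[i][j] holds p(ij); returns the measures discussed in the text."""
    px = [sum(row) for row in joint]            # p(i), the input distribution
    py = [sum(col) for col in zip(*joint)]      # p(j), the output distribution
    hx, hy = entropy(px), entropy(py)
    hxy = entropy(p for row in joint for p in row)
    return {
        "H(x)": hx,
        "H(y)": hy,
        "H(xy)": hxy,
        "H_x(y)": hxy - hx,   # noise: uncertainty of output given input
        "H_y(x)": hxy - hy,   # equivocation: uncertainty of input given output
        "T": hx + hy - hxy,   # information transmitted
    }


# Hypothetical one-many channel: each input leads to two outputs, equally often.
noisy = [[0.25, 0.25, 0.0, 0.0],
         [0.0, 0.0, 0.25, 0.25]]

# Hypothetical independent input and output.
independent = [[0.25, 0.25],
               [0.25, 0.25]]

for name, table in (("noisy", noisy), ("independent", independent)):
    print(name, measures(table))
```

For the first table the noise term accounts for the whole difference between H(y) and H(x), and for the second table T vanishes, as the text leads us to expect.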


Fig. 4. Schematic representation of the relations between the information
measures of two associated sources.


This definition gives us a measure of the average amount of in- 
formation per selection that we can transmit over a given channel. How- 
ever, we have no assurance that the receiver will be able to sort out 
the information associated with the source from the information which 
has been “added” by the noise. If, for example, I know that some per- 
centage of the symbols received is incorrect, I have not been helped 
much until I know which symbols are in error. In short, the output of 
the channel may still be equivocal. If our measure of information trans- 
mitted is to have any practical significance, we must somehow insure that 
this amount of information is received correctly. 

This problem has been solved by Shannon, who has been able to
show that by appropriate coding it is possible to reduce the per symbol
equivocation to any desired tolerance and still transmit T bits of information
per selection, on the average. This result justifies the use of T as
a measure of information transmission, for it states that we can (in
theory) “make the probability of incorrect reception of a message as
small as desired; and the number of binary selections used in the transmission
does not need to be larger, on the average, than the information
content of the message divided by T” [10]. An indication of the way in
which such a result can be achieved is given below, adapted from Fano.

Noise in the channel entails that we are never completely certain that
the signal received corresponds to the signal sent. However, it is clear 
that if we utilize only particular signals, or particular sequences of 
signals, we can reduce the probability of error. For instance, the two 
binary sequences 00000 and 11111 will never be confused so long as not 
more than two digits are changed by the noise. If we use long enough
sequences of symbols, we can select particular sequences such that the
probability of confusion among them is as small as we like. We then 
encode the messages generated by the source into this set of channel 
outputs. The first operation is generally referred to as “matching” the 
channel because the procedure used in any given case will depend on the 
channel characteristics, i.e., the particular set of conditional probabilities 
associated with the particular channel under consideration. It is a one- 
many coding, and the, perhaps, surprising result of Shannon’s theorem 
is the fact that the average amount of information per symbol does not 
approach zero but converges on T. 
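
The two-sequence example can be checked exhaustively. The sketch below is an illustration only: it assumes a simple majority-vote receiver (a choice made here, not one stated in the text) and confirms that 00000 and 11111 are always recovered when at most two digits are changed.

```python
from itertools import combinations

CODEWORDS = ("00000", "11111")


def flip(word, positions):
    """Return word with the binary digits at the given positions inverted."""
    return "".join(
        ("1" if digit == "0" else "0") if k in positions else digit
        for k, digit in enumerate(word)
    )


def decode(received):
    """Majority vote, which picks the codeword differing in the fewest digits."""
    return "11111" if received.count("1") >= 3 else "00000"


def all_corrected(max_flips):
    """True if every pattern of up to max_flips changed digits is decoded correctly."""
    for word in CODEWORDS:
        for n in range(max_flips + 1):
            for positions in combinations(range(5), n):
                if decode(flip(word, set(positions))) != word:
                    return False
    return True


print(all_corrected(2), all_corrected(3))
```

This particular code spends five channel symbols on one binary selection, so it buys reliability at a heavy cost in rate. The force of Shannon's theorem is that with longer and better chosen sequences the cost need not grow in this way.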

With this result we have essentially concluded our outline of the 
formal aspects of information theory. Two addenda are needed before 
taking a summary look at the basic assumptions and psychological in- 
terpretations of the model. 

In the first place, we have developed the theory without reference 
to the rate of information transmission. For the communication engineer 
the rate of transmission is of prime importance. In psychological ap- 
plications time does not often enter in as natural a way. There are, 
however, no difficulties involved in using a rate measure which will, of 
course, equal the bits per symbol times the number of symbols generated 
per second. Thus we can define the channel capacity as the maximum 
possible information transmission in either bits per symbol or bits per 
second, where the maximum is with respect to all possible information 
sources used as inputs to the channel. 
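
To make the definition concrete, the sketch below searches over input distributions for a hypothetical binary channel in which each digit is changed with a fixed probability. The channel, the error probability of 0.1, the grid search, and the rate of 5 symbols per second are all assumptions chosen for illustration.

```python
from math import log2


def h2(p):
    """Information measure of a two-alternative source with probabilities p and 1 - p."""
    return 0.0 if p in (0.0, 1.0) else -(p * log2(p) + (1 - p) * log2(1 - p))


def transmitted(p_one, error):
    """T = H(y) - H_x(y) when inputs are 1 with probability p_one and each
    digit is changed by the noise with probability error."""
    p_y1 = p_one * (1 - error) + (1 - p_one) * error
    return h2(p_y1) - h2(error)


def capacity(error, steps=1000):
    """Maximum of T over a grid of input distributions (bits per symbol)."""
    return max(transmitted(k / steps, error) for k in range(steps + 1))


bits_per_symbol = capacity(0.1)
symbols_per_second = 5                      # hypothetical presentation rate
bits_per_second = bits_per_symbol * symbols_per_second
print(round(bits_per_symbol, 4), round(bits_per_second, 4))
```

For this symmetric channel the maximum falls at equally likely inputs; for other channels the maximizing source generally has to be found case by case.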

In the second place, we have not included in our discussion the ex- 
tension of the theory to information sources whose outputs are continuous 
functions of time. Here again the concern of the communication en- 
gineer and the psychologist appears to differ, and to date, the theory of 
continuous transmission has not been of great interest in psychological 
applications. Suffice it to say that Shannon has shown that continuous 
transmission in the presence of noise does not differ essentially from 
transmission of information by discrete selections of the sort that we have 


considered. The information measure of a continuous source, which is
defined:

$$H(x) = -\int p(x) \log_2 p(x)\, dx$$

is not an exact analogue of the measure of a discrete source in that it is
a measure relative to a coordinate system. If the coordinate system is 
changed, the measure changes. Information transmission, on the other 
hand, involves a difference between two sources, so that it is invariant 
under coordinate transformation, and the results of the theory can be 
obtained essentially by substituting integration for the summation that 
we have used in the preceding discussion. For those interested in the 
continuous theory a brief summary is available in Luce [19]. 
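
The dependence on the coordinate system can be seen in the simplest case, a change of scale. The following is an illustrative sketch, not part of the original development; the scale factor a and the density q are symbols introduced here.

```latex
y = a x \quad (a \neq 0): \qquad
q(y) = \frac{1}{|a|}\, p\!\left(\frac{y}{a}\right), \qquad
H(y) = -\int q(y)\,\log_2 q(y)\, dy = H(x) + \log_2 |a| .
```

Rescaling the output of a channel shifts both the output measure and the conditional (noise) measure by the same constant, log_2 |a|, so their difference, the information transmitted, is left unchanged.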


THE APPLICATION OF INFORMATION THEORY IN 
BEHAVIORAL STUDIES 


Information theory is a probability model and suffers from all the 
conceptual difficulties associated with probability theory. In particular, 
the definitions and theorems of the theory assume that it is meaningful to 
speak of infinite samples of past symbol occurrences and the expected 
value for infinite samples of future messages. The theory assumes that we 
know the parameter values of a statistical process and is thus subject to 
the same criticisms in application that have been leveled at Bayes’
theorem. In fact the expression for T can be derived from Bayes’ result 


3]. 
This is not a new problem to psychologists, but it is worth pointing
out that the passage from probability theory to statistics has not yet 
been achieved for information measure. In applications of the theory, 
the experimenter must be prepared to assume that the frequencies with 
which events occur are reasonable estimates of the true probabilities with 
which the model deals. In fact, the maximum likelihood estimates, 
H and T, have been shown to be biased [24] and general corrections 
for these biases are not available. Nor do we know the sampling distribution
of H.
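
The direction of the bias in the estimate of H can be illustrated for the simplest case. The sketch below is an illustration under stated assumptions: a source with two alternatives, independent selections, and an estimate obtained by substituting observed relative frequencies for the probabilities. It computes the exact expected value of that estimate for a sample of n selections.

```python
from math import comb, log2


def h2(p):
    """Information measure of a two-alternative source with probabilities p and 1 - p."""
    return 0.0 if p in (0.0, 1.0) else -(p * log2(p) + (1 - p) * log2(1 - p))


def expected_estimate(p, n):
    """Exact mean of the frequency-based estimate of H over all samples of size n."""
    return sum(
        comb(n, k) * p**k * (1 - p) ** (n - k) * h2(k / n)
        for k in range(n + 1)
    )


true_h = h2(0.5)
for n in (4, 20, 100):
    print(n, round(expected_estimate(0.5, n), 4), "true value:", true_h)
```

The expected estimate falls short of the true value and approaches it as the sample grows, so small samples tend to make a source look more predictable than it is.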
Despite these clear difficulties with the application of information 
measures to finite sequences and limited data, the large number of ex- 
perimental studies that have appeared in recent years using information 
measures is an indication that many researchers are willing to make 
statements about the probability distributions characterizing their data. 
Actually, the use of information theory to describe empirical data is not 


as unreasonable as it might at first appear.
In the first place, as a formal and normative theory, in the sense 


discussed in the introduction, information theory does not lead to 
deductions amenable to critical test, where precision of estimate
might be vital. Rather, it establishes boundary conditions for 
efficient coding. In general, it is the observed large deviations from the
model and the search for their sources that have proven of most interest
to psychologists and engineers who have applied information theory in
their work. 

In the second place, as Fano has demonstrated [8], the statistical 
characteristics assumed a priori can be quite different from the fre- 
quencies actually observed without the efficiency of transmission or 
coding being lowered very much. This arises from the fact that H 
varies rather slowly with any one of the p(i) unless that p(i) is close to 
zero or unity. In a similar fashion, the fundamental coding theorem 
assumes an unlimited delay, or message length, to achieve the optimum
representation given by H. Yet we were able, in our illustration of coding 
procedure, to achieve a better than 99 per cent efficient code by con- 
sidering messages only out to length three. 
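
The tolerance to mistaken a priori statistics can be sketched numerically. The example below is not Fano's own argument [8]; it assumes an idealized code whose word lengths are the negative logarithms of the assumed probabilities, and it compares the resulting average length with H for the frequencies that actually occur. All names and numbers are illustrative.

```python
from math import log2


def entropy(probs):
    """H in bits for a list of probabilities."""
    return -sum(p * log2(p) for p in probs if p > 0)


def average_length(actual, assumed):
    """Bits per symbol used by an ideal code built for `assumed`
    when symbols in fact occur with the `actual` probabilities."""
    return -sum(p * log2(q) for p, q in zip(actual, assumed) if p > 0)


def efficiency(actual, assumed):
    """Ratio of the minimum possible average length to the length actually used."""
    return entropy(actual) / average_length(actual, assumed)


actual = [0.5, 0.5]
print(round(efficiency(actual, [0.6, 0.4]), 3))    # moderate misjudgment
print(round(efficiency(actual, [0.99, 0.01]), 3))  # assumed probability near unity
```

A moderate error in the assumed probabilities costs only a few per cent, whereas assuming a probability close to zero or unity is expensive, which parallels the remark about where H varies rapidly.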

On the assumption, then, that applications to empirical data are 
to some extent tenable, information theory has played a number of roles 
in psychology. Not the least of these has been that of a general reorienta- 
tion in thinking about various behavioral problems. It is very easy to 
substitute psychological terms into the verbal structure of the model, or 
to extend the technical terms of the formal theory to describe the classic 
concerns of the psychologist. Thus, the inputs and outputs of a channel 
are easily and naturally translated into the stimuli and responses of the 
organism. Proceeding in the other direction, as Crossman [5] suggests, 
we can say: “Instead of a stimulus causing a reaction when the threshold 
is exceeded, we now think rather in terms of a signal which may be 
obscured by noise, providing the information needed to select a re- 
sponse.” 

Verbal recodings of this sort are not simply playful. As suggested 
in the introduction, they influence the choice of variables and the design 
of experiments. Crossman, for example, continues his discussion by 
pointing out that, unlike a stimulus, a signal (which should be regarded 
as the output of a transmitter as we have defined it) implies a set of 
alternatives and thus emphasizes the effect on behavior of what might 
have been as well as what is immediately present. Furthermore, a signal 
in this sense functions purely as the basis for response selection. It can, 
according to the theory, be coded into a variety of physical forms and 
embedded in a variety of signal sets, without effect on its selective 
function.

The respective dependence and invariance suggested by this re- 
formulation of the basic psychophysical problem have stimulated a great 
deal of research in recent years [4, 5, 6, 7, 15, 16, 18, 19, 22, 23, 28, 29, 
30, 31]. This work has been directed to the study of human information 
processing and considers, in Miller’s [23] terminology, the “transmission 
situation.” The observer is regarded as a “channel” and the interest has 
been in estimating his channel capacity under various conditions and for
various input stimuli. 

The apparent contradiction in terms—channel capacity is defined in 
the theory as the maximum transmission for all possible input sources— 
is avoided by the intuitively reasonable assumption that man is a multi- 
channel system. Because the information measure is invariant under 
operations of coding, it becomes possible to compare information trans- 
mission among these various channels and to determine the efficiency of 
various coding schemes. For example, the same amount of information
can be presented to an observer by exchanges between the number of 
symbols per second and the amount of information per symbol. Similar 
exchanges can be made by distribution of the input simultaneously over 
different channels, or by permitting different aspects of the stimuli to 
vary and hence “carry” information. 

Studies like these which are directed at optimizing the display of in- 
formation in the sense of maximizing the amount of information trans- 
mitted have been of considerable value to the growing field of engineer- 
ing psychology. Perhaps, of most importance, the results of these studies 
indicate the existence of a minimal discriminative difference, proportional 
to the range of variation and invariant for different discriminable aspects 
of the stimulus. This is the stuff of which substantive theory is made, and
indeed, it has suggested hypotheses regarding the manner in which people
process, or organize, the sensory inputs from their environment [22]. It
must, however, be emphasized that such developments are not an ex- 
tension of information theory. The theory simply provides a conceptual 
framework and set of measures by means of which we can analyze a 
number of diverse situations. Any “model building,” in the usual sense, 
must develop from the empirical results obtained. 

As we have attempted to show in our development, the theory deals 
primarily with the coding problem and in particular with the question 
of optimal coding. Such a theory requires some measure of variation 
and of contingency, and Shannon has demonstrated that H and T are 
peculiarly appropriate for the analysis of a communication system, which 
is his main interest. The abstract structure of such a system is, however, 
much more general than the terminology associated with the theory 
might suggest, and mean logarithmic probability measures are applicable 
to many situations where terms like “information transmission” are 
awkward or quite inappropriate.

Consider, for instance, the conditional probability matrices of Fig. 
3. These present the possible relations between an input and an output, 
or, as we have suggested, between stimuli and responses. More generally, 
we might say that these tables represent the various ways in which any
set of events can be mapped into another set of events. Phrased this way,
it is apparent that we could also consider mapping the set that we are
concerned with into itself. Here x and y represent the same set of responses
and the contingency could be sequential or temporal dependence.
In this case, we would read for the conditional probabilities p_x(y), “the
probability that the response x is followed by the response y.” As x and y
are representative of the same set, we can adopt our earlier notation and
represent the alternative responses and their probabilities by p(i), where
i = 1, 2, . . . , k, the set of alternative responses recorded. As
before, p(ij) is the probability of the responses i and j occurring in that
order and equals p(i) p_i(j). It should then be apparent that, by a somewhat
devious route, we have arrived back at the point in our earlier
development where we considered the definition of H for nonindependent
source selections. In short, H may be used as a measure of the
internal dependencies, or degree of patterning, within a sequence of
responses.

This result is obtained, much more elegantly, by Shannon in a 
theorem which shows that a series of approximations to H can be obtained
by considering sequences extending over 1, 2, . . . , n symbols.
The estimates of H thus obtained are a monotonic decreasing function of 
n. The function will equal H in the limit, or when the length of sequence 
considered has exhausted the statistical dependencies within the sequence. 
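
A minimal sketch of such a series of approximations is given below. It is an illustration rather than Shannon's own procedure: the estimate for length n is taken as the uncertainty of a response given the n - 1 responses before it, computed from observed block frequencies, and the sequence is read circularly so that every position contributes one block. The response sequence itself is hypothetical.

```python
from collections import Counter
from math import log2


def block_entropy(seq, n):
    """Information measure (bits) of the n-symbol blocks of seq, read circularly."""
    if n == 0:
        return 0.0
    wrapped = seq + seq[: n - 1]
    counts = Counter(wrapped[k : k + n] for k in range(len(seq)))
    total = len(seq)
    return -sum(c / total * log2(c / total) for c in counts.values())


def approximations(seq, max_n):
    """Estimate for each n: uncertainty of a symbol given the n - 1 symbols before it."""
    return [
        block_entropy(seq, n) - block_entropy(seq, n - 1)
        for n in range(1, max_n + 1)
    ]


responses = "AABB" * 25          # hypothetical, strictly patterned response record
print(approximations(responses, 4))
```

In this strictly patterned record the single-response and pair estimates still show a full bit of uncertainty; only when two preceding responses are taken into account does the estimate drop to zero, at which point the dependencies have been exhausted.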

It is thus apparent that information theory offers a ready-made tool 
for the description of behavior patterns and the analysis of sequential 
behavior [13, 19, 23, 25]. In this connection, it appears to be most con- 
venient to consider the relative uncertainty of the sequence, which is the 
ratio of the obtained value of H to the maximum value that would ob- 
tain for the same set of alternatives if they were equally likely and 
occurred independently. One minus this quantity is the percentage of 
redundancy in the sequence, and in application to response sequences,
redundancy may be used as a measure of the degree of stereotypy, or 
organization, of the observed behavior. 
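
The arithmetic is simple enough to state in a few lines. In the sketch below the function names and the numerical values are hypothetical; the obtained value of H is taken as given, in bits per response.

```python
from math import log2


def relative_uncertainty(h_obtained, alternatives):
    """Obtained H divided by its maximum, log2 of the number of alternatives,
    which holds when they are equally likely and independent."""
    return h_obtained / log2(alternatives)


def redundancy(h_obtained, alternatives):
    """One minus the relative uncertainty, as a proportion."""
    return 1.0 - relative_uncertainty(h_obtained, alternatives)


# Hypothetical record: four alternative responses, obtained H of 1.5 bits each.
print(redundancy(1.5, 4))
```

A record in which each response is fully determined by its predecessors has an obtained H of zero and a redundancy of one, the extreme of stereotypy.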

In fact, informational measures have been little used in the description
of behavioral strategies and organization except in the study of
language [26, 27, 33]. As suggested in the introduction, the effect of
the theory may well be indirect. Psychologists have long recognized that
successive responses are seldom independent. Information theory, or more
properly the probability model on which it is based, explicitly recognizes
sequential dependencies and demonstrates that they can be taken into 
account in a reasonable fashion. Again, the particular formulation of the 
theory may not be optimal, and indeed, in a particularly simple demon- 
stration of the effects of an anxiety-producing stimulus on the pattern of 
responding [12] an alternative time series analysis was employed. However,
it is exactly these peripheral effects that give power to the formal
theory, emphasizing its generality as well as the distinction between in- 
formation measures and the mathematical theory of communication for 
which they were originally derived. 

As we have indicated, information measures are applicable wherever 
the data in which we are interested can be arranged into a contingency 
table. So are a number of other measures, and the common logical 
structure underlying them has been pointed out in papers by McGill [21] 
and Garner and McGill [17]. 

The similarities between information analysis and the analysis of 
variance first became apparent as a result of McGill’s work on multi- 
variate information transmission [20]. He points out that the basic 
equation for information transmission essentially decomposes a measure 
of response uncertainty H(y) into a part determined by the stimulus 
T(x,y) and a part, H_x(y), which consists of unexplained variation.
There are cases, however, in which some portion of this residue H_x(y)
might be accounted for by the action of another variable in the stimulus 
situation, and it would be useful to extend the definition of transmitted 
information to include two sources u and v. This can be accomplished by 
replacing x in the equation, 


$$T(x,y) = H(x) + H(y) - H(x,y)$$

with the symbol (u,v). If we eliminate the extra parentheses, we can then
write:

$$T(u,v;y) = H(u,v) + H(y) - H(u,v,y)$$

where T(u,v;y) can be regarded as a measure of the association between
the variables u, v, and y.

McGill then shows that it is possible to separate the dependence of y 
on u and on v in a fashion completely analogous to the way in which 
the variance of a criterion variable can be partitioned and associated 
with the predictor variables, certain interactions, and a residue, or “error 
variance.” As with analysis of variance, informational analysis can be 
extended to any number of sources or predictor variables. Because in- 
formational measures require no assumptions about the metric prop- 
erties of the variables involved, informational analysis can be regarded as 
a nonparametric analysis of variance. However, just because variance 
is metric it is obvious that information and variance cannot, in general, 
be related by an equation. The relation rather derives from a formal 
similarity which is explored by McGill and Garner in the papers cited
above. 
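
The two-source measure, and the way a dependence can lie wholly in the interaction of the sources, may be illustrated with a small computation. The sketch below is not McGill's procedure; the table is hypothetical, with y equal to 1 exactly when u and v differ and all four pairs (u, v) equally likely.

```python
from collections import defaultdict
from math import log2


def entropy(dist):
    """H in bits for a distribution given as {cell: probability}."""
    return -sum(p * log2(p) for p in dist.values() if p > 0)


def marginal(joint, keep):
    """Sum a joint distribution {(u, v, y): p} down to the coordinates in keep."""
    out = defaultdict(float)
    for cell, p in joint.items():
        out[tuple(cell[k] for k in keep)] += p
    return out


def transmission(joint, inputs, output):
    """T = H(inputs) + H(output) - H(inputs, output)."""
    return (
        entropy(marginal(joint, inputs))
        + entropy(marginal(joint, output))
        - entropy(marginal(joint, inputs + output))
    )


joint = {(u, v, u ^ v): 0.25 for u in (0, 1) for v in (0, 1)}
U, V, Y = (0,), (1,), (2,)

t_uy = transmission(joint, U, Y)        # y predicted from u alone
t_vy = transmission(joint, V, Y)        # y predicted from v alone
t_uvy = transmission(joint, U + V, Y)   # y predicted from u and v together
print(t_uy, t_vy, t_uvy)
```

Neither source alone transmits anything about y, yet the pair accounts for all of the response uncertainty; in the language of the analysis of variance, the whole effect is an interaction.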

Furthermore, this isomorphism extends to correlational analysis. In 
all three methods, the attempt is made to account for the variation of 
some variable by the variation of one or more predictor variables.
Following McGill’s notation we can write

$$M(y;w,x) = M(y,x) + M_x(y,w) = M(y,w) + M_w(y,x)$$

where M(y;w,x) is a generalized measure of the amount of variation in
the dependent variable y that can be predicted from w and x. As indicated,
this variation can be analyzed in two different ways. The important
point is that the effects of any particular variable are analyzed
only after removing, or fixing, the influences of variables studied
previously.

The general structure of the analysis is illustrated by the formulas

$$T(y;w,x) = T(y,x) + T_x(y,w)$$

$$V(y;w,x) = V(y,x) + V_x(y,w)$$

$$V^*(y;w,x) = V^*(y,x) + V^*_x(y,w)$$

The T measure is transmitted information. The V measure is the
variance of the subclass means ȳ_w and ȳ_x around the grand mean ȳ. The
V* measure is the variance of points on the regression plane or regression
line.

McGill has thus been able to generalize the concepts developed by
Shannon, and in so doing gives us a proper perspective on the mathematical
formalism of information theory.
INTRODUCTION


The discussion outline for this set of theoretical papers asks each 
author to do some serious soul-searching concerning “background factors 
and orienting attitudes” which influenced the objectives, methods, and 
content of the system that he describes. This presumably takes for granted 
that the author of the paper is the author of the theory, which in my 
case is not true. The theory that is presented in this paper, or at least the 
model upon which it is based, is very old as age is counted in psycho- 
logical theories, and I do not propose to probe the souls of the engineers 
and mathematicians who were originally responsible for it.

Linear frequency theory was presented ready-made to psychologists
[16]. It had been tested and found useful as a description of certain behavior
of machines, and engineers who had used this description in the
design of machine components of man-machine systems suggested to me
and to other psychologists that it might be equally useful as a description
of certain behavior of man. The task of these engineers would be greatly
simplified if the behavior of both man and machine components of such
systems could be described in terms of the same variables. This suggests
one reason for my own interest in linear frequency theory. It is always
gratifying to a psychologist to know that the results of his investigations
may be useful to someone who is not a psychologist.

Perhaps more important, linear frequency theory meets two meth- 
odological requirements that must be satisfied by any theory if it is to be 
successful. First, the theory is formulated in unambiguous language, in 
this case the language of mathematics, in which the rules of transformation
are sufficiently explicit and unambiguous that it is possible to determine
whether one statement is or is not a consequence of another.
Second, there is at least one important class of unambiguously observable 
behavioral events to which the theory is clearly applicable. Linear fre- 
quency theory is applicable to simple movements, a kind of behavior that 
can be observed with maximum precision. A theory that satisfies the 
methodological requirements of a purist and at the same time has im- 
mediate practical implications is all too rare in psychology. 

Analysis of human motor behavior in terms of linear frequency 
theory provides an example of a rigorous theoretical approach to a very 
practical problem, namely, the design and analysis of the performance of 
complex man-machine systems. Some aspects of frequency analysis in 
this context are not new to psychologists. Some of the most fruitful pro- 
cedures in the study of language from the point of view of both scientific 
and social significance are those associated with the application of linear 
frequency theory to speech and hearing. From all the complex response 
configurations which we call spoken language, this analysis abstracts a 
single characteristic, the physical frequencies represented in speech 
sounds. The analysis of speech behavior in terms of this abstract variable 
has made possible a very precise description of speech which has been 
utilized in the design of a wide variety of mechanical and electronic 
systems for communication by means of the spoken word. Scientifically 
the frequency analysis of speech represents one of the most successful 
theoretical treatments of behavior that has ever been developed. The 
social significance of this peculiarly abstract analysis becomes obvious if 
we ask what would happen to our society if all mechanical and electronic 
devices whose design incorporates knowledge of the frequency character- 
istics of human speech were to disappear. Telephone and radio com- 
munication, hearing aids, auditorium acoustics, and many similar devices 
which, together with a human speaker and hearer, constitute man- 
machine communication systems would disappear with them. 

In the absence of the machine components of speech communication 
systems the frequency analysis of human speech would have only aca- 
demic interest: it would probably be of concern only to the purest of 
scientists. Similarly, the frequency analysis of human motor behavior has 
scientific value as a contribution to basic knowledge, but like the frequency
analysis of speech, it has immediate interest because of the practical
applications which await this knowledge. In a context of systems
engineering, it provides an excellent example of a basic scientific analysis
that, potentially, can contribute very significantly to the solution of practical
problems.


DESCRIPTIONS OF MAN-MACHINE SYSTEM PERFORMANCE 


A wide variety of formal procedures has been developed by systems 
engineers and others for the analysis and description of complex organiza- 
tions or systems, ranging from the relatively simple and empirical repre- 
sentations of static aspects of systems such as wiring diagrams and 
organization charts, to highly theoretical analyses of dynamic or time- 
varying characteristics. Two of the latter, information theory, associated 
with digital computers and discrete-variable communication systems, and 
servo theory, associated with analogue computers and continuous-variable 
amplifying systems, are of special interest to psychologists since they pro- 
vide mathematical models which show promise of adaptation to aspects 
of human behavior. Applied to the analysis and design of systems, these 
two approaches have much in common, although their similarities are 
obscured by differences in the vocabularies associated with each and by 
the fact that certain characteristics of the systems to which they are 
likely to be applied are quite different. 

In both approaches, the emphasis is upon the precision of relation- 
ship between input and output variables rather than upon the efficiency 
of energy and materials utilization which has been the major consider- 
ation in traditional engineering design. Both are concerned primarily 
with the tracing of signals through complex devices or organizations 
within which the signal may take many different physical forms: the 
signal may be manifested as the opening and closing of switches or of 
relay connections or hydraulic valves, as human speech, as changes in 
temperature, voltage, or pressure, as linear or rotational movements, etc. 
The practical effectiveness of both theories for the analysis of the func- 
tioning of complex systems results primarily from their reduction of a 
great variety of superficially quite different concrete events to a single 
abstract variable—a signal characteristic which may be followed through 
many physical transformations. For information analysis, this “system 
variable” is information, expressed in terms of the unit bit and as a time 
function as bits per second. For servo theory, the comparable variable 
is change in any of many empirical variables expressed as a time func- 
tion whose component units are single-frequency sine waves. 

Differences between the two approaches are associated with this 
difference in basic variable. The bit of information theory is empirically 
represented as the occurrence of one of two possible events. It is best 
adapted to systems within which the obviously important events can
conveniently be recognized and described as discrete states, and in which 
details of intermediate or transitional states can be ignored. Information 
theory has been applied, for example, to the design of digital computers 
and to those parts of telephone circuitry concerned with making connec- 
tions. In such systems, the basic component is a device similar to a switch 
or relay and the data represent the occurrence of discrete events or 
operations. The performance of the system is evaluated in terms of ac- 
curacy and speed with which discrete operations are performed. Am- 
biguous data, classified according to the principle of the excluded 
middle and transitional states, e.g., the detailed path taken by a relay 
armature as it moves from open to closed position, are ignored except as 
the total time required for the transition affects the speed with which 
the system can perform its function accurately. Applied to human be- 
havior, information theory is best adapted to the theoretical simplification 
of decision processes manifested in responses such as button-pushing, 
switch operations, telegraph, typewriter, and piano key pressing, neuron 
action, and in molar and qualitative units of language such as letters, 
numerals, and words. Servo theory, on the other hand, is applied to 
systems or parts of systems in which data are conveniently treated as 
continuous variables. In such systems, the signals are wavelike and the 
details of continuous changes in state are of importance. Examples of 
continuous-variable systems are a-c circuits, including human speech and 
the speech-transmission components of telephone circuits, and steering 
and aiming devices which often include human operators as components 
performing the function called tracking. 

In systems whose operations involve discrete events, accuracy of trans- 
mission is attained by procedures similar to those which contribute to the 
precision (and the limitations) of two-valued logic: alternatives are 
chosen so as to be maximally distinguishable and intermediate values are 
arbitrarily excluded. Input signals such as words or other language units 
introduced into the systems are engineered or coded in such a way as to 
minimize the occurrence of intermediate signals or responses to such 
signals which produce error. Operational error is reduced by this tech- 
nique and by the use of components which respond to or transmit alter- 
native signals with maximum reliability. Since wires, switches, and 
relays with the necessary high reliability are available, the digital systems 
that incorporate them as components have what is called “intrinsic” pre- 
cision. A similar intrinsic precision is not a necessary characteristic of 
continuous-variable systems—in fact these may almost be said to be 
characterized by an intrinsic inaccuracy. A system which transmits or 
performs other operations upon continuous variables might be expected 
to increase in accuracy as its sensitivity to small variations increases. But
as its sensitivity to signals increases, so does its sensitivity to irrelevant
characteristics of the signal introduced by imperfect transmission within 
the systems or by imperfect shielding of the signal from external influ- 
ences. Scientists who have worked with maximally sensitive recording 
equipment are well aware that maximum sensitivity does not guarantee 
maximum accuracy of transmission of the signal they may wish to record. 
In the design of servo systems sensitive to continuously varying signals, 
this paradox of sensitivity is associated with the fact that a servo system 
is an amplifier which (a) may multiply error in a signal just as effectively 
as it multiplies relevant characteristics of a signal and (b) may itself 
introduce the error which it multiplies. Examples of such error are the 
vacuum tube noise of electronic amplifiers and the lag and overshoot re- 
sulting from the inertia of massive parts of a system such as motors and 
gears. 


THE FEEDBACK CIRCUIT 


Servo theory makes use of two basic techniques for the elimination of 
error, the feedback circuit and selective frequency amplification. The
feedback circuit is a structural characteristic of certain systems in which
a signal follows a path which when diagrammed appears as a loop. There 
are many forms and uses of the loop circuit, perhaps the simplest of 
which is the negative feedback loop. Having passed through all or part 
of the system, the output signal or a suitable function of it is returned by
a different path to a device which compares it with the input. The
difference, which is the error of the system, is reversed in sign, and added
to the input. (This reversal is indicated by the term “negative feedback.”)
In effect the output of an ideal feedback system of this sort
is a negative function of its error: it responds so as to reduce error and 
it stops responding when error is zero. 

In its simplest form the negative feedback loop may easily be demon- 
strated as in the following example: 

Example 1. Let you, the reader, function as a signal amplifier, with 
a vertical line across the room providing a signal to be amplified in en- 
ergy by the arm movement involved in pointing at it with a pencil. Hold- 
ing the arm to one side, look at the line, close your eyes, point at the 
line, then open your eyes and correct the pointing error. A few repeti- 
tions will illustrate the increase in accuracy which can be obtained by 
adding a reasonably well-designed feedback loop to a straight line (open 
loop) amplifier circuit. In this example, looking while pointing completes 
a feedback loop from output to input comparator. The open and closed 
loop systems are represented schematically in Fig. 1. 
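
The gain in accuracy described in Example 1 can be sketched numerically. The following is a minimal illustration only, not a model of the human operator: the open-loop gain of 0.8, the correction fraction of 0.5, and the function names are assumptions introduced here for the purpose.

```python
def open_loop(target, gain=0.8):
    """Point once with eyes closed: the output is a fixed, imperfect multiple of the input."""
    return gain * target


def closed_loop(target, gain=0.8, correction=0.5, steps=10):
    """Point, then repeatedly look and remove a fraction of the observed error."""
    output = gain * target
    history = [output]
    for _ in range(steps):
        error = output - target        # comparator: output compared with input
        output -= correction * error   # error reversed in sign and fed back
        history.append(output)
    return history


target = 1.0
open_error = abs(open_loop(target) - target)
closed_error = abs(closed_loop(target)[-1] - target)
print(f"open-loop error:   {open_error:.6f}")
print(f"closed-loop error: {closed_error:.6f}")
```

With these assumed values each look-and-correct cycle halves the remaining error, so the closed-loop error shrinks toward zero while the open-loop error stays fixed.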


FREQUENCY ANALYSIS 


Although the notion of the feedback loop is extremely useful in the 
design and analysis of dynamic systems, it actually represents only a 
small part of the theory of such systems—chiefly the nonquantitative 
part which can be represented in a schematic diagram. Its primitive
character as theory is indicated by the fact that the use of negative feed- 
back does not at all guarantee the improvement in performance sug- 
gested by our example. If components are improperly chosen or matched, 
the addition of the feedback loop to a functioning system may produce 
a wild oscillation in response to certain signals, increasing rather than de- 
creasing the performance error. The real power of servo theory in the 
design of systems which amplify with a minimum error is based upon
a far more detailed mathematical model of system operation provided by 
a combination of frequency analysis of the signal with a description of 
the system and its components in terms of equations such as those ac- 
companying Fig. 1. A system is described theoretically as a set of math- 
ematical operations performed (in an order specified by the nature of 
the circuit network connecting the components which perform each 
operation) upon an input signal described as the sum of a set of sine 
functions of various frequencies, amplitudes, and phase relations. The 
“system equation” of the form O = f(I) relating output and input is
obtained by appropriate combination of the equations representing each 
component. It is important to note that the equation for each component 
has the same basic form as the equation for the system as a whole—its 
output is some function of its input. Any component in a system may
itself be a complex system whose internal operations can be ignored if
the summarizing input-output function is known. In this case the engi- 
neer calls it a “black box,” an exact synonym of the psychologist’s “empty 
organism.” 
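
The warning that negative feedback does not guarantee improvement can be illustrated with the same kind of toy loop. In this sketch the "improperly matched component" is represented, as an assumption made here, by a correction fraction larger than 2, so that every correction overshoots by more than the error it was meant to remove.

```python
def feedback_errors(initial_error, correction, steps):
    """Successive errors of a loop that subtracts `correction` times the current error."""
    errors = [initial_error]
    for _ in range(steps):
        errors.append(errors[-1] - correction * errors[-1])
    return errors


well_matched = feedback_errors(0.2, correction=0.5, steps=10)
mismatched = feedback_errors(0.2, correction=2.5, steps=10)

print("well matched:", [round(e, 4) for e in well_matched[:5]])
print("mismatched:  ", [round(e, 4) for e in mismatched[:5]])
```

In the mismatched case the error changes sign on every cycle and grows by half each time, a simple instance of the oscillation the text describes.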

The value of describing the signal in terms of its frequency character- 
istics may be illustrated by an extension of Example 1, making use of 
some dynamic relationships intuitively familiar to everyone, namely, the 
input-output function for the simple pendulum. 

Example 2. In the previous example, the machine component is the 
relatively rigid pencil which transmits arm movements essentially with- 
out change or delay, and the feedback loop is completed through even 
more rigid light rays. In consequence the performance of the system is 
essentially equivalent to the transmission characteristics of its human 
component. In Example 2, we shall hypothetically modify this system to 
emphasize the characteristics of a machine component. Let us hang a 
long and heavy free-swinging pendulum from the pencil and consider the 
movements of the pendulum bob to be the output of the system. The 
pencil but not the pendulum will be visible to the human operator. We 
shall make the simplifying (and for some situations involving only low 
frequency inputs, correct) assumption that the pencil follows the move- 
ment of the line with negligible error. For the portion of the system be- 
tween the vertical line and the pencil supporting the pendulum, we shall 
assume that the operation represented by f in the equation O = f(I)
is simply multiplication by a constant, 1.0. In this case the critical per- 
formance characteristics of the system are essentially those of the pendu- 
lum. The pendulum is, of course, a very simple machine, but it is one 
which exhibits most of the basic characteristics of all machine compo- 
nents in man-machine systems. It has an input, movement of its support- 
ing pivot, and an output, movement of its bob; it has inertia; its move- 
ments are subject to damping, etc.; and all these characteristics may be 
known and subject to manipulation by the design engineer. 

If the pendulum bob weighs, say, 10 pounds, the wire it is suspended 
on is 10 feet long, and its pivot is relatively friction-free, the re- 
sulting man-machine system will follow irregular movements of the verti- 
cal line with negligible error only if the movements of the line are very 
slow and smooth. If the line jerks to a new position and stops, two kinds 
of error will develop, both associated with the inertia and elasticity of 
the pendulum: first a lag as the pendulum bob falls behind, and second 
an overshoot as the bob catches up and swings past. The bob will then 
continue to oscillate around the line, slowing down and reducing error 
to an acceptable minimum only after some lapse of time as a result of 
friction and the viscous damping of the air. If the line oscillates regularly 
at certain frequencies, the pendulum will begin to oscillate at the same
frequency with increasing amplitude (resonating) to produce very large
error by exaggerating the amplitude of the input. If the line oscillates 
at higher frequencies, the pendulum may stand stationary, producing 
errors of a different kind. There is, of course, an infinite variety of pos- 
sible movements the line might make, and to each of these the pendulum 
would respond differently. Should we wish to describe the performance 
of this machine in great detail we could go on indefinitely at the empiri- 
cal level describing the possible movements of the line and the response 
which the pendulum bob makes to each of them. We could also determine
a comparable set of these input-output relationships for each of all
possible values of bob weight, wire length, air density, and pivot friction 
and for every combination of these and of any other variable which 


[Fig. 2. Frequency response curve: amplification ratio, O/I, plotted against input frequency, with the critical frequency marked.]


If we examine only the performance of the pendulum or machine 
component in Example 2 (which in this case is essentially equivalent 
to the performance of the system as a whole), we find that tremendous 
simplification of its input-output relationships can be accomplished by 
describing the input, movements of the pendulum support, and the
output, the movements of the bob, as the sum of a set of sine functions of
varying frequency, phase relations (relative displacement in time), and
relative amplitude. When inputs are classified according to these variables 
and the corresponding outputs described similarly are examined, a num- 
ber of regularities become apparent. For any simple pendulum, for ex- 
ample, inputs composed of single sine wave form oscillations whose 
frequencies are below a certain critical frequency are reproduced in the 
output almost exactly, i.e., with the same frequency and amplitude and 
only slightly and predictably delayed in time. Input frequencies at or 
near the critical frequency appear in the output with increased ampli- 
tude, and input frequencies above the critical frequency are reproduced 
with decreased amplitude, approaching zero amplitude as frequency 
increases. These relationships may be summarized in a “frequency re- 
sponse curve” such as that in Fig. 2, which might be obtained for an un- 
damped pendulum if amplification ratio were plotted as a function of 
input frequency. The general form of this curve is characteristic of the 
response of a large class of machines. 
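
A curve of the kind summarized in Fig. 2 can be sketched for the 10-foot pendulum of Example 2. The sketch assumes small swings, the standard second-order response formula for a pendulum driven at its pivot, and a hypothetical damping ratio of 0.05 (a truly undamped pendulum would give an unbounded ratio at the critical frequency). None of these figures comes from the text.

```python
import math

G_FT_PER_S2 = 32.174  # standard gravity in feet per second squared


def critical_frequency_hz(length_ft):
    """Natural frequency of a simple pendulum under the small-angle approximation."""
    return math.sqrt(G_FT_PER_S2 / length_ft) / (2 * math.pi)


def amplification_ratio(input_hz, critical_hz, damping_ratio=0.05):
    """Output/input amplitude ratio of a lightly damped second-order system."""
    r = input_hz / critical_hz
    return 1.0 / math.sqrt((1 - r * r) ** 2 + (2 * damping_ratio * r) ** 2)


f0 = critical_frequency_hz(10.0)
print(f"critical frequency: {f0:.3f} cycles per second")
for multiple in (0.1, 0.5, 1.0, 2.0, 10.0):
    ratio = amplification_ratio(multiple * f0, f0)
    print(f"input at {multiple:>4} x critical: O/I = {ratio:.3f}")
```

Under these assumptions the ratio stays near 1 well below the critical frequency, peaks at it, and falls toward zero above it, which is the general shape the paragraph describes.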


LINEARITY 


Although the summary of the performance of a machine provided by 
a function relating amplification ratio to the frequency of simple sine 
form inputs would probably be useful, it has very limited applicability, 
since inputs in the form of simple sine waves are relatively rare—they 
represent only a small sample of the possible inputs to a system. However, 
tremendous generality for this description of the behavior of a machine 
is developed when it can be shown that, for any input which is the sum 
of two or more sine waves, the output is the sum of the responses made 
by the machine to waves of each of these frequencies separately, i.e., 
when each sine wave component of the input is reproduced as a component
of the output with its amplitude multiplied by the factor (O/I
ratio) associated with its frequency. A system which has this additive 
characteristic is called a linear system. This meaning of the term “linear” 
is not to be confused with the more familiar trade usage which describes 
an amplifier as linear between certain frequency limits when it is linear 
in the additive sense and satisfies the added restriction that its amplifica- 
tion ratio is a constant within the specified limits. 

Since a motion of any form can in principle be reduced to a set of 
sine form components of appropriate frequencies, amplitudes, and phase 
relations, the frequency analysis of the performance of linear systems has 
great generality, limited in application only by the requirement of
linearity for the system in question and by the availability of practical
techniques for the frequency analysis of input and output into their
component sine waves. Formal methods of analysis applicable to certain
nonlinear systems are available but have not been developed to the extent
that they are widely used in systems engineering. 
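
The additive property that defines a linear system can be checked numerically. The sketch below is an illustration under assumptions made here: a first-order lag stands in for a linear machine, and the same lag followed by a hard limit stands in for a nonlinear one. Neither is taken from the text.

```python
import math

DT = 0.01   # hypothetical sampling interval, seconds
N = 400     # four seconds of signal


def linear_lag(signal, alpha=0.2):
    """First-order lag: each output step moves a fixed fraction toward the input."""
    out, y = [], 0.0
    for x in signal:
        y += alpha * (x - y)
        out.append(y)
    return out


def saturating_lag(signal, alpha=0.2, limit=1.0):
    """The same lag followed by a hard limit on the output (a nonlinear element)."""
    return [max(-limit, min(limit, y)) for y in linear_lag(signal, alpha)]


def superposition_mismatch(system, a, b):
    """Largest difference between the response to a + b and the sum of the separate responses."""
    combined = system([x + y for x, y in zip(a, b)])
    separate = [x + y for x, y in zip(system(a), system(b))]
    return max(abs(c - s) for c, s in zip(combined, separate))


wave_a = [math.sin(2 * math.pi * 0.5 * i * DT) for i in range(N)]
wave_b = [math.sin(2 * math.pi * 1.0 * i * DT) for i in range(N)]

linear_mismatch = superposition_mismatch(linear_lag, wave_a, wave_b)
nonlinear_mismatch = superposition_mismatch(saturating_lag, wave_a, wave_b)
print(f"linear system mismatch:    {linear_mismatch:.2e}")
print(f"nonlinear system mismatch: {nonlinear_mismatch:.3f}")
```

For the lag alone the two sides agree to within rounding error; with the limiter added they differ wherever the summed response exceeds the limit.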


APPLICATIONS OF LINEAR FREQUENCY THEORY 


Before examining the application of linear frequency theory to 
human motor behavior, let us look briefly at some of its practical uses in 
the design of systems, since these contribute to an understanding of the 
questions concerning human behavior which are relevant in terms of this 
theory. First, notice that the curve in Fig. 2 indicates that the machine it 
describes will act as a low-pass filter. Any input frequency in the range 
c will be transmitted with decreased amplitude and any frequency in the
range d will be eliminated entirely as the signal passes through the system 
or component represented. This characteristic of a machine may be 
utilized to correct certain “errors,” e.g., to eliminate unwanted high- 
frequency oscillation. This correction is made at the expense of sensitivity
to corresponding frequencies in the input, which may or may not be 
desirable. It is desirable to exclude high-frequency “static” produced by 
atmospheric electricity in a radio transmission system but to do so by the 
means we have just described undesirably reduces the fidelity of repro- 
duction of the input introduced at the microphone by the musician or 
speaker. A low critical frequency is desirable in the steering mechanisms 
of an automobile in order to reduce or eliminate the high-frequency 
tremor of the driver’s hand or bumps in the road, but the critical fre-


FREQUENCY THEORY AND HUMAN BEHAVIOR


These examples suggest some of the uses for frequency analysis in the 
design of systems. Aside from the manipulation of the connecting
network which determines the flow of signals and the interaction of compo-
nents, e.g., in feedback loops, a system is constructed to perform its re- 
quired function by selection of components with the desired frequency 
response characteristics. If the response characteristics of available com- 
ponents are known, the response of the system as a whole to any input 
can be calculated and the performance of alternative systems may be 
compared before their construction. This fact, which has very great 
economic consequences, has led to the questions concerning the fre- 
quency characteristics of human behavior which the systems engineer has 
asked the psychologist to answer. The design engineer's decision to use 
a human component rather than a machine component in particular 
systems can be made with confidence only if the response characteristics 
of the human component as well as machine components are known. In 
order to permit a meaningful comparison of potential human and ma- 
chine components, however, the response characteristics of both types of 
component must be expressed in terms of the same variable which in this 
case is the time function specified by frequency analysis. This fact, it may 
be mentioned parenthetically, suggests the reason that engineers respon- 
sible for the basic design of dynamic man-machine systems have largely 
ignored most of the psychologists’ carefully collected data on human 
motor behavior. Most of the research performed by psychologists in this 
field utilizes time-on-target or integrated error score as a dependent vari- 
able. Information provided by such research is completely useless to the 
design engineer. In the case of either score the response is integrated 
over time and frequency analysis is applicable only to the unintegrated 
data, i.e., to the response as a detailed function of time. Psychologists, 
including the author, have often gone to a great deal of trouble to con- 
struct integrating scoring devices for use in motor behavior studies. From 
the point of the design engineer, these devices destroy the aspect of the 
signal which is of greatest value to him, namely, its time-varying charac- 
teristics, including the possibility of determining its component frequen- 
cies. Once a time-varying signal has passed through a device which inte- 
grates over time, the crucial time function is irretrievably lost. 
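
The loss described here can be made concrete with two hypothetical error records. In the sketch below, which uses illustrative values chosen here rather than data from any study, a slow and a fast tracking error of equal amplitude receive nearly the same integrated error score, although a frequency analysis of the unintegrated records separates them at once.

```python
import math

DT = 0.01   # hypothetical sampling interval, seconds
N = 1000    # ten seconds of record


def sine_error(freq_hz, amplitude=0.5):
    """A tracking error record consisting of a single sine component."""
    return [amplitude * math.sin(2 * math.pi * freq_hz * i * DT) for i in range(N)]


def integrated_error(samples):
    """Integrated absolute error: the kind of score an integrating device reports."""
    return sum(abs(e) for e in samples) * DT


def dominant_frequency(samples, max_bin=50):
    """Frequency (cycles per second) of the strongest sine component, by direct Fourier sums."""
    duration = len(samples) * DT
    best_bin, best_power = 0, -1.0
    for k in range(1, max_bin + 1):
        w = 2 * math.pi * k / len(samples)
        re = sum(s * math.cos(w * i) for i, s in enumerate(samples))
        im = sum(s * math.sin(w * i) for i, s in enumerate(samples))
        power = re * re + im * im
        if power > best_power:
            best_bin, best_power = k, power
    return best_bin / duration


slow = sine_error(0.2)
fast = sine_error(2.0)
for name, record in (("slow", slow), ("fast", fast)):
    print(name, round(integrated_error(record), 3), dominant_frequency(record))
```

The two integrated scores are practically identical, so the score alone cannot tell the design engineer which of the two time functions produced it.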

In requesting information concerning the frequency characteristics 
of human behavior, the systems engineer has in effect handed the psy- 
chologist a mathematical model and suggested coordinating definitions 
which are sufficient to convert the model into an S-R theory. These 
definitions specify input and output as a tracking task (stimulus) variable 
and a tracking response variable, time functions of each of these, a relationship
between them, and a set of limiting conditions. If counterparts
of the relationships between input and output variables specified by this 
model can be observed consistently between stimulus and response of 
man under any single set of limiting conditions, then the model becomes 
an empirical law of human behavior. If relationships described by the 
model are found for many different situations, e.g., for many tracking 
responses, task inputs, and limiting conditions, and these relationships are 
adequately systematized, then the model becomes a theory of human be- 
havior. (Since the model in question systematically describes relationships 
among a variety of input and output variables of a variety of machines it 
is already properly described as a theory of machine system behavior.) 
The limiting conditions mentioned above are variables that are known to 
affect the relationships described by the theory but are not necessarily 
included in it as variables. In a given application of the model, they 
must be held constant or to a range within which their effect is negligible. 

A simple and most concise statement of the general form of the
frequency analysis model was prepared several years ago at my request 
by Dr. David Gilbarg of the Graduate Institute for Mathematics and 
Mechanics at Indiana University and published together with a com- 
mentary in which I attempted to present some applications to tracking 
behavior in nonmathematical language [6]. Since Dr. Gilbarg’s presentation
is brief and I could not improve upon it, I shall reproduce it
verbatim:


The method of operational and frequency analysis described in the body 
of this report is based on certain mathematical facts which are roughly 
summarized as follows: If the response of a linear system to certain “ele- 
mentary” stimuli or inputs is known, the response to any input, however 
complex, can be predicted. The purpose of this Appendix is to formulate 
these mathematical facts in a form in which they can be used for actual 
calculation. 

Let a system, such as the human operator, transform an input function, 
f(t), into the output, g(t). For the purpose of brevity we shall state this 
symbolically in operational notation, T[f(t)] = g(t), where T is the “oper- 
ator” which transforms f(t) into g(t). The system is called linear if, for 
arbitrary inputs $f_1(t)$ and $f_2(t)$, and constants a, b,

$$T[a f_1 + b f_2] = a\,T[f_1] + b\,T[f_2] = a g_1 + b g_2 \tag{1}$$

where $g_1(t)$, $g_2(t)$ are the outputs corresponding to $f_1(t)$, $f_2(t)$. We shall
demand, in addition, of the linear system, that if f(t, w) is a family of inputs,
depending on the parameter w (e.g., a family of sine waves, sin wt, of different
frequencies), then

$$T\left[\int a(w)\, f(t,w)\, dw\right] = \int a(w)\, T[f(t,w)]\, dw = \int a(w)\, g(t,w)\, dw \tag{2}$$

where T[f(t,w)] = g(t,w).

The two classes of “elementary” inputs most frequently used in analyzing
linear systems are (1) the sine waves, and (2) the step functions. The 
analysis based on these two classes is discussed here. 

Let F(t) be an arbitrary input function. If F(t) satisfies certain general 
mathematical conditions, it is possible to express it as an integral sum of 
sines and cosines by means of the Fourier integral theorem,* as follows: 


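Let

$$a(w) = \frac{1}{\pi}\int_{-\infty}^{+\infty} F(x)\,\sin wx\; dx, \qquad b(w) = \frac{1}{\pi}\int_{-\infty}^{+\infty} F(x)\,\cos wx\; dx;$$

then

$$F(t) = \int_{0}^{\infty} \bigl(a(w)\sin wt + b(w)\cos wt\bigr)\, dw \tag{3}$$

It follows from the linearity of the operator, as expressed in Eq. (2), that

$$T[F(t)] = \int_{0}^{\infty} \bigl[a(w)\,T(\sin wt) + b(w)\,T(\cos wt)\bigr]\, dw \tag{4}$$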


Thus, if the responses T(sin wt), T(cos wt) are known for every value of w,
the output T[F(t)] can be calculated by means of this equation. 

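If F(t) happens to be a periodic function of period $\tau$, the above method
of the Fourier integral simplifies to analysis in terms of harmonic components.
We have in this case,†

$$F(t) = a_0 + \sum_{n=1}^{\infty}\left(a_n \sin\frac{2\pi n t}{\tau} + b_n \cos\frac{2\pi n t}{\tau}\right) \tag{5}$$

where

$$a_n = \frac{1}{\tau}\int_{-\tau}^{+\tau} F(x)\,\sin\frac{2\pi n x}{\tau}\, dx, \qquad b_n = \frac{1}{\tau}\int_{-\tau}^{+\tau} F(x)\,\cos\frac{2\pi n x}{\tau}\, dx, \qquad n \ge 1,$$

and the response to F(t) is

$$T[F(t)] = T(a_0) + \sum_{n=1}^{\infty}\left[a_n\,T\!\left(\sin\frac{2\pi n t}{\tau}\right) + b_n\,T\!\left(\cos\frac{2\pi n t}{\tau}\right)\right] \tag{6}$$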

Hence, if the outputs to all harmonics are known, the response to F(t) can
be calculated by means of this sum.

* See, for example, Courant, Differential and Integral Calculus. Vol. 2, pp. 318ff.

† See, for example, Courant, Differential and Integral Calculus. Vol. 1, pp. 437ff.

If the basis of the analysis is to be the response to step functions, the
procedure is as follows: Let u(t) represent the unit step function,

$$u(t) = \begin{cases} 0, & t < 0 \\ 1, & t \ge 0 \end{cases}$$
Let A(t) = T[u(t)] be the response to the unit step input. It will be assumed
that, if the step input is applied at $t = \tau$, the response is simply displaced
by the amount $\tau$, i.e., $T[u(t - \tau)] = A(t - \tau)$. Now consider any
input function F(t); then the response of the linear system is given by*

$$T[F(t)] = F(0)\,A(t) + \int_{0}^{t} \frac{dF(\tau)}{d\tau}\, A(t - \tau)\, d\tau \tag{7}$$

Thus, if the response to the unit step function is known, the right hand side
can be calculated for any input F(t), and the output determined.

The formulas (4), (6), and (7) are the means of predicting the response
of a linear system to any input, (4) and (6) using the response to
sine waves as basis, and (7) the response to a step function. The two methods
are equivalent when they are both valid [6].


Dr. Gilbarg’s references to the analysis of complex input and output 
functions into step function components may be ignored for our purposes 
here except to say that the general treatment is the same as in the linear 
frequency model and that the two forms of the model are mathematically 
translatable, since a step function can be reduced to a set of sine wave 
components and vice versa. 
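
The step-function form of the model can be tried on a case with a known answer. The sketch below assumes, purely for illustration, a first-order lag with a time constant of 0.5 second, whose unit step response is 1 - exp(-t/0.5). It evaluates the step-response integral of Eq. (7) numerically for a ramp input and compares the result with the textbook ramp response of such a lag.

```python
import math

TIME_CONSTANT = 0.5  # hypothetical lag, seconds


def step_response(t):
    """A(t): response of the assumed first-order lag to a unit step applied at t = 0."""
    return 1.0 - math.exp(-t / TIME_CONSTANT) if t >= 0 else 0.0


def predicted_response(input_fn, t, steps=2000):
    """F(0) A(t) plus the integral from 0 to t of F'(s) A(t - s) ds, by the midpoint rule."""
    ds = t / steps
    total = input_fn(0.0) * step_response(t)
    for i in range(steps):
        s = (i + 0.5) * ds
        slope = (input_fn(s + ds / 2) - input_fn(s - ds / 2)) / ds
        total += slope * step_response(t - s) * ds
    return total


def ramp(t):
    return t


predicted = predicted_response(ramp, 2.0)
exact = 2.0 - TIME_CONSTANT * (1.0 - math.exp(-2.0 / TIME_CONSTANT))
print(f"predicted from step response: {predicted:.6f}")
print(f"known ramp response:          {exact:.6f}")
```

Only the step response enters the prediction, yet the two values agree closely, which is the sense in which one "elementary" response suffices for a linear system.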

Considering only the frequency form of the general linear model it 
is apparent that it is applicable to relationships between stimuli and 
responses which vary continuously in a single dimension. To make this 
application the input function f(t) is defined in terms of a particular 
stimulus variable and the output function g(t) in terms of a tracking 
response, both being stated as functions of time. Empirical tests of the 
resulting theory require a determination of the transform T, which is 
most easily done experimentally by presenting a single sine wave input 
of constant amplitude at each frequency over the range of interest, e.g.,
from zero frequency to the frequency above which the amplitude ratio is
zero, and determining the amplitude of the resulting sine form tracking 
responses associated with each input frequency presented. The transform 
T may be represented by expressing amplitude ratio and phase shift 
(delay in transmission through the system) as functions of input fre- 
quency. Once having determined the transform, a variety of complex 
input functions f(t) are presented and the corresponding output functions
T[f(t)] are computed and compared with observed outputs g(t).
To the extent that there is agreement between computed and observed
outputs the condition of linearity is satisfied, and the theory is verified
for the set of limiting conditions represented in the empirical test.

* See, for example, Karman and Biot, Mathematical Methods in Engineering. p. 403.
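
The test procedure just outlined can be rehearsed on a simulated system. In the sketch below a first-order lag is used as a hypothetical stand-in for the tracker; the sampling interval, frequencies, amplitudes, and function names are all assumptions made here. The transform is first measured with single sine inputs, the response to a two-component input is then computed from it, and the computed output is compared with the "observed" one.

```python
import math

DT = 0.01       # sampling interval, seconds
SETTLE = 1000   # samples discarded while the start-up transient dies out
TOTAL = 3000    # leaves 20 seconds of steady-state record


def system(signal, alpha=0.2):
    """Hypothetical linear stand-in for the tracker: a first-order lag."""
    out, y = [], 0.0
    for x in signal:
        y += alpha * (x - y)
        out.append(y)
    return out


def sine(freq_hz, amplitude=1.0):
    return [amplitude * math.sin(2 * math.pi * freq_hz * i * DT) for i in range(TOTAL)]


def measure_transform(freq_hz):
    """In-phase and quadrature parts of the steady-state response to a unit sine input."""
    y = system(sine(freq_hz))
    m = TOTAL - SETTLE
    a = 2 / m * sum(y[i] * math.sin(2 * math.pi * freq_hz * i * DT) for i in range(SETTLE, TOTAL))
    b = 2 / m * sum(y[i] * math.cos(2 * math.pi * freq_hz * i * DT) for i in range(SETTLE, TOTAL))
    return a, b


components = {0.5: 1.0, 2.0: 0.5}   # frequency (cycles per second) -> input amplitude
transform = {f: measure_transform(f) for f in components}
gain = {f: math.hypot(a, b) for f, (a, b) in transform.items()}
phase = {f: math.atan2(b, a) for f, (a, b) in transform.items()}

complex_input = [sum(col) for col in zip(*(sine(f, amp) for f, amp in components.items()))]
observed = system(complex_input)
predicted = [
    sum(
        amp * (transform[f][0] * math.sin(2 * math.pi * f * i * DT)
               + transform[f][1] * math.cos(2 * math.pi * f * i * DT))
        for f, amp in components.items()
    )
    for i in range(TOTAL)
]
worst = max(abs(p - o) for p, o in zip(predicted[SETTLE:], observed[SETTLE:]))
print("amplitude ratios:", {f: round(g, 4) for f, g in gain.items()})
print("largest disagreement after settling:", worst)
```

Because the simulated system is linear by construction, computed and observed outputs agree to within rounding error; for a human tracker the size of the disagreement is the empirical question.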


EXPERIMENTAL TESTS OF FREQUENCY THEORY 
FOR HUMAN BEHAVIOR 


The experimental conditions under which this theory has been tested 
so far are relatively uniform and, from the point of view of the subject, 
rather simple. The typical experimental apparatus displays two indica- 
tors, such as a pointer and a vertical line, one or both of which move 
horizontally. The subject manipulates a control, usually a stick or lever, 
sometimes a wheel, movement of which affects the position of one of the 
indicators. His task is to keep the two indicators continually in alignment.
There are two basic types of tasks, pursuit and compensatory tracking. 
In pursuit tracking, an input indicator moves independently of any ac- 
tion of the subject. Using his manual control, the subject moves the 
second or output indicator so as to follow the input, attempting to 
minimize the error or discrepancy between the positions of the two 
indicators. Movements of the output indicator or of the subject’s manual 
control may be treated as output for purposes of analysis. Aiming a 
hand-held gun at a moving target is a nonlaboratory example of pursuit 
tracking, the target and sight corresponding to input and output indi- 
cators respectively and the gun itself to the control. 

In compensatory tracking, one indicator is fixed with respect to the 
tracker, usually in the center of his visual field. The movements of the 
second indicator are in effect the algebraic sum of an imposed input and 
some function of the output, the control movement of the subject. Beginning
with the two indicators superimposed, if the subject holds his
control stationary, any input produces error in the form of a discrepancy
between the fixed and moving indicators. The subject’s task is to move 
his control and the indicator controlled by it in such a way as to 
minimize or compensate for the error. Aiming a rotating turret 
with the human operator inside is an example of compensatory track- 
ing. A sight mounted on the turret provides the fixed indicator; the 
target functions as a moving indicator, its movements providing an 
input. The subject’s manipulations of the manual control device rotate 
the turret so as to reduce discrepancies between the positions of target 
and sight. 

From the point of view of frequency theory, the two types of tracking
are mathematically equivalent and are therefore theoretically interchangeable
at the convenience of the engineer or the experimenter. Psychologically,
the two might appear to be very different tasks, but although
both have been used in experimental work (usually in different
laboratories), no significant difference in the frequency characteristics
of tracking behavior because of this factor has so far been demonstrated.
However, the apparent absence of behavioral significance of this variable
may well be due to the crudity of the methods used so far.

Since the basic theoretical problem concerns the relationship between 
input and output as time functions, precise knowledge of the input 
characteristics is required. Inputs used experimentally have chiefly taken 
three forms: 

1. White noise, in which the distributions of frequencies, amplitudes, 
and phase relationships of components are defined only statistically. 

2. Complex waves composed of a few sine-form components of 
known frequency, amplitude, and phase relationships. 

3. Simple sine waves of constant frequency and amplitude. 
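
The three forms of input listed above can be sketched as sampled signals. This is an illustration only: the sampling interval, frequencies, amplitudes, and phases are hypothetical, and seeded Gaussian samples stand in for white noise, which in a real apparatus would be band-limited.

```python
import math
import random

DT = 0.01   # hypothetical sampling interval, seconds
N = 1000    # ten seconds of input


def white_noise(sigma=1.0, seed=0):
    """Input specified only statistically: independent Gaussian samples."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, sigma) for _ in range(N)]


def complex_wave(components=((0.2, 1.0, 0.0), (0.7, 0.5, 1.0), (1.3, 0.25, 2.0))):
    """Sum of a few sines; each component is (frequency, amplitude, phase in radians)."""
    return [
        sum(a * math.sin(2 * math.pi * f * i * DT + p) for f, a, p in components)
        for i in range(N)
    ]


def simple_sine(freq_hz=0.5, amplitude=1.0):
    """Single sine wave of constant frequency and amplitude."""
    return [amplitude * math.sin(2 * math.pi * freq_hz * i * DT) for i in range(N)]


noise = white_noise()
print("noise mean:", round(sum(noise) / N, 3))
print("complex wave peak:", round(max(abs(v) for v in complex_wave()), 3))
print("simple sine peak:", round(max(simple_sine()), 3))
```

The second and third inputs are fully determined by their listed components, whereas two noise records drawn with different seeds share only their statistical description.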

A number of technical problems which will not be discussed here are 
associated with the decision to use inputs that are known statistically 
rather than inputs that are fully defined. The frequency analysis research 
performed at the Indiana laboratory has been restricted to inputs of 
known composition.

In practice, empirical application of the linear frequency model to 
human tracking behavior has not been at all simple. A major difficulty
is the magnitude of the data reduction problem, which exceeds that to 
which psychologists are accustomed. Adequate treatment of a two-minute 
sample of tracking may require several hundred hours of hand computa- 
tion. As one consequence, large-scale studies are not common: a single 
study sponsored by the United States Air Force and carried out under 
the direction of Dr. O. J. Benepe [2] probably includes half the available 
data on tracking behavior of the human operator which has been ana- 
lyzed according to the requirements of frequency analysis. This study is 
based on 221 power spectra obtained for 99 subjects, together with a 
number of summarizing analyses based on autocorrelation techniques. 
Most other studies have been limited to data obtained from one or a few 
subjects or to less detailed analyses. 


LINEARITY OF HUMAN RESPONSE 


The results of empirical studies have so far been largely negative. It 
may be stated categorically that linearity of response over the entire 
range of interest for tracking input frequencies (from zero to approxi- 
mately 10 cycles per second) has not been demonstrated for any set of 
limiting conditions. In addition there is ample evidence of nonlinear 
relationships between tracking stimuli and tracking responses. However, 
it does not follow, as some have concluded, that man is a nonlinear system.
We would not conclude that a pendulum was a nonlinear system if
we tested it outdoors in a thunderstorm and found that the relationship
between horizontal movements of its pivot and its bob did not satisfy 
the conditions of linearity. We should properly conclude that this partic- 
ular input-output relationship was not linear under the limiting condi- 
tions of the environment provided by a thunderstorm, but we should 
leave open the question whether (a) the pivot-bob movements were 
linearly related under other limiting conditions, and (b) the movements 
of the bob were linearly related to a more inclusive input, including, for 
example, the movement of the pivot and forces applied by the wind. 
This analogy suggests the status of research on the linearity of the human 
operator today. Under the conditions that have been examined there is 
positive evidence of nonlinearity in the relationship between certain in- 
puts and outputs. But only a few conditions have been investigated and 
there are tantalizing hints in the available data that a very high degree 
of linearity may be demonstrated (a) under other limiting conditions 
that have not yet been examined in the laboratory, and (b) under the 
same limiting conditions when more inputs than movements of a visual 
tracking stimulus are controlled or taken into account. 

First, let us examine the evidence of nonlinearity. Some of this evi- 
dence is obvious without resort to laboratory demonstration. We know, 
for example, that we can instruct a subject (a) to track or (b) not to 
track and that he will do as instructed. This is a most obvious change of 
the transform T from whatever it may be when tracking to an amplitude 
ratio of zero for all frequencies when not tracking, indicating that instruc- 
tions are one of the critical limiting conditions. We know also that a man 
can steer a bicycle down a curved path with his hands on the grips (a) 
at the ends of the handle bars or (b) near the steering column, which 
implies different input-output amplitude ratios in each case, i.e., that T 
varies if we define output as hand movement. He can also ride a bicycle 
with his arms crossed, i.e., with his right hand on the left grip and left 
hand on the right one, in which case if output is measured as right-hand 
movement the sign of the transform is changed. It is clear therefore that 
task conditions markedly affect the transform T, indicating that man is 
not a simple system giving a uniform response to the particular portion 
of his environment we call the tracking input. These facts are taken for 
granted in all laboratory research, and the attempt is made to hold in- 
struction and other task-variables constant during any test of linearity. 
However, given apparent constancy of these limiting environmental con- 
ditions there is further evidence of nonlinearity. Linearity requires that 
the same transform be recoverable when inputs differ in composition, 
and it goes almost without saying that this is possible only if the responses 
to inputs of the same composition presented at different times are the 
same. The transform must be a process which is stationary in time, a re- 
quirement which can be tested by presenting any input of fixed composi- 
tion repeatedly or continuously under constant conditions and comparing 
two or more samples of the responses obtained. Differences in response 
to an input of constant composition, which provide clear evidence of a 
nonstationary T in a highly uniform limiting environment, have been 
demonstrated over a long period of time during which learning occurred 
and also during a relatively short period of tracking after the learning 
asymptote (as indicated by integrated error scores) had been reached 
[2, exp. II and III]. Even more obvious evidence of nonlinearity is pro- 
vided in a study [7] in which inputs consisting of single frequencies 
ranging from 0.5 cycles per second to 4 cycles per second were presented 
at two amplitude levels. It was found that input-output amplitude ratios 
decreased by approximately 10 per cent when input amplitudes were tripled. 
Further, at the higher frequencies most subjects responded at frequencies 
higher than the input frequency, an effect indicating a rather startling 
instability of T above the critical frequency. At these frequencies the sub- 
jects acted as though they responded to the amplitude of the input signal 
but not to its frequency: they duplicated the amplitude characteristic 
only. 
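
The amplitude-ratio criterion used in this kind of test can be sketched as follows. Both "operators" are hypothetical memoryless functions chosen for illustration (a fixed gain of 0.8 and a tanh compression); they are not models fitted to the data of [7], and the sampling rate, record length, and test frequency are likewise assumptions. A linear system returns the same input-output amplitude ratio at both input amplitudes, while the compressive one returns a smaller ratio when the input is tripled.

```python
import math


def amplitude_at(signal, freq_hz, sample_rate_hz):
    """Amplitude of the component at freq_hz, by a single-bin Fourier sum.
    Exact when the record holds a whole number of cycles of freq_hz."""
    n = len(signal)
    re = sum(x * math.cos(2 * math.pi * freq_hz * i / sample_rate_hz)
             for i, x in enumerate(signal))
    im = sum(x * math.sin(2 * math.pi * freq_hz * i / sample_rate_hz)
             for i, x in enumerate(signal))
    return 2.0 * math.hypot(re, im) / n


def amplitude_ratio(system, amp, freq_hz=0.5, sample_rate_hz=50.0, seconds=20.0):
    """Output/input amplitude ratio at the input frequency."""
    n = int(seconds * sample_rate_hz)
    x = [amp * math.sin(2 * math.pi * freq_hz * i / sample_rate_hz)
         for i in range(n)]
    y = [system(v) for v in x]
    return (amplitude_at(y, freq_hz, sample_rate_hz)
            / amplitude_at(x, freq_hz, sample_rate_hz))


def linear_operator(v):
    return 0.8 * v                      # constant multiplier


def compressive_operator(v):
    return math.tanh(0.3 * v) / 0.3     # gain falls as amplitude grows


linear_small = amplitude_ratio(linear_operator, 1.0)
linear_large = amplitude_ratio(linear_operator, 3.0)
compressive_small = amplitude_ratio(compressive_operator, 1.0)
compressive_large = amplitude_ratio(compressive_operator, 3.0)

print(linear_small, linear_large)
print(compressive_small, compressive_large)
```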

Another indication of nonlinearity, related to the last example, is 
the occurrence of “noise,” i.e., frequencies in the output which are not 
present in the input. This noise or error may be random in terms of 
frequency, resembling thermal noise or “static,” or it may be especially 
prominent at certain frequencies. An example of the latter type of error 
is the 60-cycle hum in a radio amplifier. Irregular tracking error, familiar 
to all investigators of tracking responses, is evidence of “random” noise. 
A number of investigators have also pointed to a prominent error com- 
ponent at approximately 1 cycle per second which appears in the output 
of the human tracker when it is completely absent from the input. The 
characteristic tremor frequency of approximately 10 cycles per second 
might be expected to appear in human outputs, but this frequency is 
considerably above the critical frequency of most apparatus used in track- 
ing studies and in consequence has seldom been reported as tracking 
error. The noise or error which is identified by spectral analysis in the 
Benepe report, corroborating reports of other analyses, includes “ran- 
dom” error, error with a frequency near 1 cycle per second, and error 
with frequencies corresponding to those of input components. In addi- 
tion a 5-cycle-per-second error component appeared under some condi- 
tions, e.g., when other error was suppressed. 
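
A minimal sketch of how such "noise" is identified by spectral comparison is given below. The simulated output (a 0.9 gain on the input plus a small 1 cycle per second component), the sampling rate, the probe frequencies, and the threshold are all illustrative assumptions, not values from the reports cited.

```python
import math


def amplitude_at(signal, freq_hz, sample_rate_hz):
    """Amplitude of the component at freq_hz, by a single-bin Fourier sum."""
    n = len(signal)
    re = sum(x * math.cos(2 * math.pi * freq_hz * i / sample_rate_hz)
             for i, x in enumerate(signal))
    im = sum(x * math.sin(2 * math.pi * freq_hz * i / sample_rate_hz)
             for i, x in enumerate(signal))
    return 2.0 * math.hypot(re, im) / n


sample_rate_hz = 20.0
n = 800                                   # 40 seconds of record
t = [i / sample_rate_hz for i in range(n)]

# Input: a single sine wave at 0.25 cycle per second.
tracking_input = [math.sin(2 * math.pi * 0.25 * s) for s in t]

# Simulated output: a scaled copy of the input plus a component near
# 1 cycle per second that the input does not contain.
output = [0.9 * x + 0.05 * math.sin(2 * math.pi * 1.0 * s)
          for x, s in zip(tracking_input, t)]

probe_hz = [0.25, 0.5, 1.0, 2.0]
threshold = 0.01

# Frequencies present in the output but absent from the input.
extra = [f for f in probe_hz
         if amplitude_at(output, f, sample_rate_hz) > threshold
         and amplitude_at(tracking_input, f, sample_rate_hz) < threshold]

print(extra)
```

Every probe frequency completes a whole number of cycles in the 40-second record, so the single-bin sums do not leak into one another.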

All of this might appear to be overwhelming evidence that the human 
being performing a tracking task is best described as nonlinear. However, 
it may be pointed out that most of the evidence of nonlinearity just pre- 
sented is associated with variables not mentioned in the theory. A theory 
which states a relationship between input and output is not invalidated 
by evidence that output is related to other variables. The fact that 
changes in instructions and changes in posture (e.g., in operating a bi- 
cycle) affect output characteristics is irrelevant to the question of line- 
arity. The knowledge that these variables do affect output simply means 
that these are system variables included in S which must be held constant 
in testing for a linear relationship between input and output variables. 
The evidence for changes in T, the input-output relationship, under fixed 
task conditions is restricted to very few of the possible configurations of 
limiting conditions—the actual amount of research investigating line- 
arity in the human operator while tracking is almost infinitesimal in com- 
parison with the amount of research which has been conducted to de- 
termine the response characteristics of machines. And finally, the reported 
tracking error and other nonlinearity of the human operator under many 
circumstances are no greater than those of many machines considered to 
be linear in terms of practical standards. 

The human tracking error is likely to become important in practical 
situations only when it occurs in high-amplification systems requiring 
the man to respond to maximally high frequencies, and especially when 
the human or scientific or economic or military penalties for even small 
error are great. In such cases we build research equipment which ampli- 
fies the error so that we can examine it in detail. But we should not lose 
perspective merely because we look at the error through a research 
microscope and its consequences through a social or a physical ampli- 
fier. The error which appears as nonlinearity in spectral analysis may be, 
and in published research often is, less than 1 or 2 per cent of the total 
signal, decreasing relatively as input amplitude increases. Consider the 
expected performance of a man driving a car at a uniform 20 mph, 8 
hours a day from Denver to San Francisco with all traffic cleared from 
the road. With proper instruction and motivation, how often would we 
expect his front wheels to leave the road? Assuming that he could per- 
form this task at least once without ever leaving the road, we have 
described a set of limiting conditions under which the deviations from 
linearity are extremely slight. The ratio of the width of the road to the 
wandering of the road, which represents the maximum proportion of 
nonlinearity under these conditions, is obviously very small, whether or 
not we consider it negligible for scientific or social purposes. Over the 
range of input frequencies defined by the curves in the road taken at the 
rate of 20 mph, a man driving a car is a highly linear system with a 
uniform amplification ratio of 1. Since at 20 mph the path of the car 
is relatively rigidly related to the movements of the steering wheel we 
may conclude that this measure of linearity of the transform for the sys- 
tem as a whole is equally characteristic of the transform for its human 
component. 

There are many similar situations in which a human output very 
closely approximates a linear transform of a stimulus input. These are 
apparently limited to inputs containing no frequencies higher than ap- 
proximately 0.5 cycle per second. Above this frequency, linearity is more 
subject to question. 

There is one extremely interesting suggestion [2, exp. IV] of limiting 
conditions which result in marked reduction of the error occurring at low 
input frequencies. In a pursuit tracking task, a single sine wave input of 
a cycle per second was presented as a visible pointer movement (a) to- 
gether with auditory stimulation consisting of the uncontrolled ambient 
noise of the laboratory, (b) together with a tone varying in (auditory) 
frequency at 0.25 cycle per second, i.e., auditory stimulation not in 
synchrony with the visual input, and (c) together with a tone varying 
regularly in auditory frequency at ¥% cycle per second, the variation in 
auditory frequency being synchronized with the movements of the pointer 
providing the visual input. Under condition (c) error was markedly 
reduced, not only that near the input frequency and 1 cycle per second, 
but throughout the entire frequency spectrum examined, from 0.2 to 
6.0 cycles per second. This is an isolated result, but the reduction in 
error is great and is consistent for 16 subjects. It strongly suggests that 
better control of stimulation through other modalities than that receiving 
the arbitrarily defined “input” may contribute to successful application 
of the linear model to human behavior. 

In any case, it is quite obvious that if we consider the relationship of 
a single input to a single response output in the human operator without 
regard for other variables, he is a nonlinear system. But any system 
which has alternative transforms, such as a linear amplifier with an 
on-off switch, is nonlinear if we ignore conditions which select the 


being selected by means of variables other than the immediate single- 
dimension characteristic of a stimulus arbitrarily called “the” input for 
purposes of a given system or experiment. The first question to be 
answered in applying the linear frequency analysis model to human 
behavior is not “Is man a linear system?” but “Are there any conditions 
under which linearity of input-output relationship can be shown for 
man?” The answer to the latter question is certainly positive in spite 
least as linear as many of the machines which are considered to be 
linear. Under these conditions of linear operation, the transform T is a 
constant multiplier. The associated error is sufficiently small to be 
negligible for many technological purposes, and there are suggestions that 
the stricter requirements of scientific theory verification may be satisfied 
by more carefully controlled empirical research. 
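
When T is a constant multiplier, both its value and its stationarity can be checked with very simple arithmetic. The sketch below is an illustration under stated assumptions: the two simulated operators (one with a fixed gain of 1, one whose gain drifts from 0.6 toward 1.0 as if learning were still in progress), the noise level, and the helper names `estimate_gain` and `half_gains` are all hypothetical. It estimates T by least squares on each half of a record and compares the two estimates.

```python
import math
import random


def estimate_gain(inp, out):
    """Least-squares constant multiplier T such that out is roughly T * inp."""
    return sum(a * b for a, b in zip(inp, out)) / sum(a * a for a in inp)


def half_gains(inp, out):
    """Gain estimates from the first and second halves of a record."""
    mid = len(inp) // 2
    return (estimate_gain(inp[:mid], out[:mid]),
            estimate_gain(inp[mid:], out[mid:]))


rng = random.Random(1)
n = 400
inp = [math.sin(2 * math.pi * i / 40) for i in range(n)]

# Stationary operator: fixed gain of 1 with a little irregular error.
stationary = [1.0 * x + rng.gauss(0, 0.02) for x in inp]

# Nonstationary operator: gain rises steadily over the record.
drifting = [(0.6 + 0.4 * i / n) * x + rng.gauss(0, 0.02)
            for i, x in enumerate(inp)]

s_first, s_second = half_gains(inp, stationary)
d_first, d_second = half_gains(inp, drifting)

print(s_first, s_second)   # nearly equal: T is stationary
print(d_first, d_second)   # clearly different: T changed during the record
```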


METHODOLOGICAL COMMENTARY 


Since neither the theoretical treatments of systems-engineering con- 
cepts and their application to human behavior nor the reports of sup- 
porting empirical research, even that conducted by psychologists, are 
generally available in psychological journals, I have tried in the preced- 
ing section to summarize the engineering context, the basic theoretical 
notions, and some of the relevant empirical data, mentioning methodo- 
logical matters only incidentally. In the present section linear frequency 
theory will be examined as an example of a limited theory of the behavior 
of systems. From the methodological point of view, whether the systems 
are called human beings or machines is irrelevant. 

Linear frequency theory is based on a mathematical model, i.e., a set 
of statements which are completely expressed in conventional mathe- 
matical notation. As a result, generally accepted rules are available 
which permit the derivation of certain statements of relationship among 
variable terms and do not permit the derivation of certain others. Ques- 
tions as to whether a given statement is or is not a consequence, i.e., 
whether it can or cannot be derived from the statements comprising the 
mathematical model, can be answered. The formal model is converted 
to a theory by the addition of a few empirical definitions which assert 
equivalence between (a) empirical operations and (b) terms and logical 
operations in the model or in statements derived from it. The variables 
defined by these operations may be directly observed or obtained from 
sets of directly observed values of such variables by “empirical,” i.e., 
observable, data reduction operations. To the extent that these opera- 
tions are unambiguously described, these definitions make it possible 
to specify that certain empirical consequences are derivable or are not 
derivable from the model. 

There are six basic coordinating definitions utilized to convert formal 
statements of the linear frequency model to an empirically testable form, 
i.e., to a theory. These define I, an input variable, O, an output variable, 
f(t) and g(t), time functions of each of these, T, the transform stating 
the relationship of f(t) and g(t), and S, the system and the conditions 
under which it operates. 

The input variable I is an empirical independent variable, defined by 
operations which provide instantaneous values of a variable such as 
position, distance, pressure, temperature, brightness, auditory frequency, 
etc., which is in the environment of the system. The output variable O 
is the corresponding empirical dependent variable, some response or 
activity of the system whose instantaneous values are measured. Although 
definition of these empirical variables is implied in every application of 
the theory to a particular system, the actual theoretical analysis is based 
on a characteristic abstracted from sequential sets of measurements of 
I and O, namely, their changes as a function of time. The temporal 
changes in I and O are presumably observable if I and O are ob- 
servable, but since direct observations (estimates) of time functions are 
likely to be highly ambiguous, the precise time functions are redefined 
as data reduction operations performed upon sequential sets of I and O 
measurements. The reduction of ambiguity which results is, of course, 
the primary function of such operational definitions. The particular 
operations chosen to define f(t) and g(t) are selected in terms of the 
second criterion applied to definitions in science, namely, their use- 
fulness. Particular data reduction operations which have defined f(t) 
and g(t) usefully—in this case by permitting demonstrations of their 


appropriate frequency, amplitude, and in some cases phase relations. 
Two common operations are represented (a) by Fourier analysis, which 
reduces the input and output sequences to power spectra, and (b) by 
autocorrelation. The data reduction may be performed by the instrument 
used in observation, e.g., a wave analyzer, in which case f(t) and g(t) 
are directly observed empirical variables rather than constructs, and 
I and O, which may not be observed at all, become somewhat vestigial 
constructs. 
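
The two data reduction operations named above can be sketched in a few lines. The record here is an assumed sine wave with a period of 20 samples, and the function names are illustrative; a wave analyzer or a library routine would serve the same purpose. The power spectrum locates the frequency at which the record's power is concentrated, and the autocorrelation shows the same periodicity as a function of lag.

```python
import math


def power_spectrum(x):
    """Power at each discrete frequency bin 0..n/2 (direct Fourier sums)."""
    n = len(x)
    spectrum = []
    for k in range(n // 2 + 1):
        re = sum(v * math.cos(2 * math.pi * k * i / n) for i, v in enumerate(x))
        im = sum(v * math.sin(2 * math.pi * k * i / n) for i, v in enumerate(x))
        spectrum.append((re * re + im * im) / (n * n))
    return spectrum


def autocorrelation(x, lag):
    """Normalized autocorrelation of x at the given lag (in samples)."""
    n = len(x)
    mean = sum(x) / n
    num = sum((x[i] - mean) * (x[i + lag] - mean) for i in range(n - lag))
    den = sum((v - mean) ** 2 for v in x)
    return num / den


period = 20                      # samples per cycle (assumed)
n = 400                          # sequential measurements of I or O
record = [math.sin(2 * math.pi * i / period) for i in range(n)]

spectrum = power_spectrum(record)
peak_bin = max(range(len(spectrum)), key=spectrum.__getitem__)

r_full = autocorrelation(record, period)        # one full cycle apart
r_half = autocorrelation(record, period // 2)   # half a cycle apart

print(peak_bin, r_full, r_half)
```

The peak falls in bin n / period, i.e., at 20 cycles per record. The autocorrelation is strongly positive at a lag of one period and strongly negative at half a period.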

In the context of the present theory little more need be said con- 
cerning the variables just discussed. However, it is interesting to note 
that if f(t) and g(t) are obtained by a manual operation—if they are 


g(t) are taken directly off the display of a wave analyzer (which is 
become directly observable variables. 


The two remaining variables T and S have a somewhat different 
status from those just discussed. In a certain sense, both are constants 


set of mutually exclusive classes, each of which is called a value.) T, the 
relationship between input and output variables, is asserted to be fixed 
in any specific application of the theory, i.e., when S, the system and 
its operating conditions, remains unchanged. T as a variable (an inter- 
vening variable?) is a set of sets of mathematical operations and param- 
eters, determined empirically for any given S by analysis of the relation- 
ship between f(t) and g(t). S is an empirical variable of a sort which 
has not been subject to detailed methodological examination. As a single 
variable representing systems in their environments of operating condi- 
tions its individual values (particular systems) are highly complex and 
qualitative. If we examine in detail any system and its operating condi- 
tions which constitute a single value of the qualitative variable S, we 
find that it can be defined as a configuration, the logical product of a 
large number of simpler variables, some of which may also be qualitative 
variables. Because of the complexity of qualitative variables such as S, 
it is customary in many scientific contexts to solve the problem they 
pose by treating them, temporarily, at least, as constants. The total com- 
plex S is held constant during detailed investigations of relationships 
among a few special variables such as I and O, or g(t) and f(t). This 
practice is followed in science generally as well as in systems-engineering 
research. Once the relationship between input and output has been 
determined for a given system and a given set of conditions, i.e., once 
the T associated with a particular S has been found and found to be 
constant, the next step is to make similar determinations of other values 
of T associated with other values of S. This is one of the processes in- 
volved in the generalization of a law or a theory. 

Insofar as S is treated as an undifferentiated qualitative variable, 
i.e., simply as a set of mutually exclusive classes, its relationship to dif- 
ferent values of T can only be expressed as a collection of statements of 
association between particular values of S and particular values of T. 
However, as a number of systems are investigated and statements of as- 
sociation between values of T and S accumulate, some order may appear, 
suggesting the possibility of summarizing a set of relationships between 
certain subclasses of S and T. For example, as the transforms of a 
number of pendulums accumulate, it might be noticed that if these 
pendulums (single values of S) are arranged in an order determined by 
the weight of the bob, the critical frequency (an aspect of T) increases 
as weight (an aspect of S) decreases. This analytic process has, of course, 
already occurred in the past history of frequency theory. In the complete 
theory, which we have not attempted to present, since it is coextensive 
with much of general physical theory, S is not treated as a constant, but 
as a variable or a set of variables. The relationships between many 
values of S and T have been determined, and it has become obvious that 
there are systematic relationships between certain aspects of S, such as 
mass of moving parts, and certain aspects of T, such as critical frequency. 
These relationships between subvariables in S and characteristics of T 
have been summarized mathematically in such a way that the complete 
frequency theory of systems includes the relationships between O and I 
or f(t) and g(t) as a function of many specific variables to which the 
qualitative S can be reduced. 

As a result of this process, the art of systems engineering, which re- 
quired the engineer to memorize or “intuit” the T functions associated 
with qualitatively described S values, is being supplanted by the science 
of systems engineering in which relationships between T and many 
possible characteristics of systems are explicitly stated. Insofar as these 
characteristics of the system are quantitative variables they make it pos- 
sible to take maximum advantage of the summarizing power of the 
quantitative language, mathematics, and permit computational solutions 
to many concrete problems of systems design. The later stages of this 
process, the generalization of relationships between quantitative aspects 
of S and T, have been limited almost exclusively to the machine com- 
ponents of systems. In the investigation of the performance of human 
components we are in a very early stage. We are collecting particular T 
values that are associated with particular qualitatively described S's. 


Linear frequency theory provides a means of organizing the collection 
procedure. 


APPENDIX: SUGGESTED DISCUSSION TOPICS 
FOR CONTRIBUTORS OF SYSTEMATIC 
ANALYSES¹ 


INTRODUCTION 


We will use the term “systematic formulation” to mean any set of 
sentences formulated as a tool for ordering empirical knowledge with 
respect to some specifiable domain of events, or furthering the dis- 
covery of such knowledge. As is evident in science in general and 
psychology in particular, such formulations may vary in their char- 
acteristics over a very wide range. These variations may reflect dif- 
ferences in the intentions of the systematist, limits imposed by the 
nature of the subject matter, by the status of knowledge about it and 
related domains, by the availability of techniques for ordering the 
events in the domain, etc. 

Defined in this sense, a “systematic formulation” may vary from 
one or a few orienting ideas towards the conduct of research, or 
towards the organization of extant knowledge within a given empirical 
domain (of any scope), to an explicit, elegant, and quantified systema- 
tization. Such highly diverse expressions as “viewpoint,” “research 
philosophy,” “Weltanschauung,” “exploratory hypothesis” or set of 
such, “frame of reference,” “dimensional system,” “systematic (or 
“theoretical”) framework,” “explanatory (or “descriptive”) system,” 
“hypothetico-deductive system,” “theory,” “explanatory mechanism” 
(or set of such), “model,” etc., may all be subsumed under “systematic 
formulation,” as we wish to use this phrase. 

This study is interested in the “systematic formulations” of present- 
day psychological science. Comparative analyses of “theory” and 
discussions of systematic methodology have considered far too narrow 
a range of formulations during the past few decades. We seek an 
inventory of current systematic resources which will adequately re- 
flect the diversity and richness of conceptual experimentation of recent 
and present psychology. Only by the widest possible representation 
of formulations, with respect both to methodological type and em- 
pirical domain, can clear light be shed on problems that cut across 
various classes of “system.” Only in this way can problems which are 
unique to given classes of “theory” be isolated, and interrelationship 
issues be treated justly and comprehensively. 

1 This is a copy of the document concerning the discussion themes and their 
significance, sent to all Study I contributors at the time of their invitation to 
participate. 

This study begins with no value judgments with respect to some 
preferred mode of systematization, or even with respect to some 
preferred set of systematic aims or ideals. On the contrary, the only 
value judgment it makes is that issues of this order have tended, in 
recent decades, to be prejudged. Nor is it the intention of this study 
to end with such a set of value judgments. Our intentions are ex- 
plicative, not evaluative, and our belief is that explication of the cur- 
rent systematic situation on a broadened and less stereotype-bound 
basis is as valuable to a rational determination of next steps on the 
part of systematists and research workers as it is to more effective 
pedagogy. 

In this era of second and n-order self-study questionnaires and 
professional nose-counting, investigators whose mode of work is as 
essentially individualistic and inspirational as that of the systematist 
may understandably feel that there is a suggestion of the Philistine 
in any project which requires the answering of questions about their 
work. To this, we can only reply that among the intentions of this 
study are not eavesdropping on the creative process, the determination 
of excellence by ballot, or even the charting of “directions” by con- 
sensus. We believe merely that where we can go—no matter in how 
many different directions—is some function of where we are, and that 
the assessment of where we are can proceed perhaps a little more 
efficiently in the light of the information for which this study calls. 
The type of reflective re-analysis of one’s position from a common 
incidence which this study seeks finds its precedent in such institution- 
alized channels as symposia, anthologies, handbooks, and the oc- 
casional journal issues which are devoted to a common theme. 


RATIONALE OF THE DISCUSSION TOPICS 


Explicit knowledge about the characteristics of the many and 
varied systematic formulations put forward in the history of science 
is in its infancy, but a reasonable amount of information exists about 
a few of the formulations in natural science (e.g., Newtonian me- 
chanics, relativity theory) particularly distinguished for their gen- 
erality, explicitness, “elegance,” and success in mediating the organiza- 
tion of knowledge. It is highly unlikely that all “successful” systematic 
formulations in all fields of science exhibit all of the known properties, 
even in some degree, of the criterion formulations which have so 
far been studied by methodologists. But it is probable that all formu- 
lations which realize in some measure (whether actually or potentially) 
such scientific objectives as “prediction,” “understanding,” or “con- 
trol” exhibit at least some of these properties. 

The discussion topics in the following outline have perforce been 
derived from the specifiable characteristics of the class of scientific 
systems which has so far received attention from methodologists of 
science. Nevertheless, we have no great confidence in the adequacy 
to psychology (and the biological and social sciences) of the generaliza- 
tions about problems of empirical systematization made by methodol- 
ogists of science. Whether systematizations of psychological data can 
be expected to conform to any large number of such characteristics 
is, of course, an entirely open question. Unfortunately, we do not as 
yet have a vocabulary, and a set of corresponding distinctions, which 
permits us to talk with precision about the widely varying character- 
istics of non-natural science systematic formulations. Given writers 
will therefore find that not all items will be equally relevant to their 
own systematic formulations, and some items will probably be en- 
tirely irrelevant. Depending on the nature of his system, the systematist 
must necessarily give differential attention and emphasis to certain of 
the items. He may also find it necessary to discuss the formulation 
with respect to characteristics not included in the outline. 

Clearly, we are aiming for commensurability of treatment, but 
not blindly or rigidly so. Not only may individual writers find it 
necessary to omit certain of the items, but they may wish, in some 
cases, to re-interpret items in order to bring them to bear more 
precisely on the nature of the formulation under analysis, and they 
may wish to alter the order in which the various discussion topics 
are arranged. Despite such necessary variations of treatment, the 
procedure should result in a more commensurable airing of issues 
connected with systematic formulations than has hitherto been the 
case. 

It would be meaningless to suggest any standard length for the 
manuscripts. Obviously, we should like to have sufficiently sustained 
consideration of the discussion topics to ensure clarity for a hetero- 
geneous audience, and to derive maximum explicit benefit from the 
systematist’s wisdom with respect to the problems at issue. On the 
other hand, we do not wish to burden the systematist with an overly 
laborious or time-consuming task. The purposes of the study will 
be adequately served by manuscripts which are as brief as is compati- 
ble with meaningful discussion of the outline rubrics. 

We have tried to formulate the following list of discussion topics 
explicitly enough to ensure univocality of interpretation, yet at the 
same time to avoid unconscionable discursiveness in our presentation. 
For reasons indicated above, we have used certain of the “standard” 
distinctions and terminological counters of the general methodology 
of science with a reluctance which has only given way because of the 
unavailability of any alternate vocabulary for talking, with general 
intelligibility, about systematic problems. If the authors of such 
distinctions have, in the past, applied them in such a way as to imply 
value judgments based on the degree of correspondence between the 
material under analysis and the analytic distinctions at hand, we can 
only regard this as a regrettable historical circumstance to which the 
results of the present study may conceivably supply the proper 
corrective. Indeed, a useful outcome of the present study might well 


THE THEMES OF DISCUSSION 


{1} Background factors and orienting attitudes 


(a) Background factors which have influenced objectives, methods, 
and content of system. 

(b) Orienting attitudes which have determined systematic ob- 
jectives, methods, and conceptual content. 


Explanation 


“Background factors” would include, of course, such matters as education, 
influence of other theorists, general currents of thought within the field or 
the culture at large, previous research history, or any other genetic circum- 
stance which the systematist deems noteworthy. 

“Orienting attitudes” register those presystematic judgments, values, and 
beliefs which, in a relatively general and stable way, have determined the 
aims, inductive basis, conceptual content, or formal organization of the 
system. Examples might be the systematist’s general commitments towards 
such issues as: 


a. the nature and limits of psychological prediction 

b. “level of analysis” at which it is fruitful to constitute explanatory 
constructs, with respect both to “ontological reference” (e.g., “purely 
behavioral,” “physiological,” “sociological”), and “coarseness-fine- 
ness” of the “causal” or explanatory units 

c. utility and role of “models” 

d. comprehensiveness of empirical reference (in terms of some such con- 
tinuum as “unrestricted generality of scope—extreme delimitation”) 
towards which it is fruitful for a system to aim, in the present phase 
e. degree and mode of quantitative and mensurational specificity towards 
which it is desirable and/or feasible to aim 

f. type of formal organization (on some such continuum as “explicit, 
hypothetico-deductive axiomatization—informal exposition”) con- 
sidered best suited to requirements for systematization, at the present 
phase, in the area selected by the systematist. 


In order to promote adequate understanding of the systematist’s goals 
and working methods, it would be desirable to make the itemization of 
“orienting attitudes” reasonably complete. 


{2} Structure of the system as thus far developed 


(a) Exhaustive itemization of systematic independent, intervening, 
and dependent variables. 

(b) Mode of definition of representative variables of each category. 

(c) Major interrelations among constructs. 

(d) Discussion of order of determinacy and other characteristics 
of construct linkages. 


Explanation 


What is sought here is not a discursive summary of the system, so much as 
a reconstruction of its conceptual structure via the isolation of the chief 
systematic constructs of all categories, and the exhibition of how they are 
interrelated within the system. The presentation need not be particularly 
lengthy, since, for the purpose of the analysis, the systematist need not 
summarize contents of prior expository publications, to any marked extent. 

In order to promote commensurability, we are suggesting that the system- 
atists adhere to the independent-intervening-dependent variable schema 
which has become more or less conventional in recent methodological dis- 
cussion. Since many systematic formulations have not been explicitly pat- 
terned on such a schema, the recasting of the systematic structure in this 
way may present difficulties, but, we suspect, not very formidable ones, in 
most cases. 

In cases where a systematist feels that an attempt to recast his material 
into the independent-intervening-dependent variable schema does violence 
to his formulation, he may, of course, recapitulate the structure of his 
system in any way that he considers appropriate. 

In certain cases (e.g., “positivistic” systematizations), a system may not 
contain conceptual components which correspond in functional significance 
to “intervening variables.” In such cases, the systematist’s task will obviously 
reduce to the isolation of systematic independent and dependent variables, 
and their interrelations. 

For purposes of this study, we stipulate the following rather informal 
definitions of the three classes of systematic variables. 

1. The “independent variables” of a system are the terms referring to 
the factors available for identification, “measurement,” and, when 
possible, manipulation, which are discriminated within the system as 
the antecedent conditions of the events that the system is designed to 
predict. 

2. The “dependent variables” of a system are the terms designating the 
classes of events that the system is designed to predict. 

3. “Intervening variables” are terms interpolated between the independent 
and dependent variables, having properties such that a 
class of empirical relationships describable by a given number of 
statements which directly relate independent and dependent variables 
can be derived from a substantially smaller number of statements 
which relate independent to intervening variables and these, in turn, 


(in the first instance) by Tolman for the analysis of theory implies a sense 


of the expressions “independent variable” and “dependent variable” which 


and in general scientific methodology. In order to be entirely clear for the 
purpose of the present study, we present three senses of the expression “in- 
dependent variable” (analogous definitions may immediately be derived 


SENSE I. SYSTEMATIC INDEPENDENT VARIABLES 


Terms in the construct language of a theory denoting the chief classes of 
empirical events which serve as the operationally identifiable or “measur- 
able,” and, wherever possible, manipulable antecedent conditions of the 
events that the theory is designed to predict. This is precisely the sense in 
which the present discussion topic calls for the isolation of the “independent 
variables” of the system under analysis. We may refer to “independent 
variables,” in this sense, as “systematic independent variables.” 


SENSE II. EMPIRICAL INDEPENDENT VARIABLES 


confused. Empirical independent variables may be specific, singular “realiza- 
tions” (operational or reductive “symptoms”) of a systematic independent 
variable; they are not, however, to be identified with the systematic inde- 
pendent variable to which they are ordered. Sense I independents are terms 
in the construct language; Sense II independents are expressions in immediate 
data language (cf. “explanation,” item {3}). A Sense II independent 
variable need not be a “realization” of a Sense I independent; empirical 
relations between experimental variables which are ordered to no extant theory 
are often investigated. 


SENSE III. “MATHEMATICAL” INDEPENDENT VARIABLES 


All terms in a statement of functional dependency of which a given term 
(the dependent variable) is a specified function. This corresponds roughly to 
the usage of “independent variable” in mathematics. We give this rather 
obvious usage for purposes of completeness. 


It might be added, at this point, that in most instances systematic inde- 
pendent and dependent variables are introduced into a system and given 
empirical meaning by some stipulated linkage(s) to a set of empirical in- 
dependent or dependent variables (this is one way of elucidating what is 
meant by so-called “empirical” or “operational” definitions). Thus, in the 
present analysis, a systematist may wish to employ some such distinction 
when discussing such questions as “mode of definition of representative 
variables” [item {2}(b)] and certain other questions introduced in later 
sections [e.g., items {3}(c) and (d)]. 


{3} Initial evidential grounds for assumptions of system 


(a) Identify the chief classes of experimental and/or empirical 
data which have served as the initial source of evidence on which the 
system was based, or have been used in any way to suggest the major 
assumptions of the system. 

(b) Why was this material considered “strategic,” or in some 
sense “fundamental,” relative to: 

(1) other sources or varieties of data within the same empirical 
area, 

(2) data in other empirical areas for which the system is intended 
to hold? 

(c) Isolate the chief empirical independent and dependent vari- 
ables (in “theoretically neutral,” “immediate data language” terms) 
in the evidence on which the system is based. 

(d) Show how empirical independent and dependent variables (as 
expressed in “immediate data language”) are linked to systematic 
independent and dependent variables (construct language). 


Explanation 

In the discussion of (a), it would be interesting for the systematist to 
consider whether, in general, the system has thus far been based primarily on 
extant empirical data, or whether the systematic program has been contingent 
on the prior extension, or “opening up,” of a field of data by the individual 
systematist, or group of investigators working within the systematic context. 

In (d) we have reference to the distinction between systematic independent 
and dependent variables (Sense I) and empirical independent and dependent 
variables (Sense II), precisely as made above (cf. “explanation,” item {2}). 

For uniform understanding of items (c) and (d), it might be useful to 
specify what we mean by “immediate data language.” One may say that 
all empirical (“operational”) definitions of a system are constructed from a 
linguistic base that may be called the “data language” of the system in 
question. Immediate data language is the language, presumably univocally 
intelligible to all competent workers in the field, in which empirical or 
operational definitions of systematic terms are put forward, and against 
which primitive and derived statements of the system are compared. In 
general, then, “immediate data language” tends to appear in two contexts 
in connection with an empirical system: 

1. in statements which are explicitly intended to provide operational 
definitions of terms in the construct language, and 

2. in descriptions of experimental (or general empirical) conditions, 
observations, and the results of statistical or mathematical transformations 
of observations which the systematist or investigator is relating 
in some way to the construct language of the system. 

One may distinguish “immediate data language” from another sense in which 
“data language” is often used in methodological discussions—i.e., as the 
“epistemic reduction basis” of the terms of a system. This involves reduction 
of the systematic (construct language) terms to the “ultimate” confirmation 
language to which all proper statements of the system are, in principle, 
reducible. We are not concerned with “data language” in this latter sense 
in the present group of discussion topics. 


{4} Construction of function forms 


(a) How are independent-intervening-dependent variable (or, in 
the case of “positivistic” systems, independent-dependent variable) 
function specifications constructed? 

(b) Rationale of, and grounds for confidence in, the procedure. 

(c) Contemplated modifications or extensions of the procedure as 
the theory develops. 


(d) Grounds for favoring employment or nonemployment of inter- 
vening variables. 


Explanation 


When thrown into independent-intervening-dependent variable form, 
any system will contain stipulations, at one level of explicitness or another, 
with respect to the interrelations among these variables. Such construct 
linkages will vary from rather general adumbrations of the functional 
relationships to highly specific descriptions of function forms. Thus, “function 
specifications” may range from “purely qualitative” verbal descriptions 
through varying degrees and modes of quantitative explicitness, depending 
on the systematic intentions, the area under systematization, etc. 

Such function-form specifications are, in one sense, free and creative 
constructions on the part of the theorist. In another sense, however, they 
“come from somewhere,” and are “arrived at” on the basis of some set of 
rules, however implicit. It would be most useful if systematists participating 
in the present study would make an attempt to explicate or reconstruct 
their procedure in arriving at the specification of function forms. In the 
case of some systems, construct interrelations may register in a relatively 
direct way the interrelations among empirical variables, as determined in 
specific experiments or empirical studies which are believed to have funda- 
mental significance. Such relationships may be “transposed” to the syste- 
matic variables in a variety of ways, ranging from empirical “curve-fitting” 
to verbal descriptions of the trend of the findings. In the case of other sys- 
tems, the construct linkages may apparently be arrived at by “rational 
analysis,” but in ways which are differentially based on inductive evidence, 
and which may range in form from the positing of rational equations to the 
stipulation of verbally formulated, qualitative interrelations. In still other 
cases, the technique of function construction may be partly “empirical” and 
partly “rational,” as combined into various concrete strategies. 


{5} Mensurational and quantificational procedures 


(a) What procedures are either specified or presupposed by the 
system with respect to the “measurement” (in the broadest sense) 
of the systematic independent and dependent variables? 

How would the “level” or type of mensurability presently char- 
acteristic of the systematic independent and dependent variables be 
located by the systematist within the terms of the logic of measurement? 

(b) To what extent do the procedures for “measurement” of the 
systematic independent and dependent variables satisfy the mathe- 
matical requirements of whatever quantitative techniques are em- 
ployed for the description of function forms? 

(c) What is the systematist’s estimate of the principal difficulties 
in the way of increasing the mensurational and quantitative adequacy 
of the system? Future plans with respect to the mensurational and 
quantitative development of the system. 

(d) Views of the systematist with respect to limitations, in principle, 
on “level” of measurement and degree of quantitative specificity of: 

(1) his own system, 

(2) systematic efforts in psychological science generally. 


Explanation 


Obviously, certain of these discussion topics will not be relevant to 
many of the systematic formulations sampled within the present study. 
Some formulations will be nonquantitative, in principle. Others will be 
prequantitative in their current form. In such cases, it would nevertheless 
be of great interest for the systematist to discuss items (c) and (d). 


{6} Formal organization of the system 


(a) Status of the system with respect to explicitness of axiomatiza- 
tion, and of derivational procedures employed. 

(b) What factors (e.g., “strategic,” “empirical”) are responsible 
for the present mode of formal organization of the system? 

(c) Views of the systematist about the ultimate level of formal 
explicitness for which it is desirable, in principle, to aim. 


Explanation 


Explicitness of axiomatization and derivational specificity or rigor can 
clearly vary over a very wide range, from informal exposition to detailed 
hypothetico-deductive development within the resources of mathematical 
notation and symbolic logic. It would be interesting if, in the discussions of 
the above topics, the systematist would present his views on such questions 
as the degree of “formalization” which he feels it may be fruitful to aim 
towards, in areas other than those to which his own systematic work is 
relevant. 

In the discussion of “formal organization” a recapitulation of the defi- 
nitional techniques employed within the system would be highly useful. 
Ideally, this would include a reconstruction of the roles of “implicit” (i.e., 
“postulational”) definition, “explicit” definition, empirical or “operational” 
definition, and, in certain cases, “coordinating” definition, as these are 
respectively realized within the system. 


{7} Scope or range of application of system 


(a) Actual scope, as the system is currently constituted. 

(b) Intended, ultimate scope and grounds for this delimitation. 
Concrete plans and programmatic devices for extension. 

(c) Interrelations, present and potential, with formulations of 
other systematists in: 

(1) areas coextensive with system, and 

(2) other empirical areas. 


{8} History of system to date in mediating research 


(a) Itemization of the chief experimental or empirical research 
studies, or clusters of such, which the system has directly (i.e., by 
logical implication) or indirectly (i.e., by suggestive or heuristic 
guidance) instigated. 

(b) What specific components of the system—e.g., orienting 
attitudes, general but incompletely specified “explanatory mecha- 
nisms” or constructs, specific lawful assumptions, methods—have been 
responsible for the research instigated by the system? 


{9} Evidence for the system 


(a) Current status of the “positive” evidence for the system (to 
the extent that this is not covered in item {8} above). 

(b) Major extant sources of incompatible or “embarrassing” data. 

(c) Specification of experimental designs which would be regarded 
as “critical” or important tests of principal foundation 
assumptions. 

(d) Types of data which, in the opinion of the theorist, the theory 
accounts for more successfully than do alternate formulations. Classes 
of data which alternate formulations handle more successfully. 


{10} Specific methods, concepts, or principles of the system 
believed valuable outside the context of the system 

(a) Methods, concepts, or principles deemed fruitful for systematic 
advance in areas outside the projected range of application of the 
system. 

(b) Chief methods, concepts, or principles believed to be of long- 
term significance, independently of the over-all structure or detailed 
assumptional content of the system. 


{11} Degree of programmaticity 

(a) Evaluation of the over-all extent to which the systematic 
program has been realized, at the given time. 

(b) Estimation of the extent to which the system is tending towards 
convergence with other coextensive systems, articulation with systems 
having different empirical domains, subsumption of more limited 
systems, or subsumability under more general ones. 


{12} Intermediate and long-range strategy for the develop- 
ment of the system 


(a) What classes of empirical relationships does the theory most 
require knowledge about, and in what priority order? 

(b) Estimate of the chief conceptual and empirical difficulties 
working against the development of the system. 

(c) Estimates, based on the systematist’s experience, of the chief 
barriers blocking general theoretical advance in psychology. 


NOTE ON THE USE OF DISCUSSION 
TOPIC INDEX NUMBERS 


As a convenience for the reader interested in the relation of essays 
to the discussion topics and in the cross-comparison of positions on 
key issues, index numbers corresponding to the twelve discussion 
themes have been inserted at relevant places in the Table of Contents 
preceding each of the essays. These numbers are placed in brackets 
immediately following the germane rubrics of the author’s plan of 
discussion. 

By and large, correspondences between authors’ organization and 
the discussion topics are straightforward, and can easily be identified 
from the author’s formulation of headings. Not infrequently, however, 
an author’s system of headings may, in one or another way, be out 
of phase with the discussion rubrics, even though some or all of the 
relevant issues are considered. This circumstance has led to the 
following conventions: 


The section designated by a given author-heading may be relevant to two 
or more themes. In such cases, the brackets will contain the requisite plu- 
rality of index numbers, e.g. {3, 8, 9}. 

In cases in which a section, or some part of it, is primarily relevant to a 
given theme but includes brief, partial, or implicative consideration of a 
number of others, that is indicated by a + after the index number of primary 
relevance, e.g. {2+}. 

When a section encompasses a number of discussion topics but gives them 
markedly different attention or emphasis, it has occasionally seemed worth 
setting the bracketed numbers in an order which roughly reflects this, e.g. 
{4, 5, 3}. Since such discriminations of relative emphasis cannot always be 
clearly made, there is no implication that index numbers are not differentially 
relevant when they are given in consecutive numerical order, e.g. {4, 5, 6}. 


We should note, also, certain general restrictions on the use of in- 
dex numbers: 


With very few exceptions, they have been used only in conjunction with 
major subdivisions of the papers (i.e., headings of high “value”), the excep- 
tions having been mainly cases in which essays contain a final section 
specifically for the purpose of bringing aspects of the preceding discussion 
to bear on the themes. In such cases, index numbers have been inserted to 
identify the themes dealt with in relevant subsections. 

Index numbers uniformly pertain to discussion themes as a whole, and 
do not separately identify the subitems which invite differentiated discussion 
under each theme. Once the correspondence with a given theme is identified, 
the reader will find that, in most instances, the bearing on particular sub- 
themes is easily discriminated. 


In several papers, the author’s plan of organization is such as to 
preclude the insertion of index numbers. In some of these (e.g., 
Ellson, Skinner) the author has preferred a type of discursive presentation 
sans headings, or has used so few of them that any use of 
index numbers would have been nondiscriminating. In a few cases 
(e.g., Pirenne and Marriott, Kallmann), the author’s organization 
is so markedly out of phase with the discussion themes as to make any 
use of the numbers either confusing or unnatural. Nevertheless, it will 
be found in most of these cases that it requires little effort to determine 
the author’s position with respect to many of the thematic issues. 
There are a few essays, however, to which certain of the suggested 
themes are not relevant in principle in that the concern is primarily 
with presystematic issues (e.g., Harlow). 

Whatever the explicitness of relation of each paper to the themes, 
it should be emphasized that each is a self-contained essay, having 
sui generis properties in substance and form. Any cross-comparison 
or integration of findings which the reader may wish to conduct must 
depend on his own active discriminations; it will not be provided 
ready-made by any mechanical device. The present system of indexing 
is offered merely as a convenient starting point for comparative 
analysis. It has been kept typographically inconspicuous, and used 
in conjunction only with molar rubrics, both of author and thematic 
organization, so as not to interfere with the organic unity of each 
presentation. 


NAME INDEX 


Page numbers in boldface type indicate bibliography references; “n.” indicates 
footnote reference 


Ach, N., 39, 84 

Adams, P. A., 125, 156 

Ades, H. W., 509, 536 

Adler, D. L., 35, 40, 41, 84 

Adriaanse, M. S. C. A., 598 

Adrian, E. D., 598 

Aiken, E. G., 332, 354 

Alberts, E., 533 

Alexander, H. B., 159 

Allport, F. H., 67, 84 

Allport, G. W., 51, 84, 130, 152, 334, 
34: 

Alper, R. G., 509, 536 

Alpert, A., 534 

Ames, A., 115 

Ammons, R. B., 348 

Amsel, A., 357 

Anderson, N. H., 418, 486 

Anderson, N. S., 660 

Andrew, G., 510, 511, 534 

Andrew, R. J., 598 

Anliker, J. E., 374, 377, 379 

Anrep, G. V., 195, 291, 560 

Archer, E. J., 548 

Aristotle, 169 

Armstrong, E. A., 599 

Arnold, W. J., 346, 348, 505, 535 

Aronson, 589 

Arsenian, J. M., 37, 85 

Asch, S. E., 44, 85 

Atkinson, J. W., 34, 85, 154 

Atkinson, R. C., 415, 418, 442, 443, 486, 
488 

Auld, F., 228, 286 

Aull, G., 40, 88 

Avenarius, R., 160 


Back, K., 37, 45, 85, 87 

Baerends, G. P., 563, 584n., 587, 592, 
596, 599 

Baerends, J. M., 599 

Bahrick, H. P., 661 

Bailey, C. J., 152, 226, 286 

Bare, J. K., 291 

Barker, R. G., 12, 29, 42, 43, 73, 74, 76, 
85, 95n., 152 

Barnes, G. H., 433, 486, 661 

Barnes, H. W., 503, 515, 534 

Baron, A., 505, 537 

Barry, H., III., 335, 348 




Bastock, M., 599 
Bavelas, A., 9n., 42, 44, 45, 46, 61, 85 


Bayes, 629 

Beach, F. A., 254, 285, 589, 593, 599 

Beecher, H., 374 

Beier, E. M., 333, 353 

Bell, D. A., 634 

Benepe, O. J., 652, 654, 660, 661 

Bennett, E. B., 44, 85 

Bennett, W. F., 661 

Bergmann, G., 286, 299, 348, 356 

Berkun, M. M., 152, 220, 228, 235, 269, 
286, 288, 291 

Berlyne, D. E., 101n., 152, 269, 287, 348 

Beusekom, G. van, 599 

Bingham, H. C., 510, 534 

Biot, 650n. 

Birch, H. G., 331, 334, 348, 510, 534 

Birge, J. S., 246, 287 

Birukow, G., 599, 560 

Bishop, C. K., 442, 486 

Bitterman, M. E., 331, 332, 334, 336, 
338, 348, 349, 350, 357 

Black, A. H., 292 

Blodgett, H. C., 103n., 122, 149, 152, 
334, 337, 349 

Bloomberg, R., 152 

Blough, D. S., 371, 376, 379 


23949. 

Boerema, L. K., 609 

Boguslavsky, G., 167 

Bolton, T., 159, 160 

Bosanquet, B., 160 

Bowles, R., 153 

Bradley, F. H., 160 

Brady, J. V., 373 

Brand, H., 418n., 487 

Braun, H. W., 503, 515, 516, 534 

Breland, K., 371, 379 

Breland, M., 379 

Brener, F., 598 

Bretnall, E. P., 357 

Briggs, G. E., 336, 357 

Broadbent, D. E., 182, 183, 195, 584, 
600 


Brody, A. L., 414, 439, 440, 487 
Brogden, W. J., 333, 336, 349 
Brouwer, R., 599 




Brown, J. S., 210, 211, 212, 218, 265, 
281, 282, 287, 290, 349 

Brown, M. E., 599 

Brown, W. L., 333, 350 

Bruce, R. H., 101, 153 

Bruner, J. S., 44, 85 

Brunswik, E., 69, 73, 74, 85, 96, 97, 
103, 124n., 153, 156, 410, 411, 487 

Buchholtz, C., 600 

Bugelski, R., 209, 287 

Bunch, M. E., 543 

Burke, C. J., 349, 380n., 422, 424, 430, 
432, 438, 484, 485, 487, 488, 660 

Burrows, R. H., 349 

Bush, R. R., 380n., 387, 418, 430, 437n., 
477, 481, 482, 483, 484, 485, 486, 
487, 612, 634 

Büsse, 160, 190 


Calvin, A. D., 349 

Cameron, F., 661 

Campbell, B. A., 255, 256, 258, 259, 
287, 291 

Campbell, D. T., 335, 349 

Campbell, N. R., 437n., 487 

Cannon, W., 369 

Cantril, H., 115, 154 

Carmichael, L., 86 

Carnap, R., 201, 287 

Carpenter, C. R., 589 

Carr, H. A., 334, 349, 543, 544 

Carterette, T., 416n. 

Carthy, J. D., 600 

Cartwright, D., 4, 7, 29, 34, 36, 37, 41, 
43, 63, 64, 65, 71, 85, 86, 87, 88, 


Cassirer, E., 16, 48, 49, 86 
Chapanis, A., 661 

Chein, I., 47, 73, 86 
Cherry, E. C., 634 

Child, I. L., 292 

Chow, K. L., 509, 534 
Choy, M., 278, 279, 287 
Citron, A. F., 47, 86 

Clark, K., vi, vii 

Clark, L., 352 

Clark, R. A., 154 

Coch, L., 45, 86 

Cohen, A. R., 37, 46, 86, 91 
Collias, N. E., 600 

Collier, G. H., 487, 491 
Collins, M. E., 47, 86 
Conant, J. B., 287, 331, 350, 539, 559 
Conger, J. J., 226, 236, 287 


Cook, S. W., 47, 91 
Coombs, C. H., 487 
Coons, D., 233 

Cotton, J. W., 350, 491 
Cowles, J. T., 129, 153 
Craig, W., 563, 572, 600 
Crane, J., 600 


Cronbach, L. J., 12, 87, 613, 634 
Crossman, E. R. F. W., 630, 634 
Crowcroft, P., 600 




Crozier, W. J., 362 

Crum, L., 333, 350 
Crutchfield, R. S., 143, 144n. 
Cullen, E., 600 

Culler, E., 333, 349 


Daanje, E., 600 

Darwin, C., 160 

Davis, R. L., 487 

Davis, R. T., 503, 516, 517, 518, 534 

Delgado, J. M. R., 126n., 153, 287, 291 

Dember, W. N., 157 

Dembo, T., 12, 28, 29, 41, 42, 43, 76, 
77, 82, 85, 86, 89 

Democritus, 49 

Dennis, W., 518, 534 

Denny, M. R., 350 

de Ruiter, L., 607 

Desiderato, O., 125n., 153 

Detambell, M. H., 412, 414, 415, 418, 
419, 481, 482, 487 

Deutsch, M., 9n., 46, 47, 82, 86 

Dewey, J., 160, 369 

Dews, P., 373 

Doll, E. A., 164, 165 

Dollard, J., 28, 55, 86, 199, 226, 238, 
ae 268, 272, 287, 290, 336, 350, 
54 


Drees, O., 600 
Dufort, R. H., 350 


Earl, R. W., 157 
Ebbinghaus, H., 190, 538, 540, 541, 542, 
5. 


Ebeling, E., 355 

Eglash, A., 333, 350 

Ehrenfreund, D., 337, 339, 350, 533 

Eibl-Eibesfeldt, I., 600, 601 

Eikmanns, K. H., 601 

Elkind, J. I., 660 

Elliott, M. H., 153 

Ellis, D. S., 350 

Ellis, R. A., 353 

Ellson, D. G., 5, 380n., 637, 660, 661, 675 

Epictetus, 16 

Erickson, T. C., 536 

Eriksen, C. W., 634 

Escalona, S. K., 9n., 43, 70n., 86 

Estes, W. K., 4, 11, 86, 155, 167, 169, 
195, 317, 334, 350, 352, 353, 370, 
372, 380, 402, 408, 418, 422, 428, 
430, 477n., 482, 483, 484, 485, 487, 
488, 489, 490, 605, 610, 613 

Euclid, 193 

Evans, H. E., 589, 601 

Evans, S., 534 

Ezriel, H., 20, 86 


Faber, A., 601 
Fabricius, E., 601 
Falk, J. L., 87 




Fano, R. M., 615, 621, 630, 634, 635 

Faraday, M., 360 

Farber, M. L., 24, 87 

Fechner, G. T., 58 

Feigl, H., 288 

Feller, W., 488 

Felsinger, J. M. von, 253, 292 

Fernald, M. R., 180 

Ferster, C. B., 367, 372, 374 

Festinger, L., 37, 43, 44, 45, 65, 71, 79, 82, 86, 89 

Ficks, L., 636 

Fiedler, K., 601 

Fields, P. E., 167, 191, 195, 511, 534 

Finch, G., 661 

Fink, J. B., 423, 488 

Finocchio, D. V., 502, 536 

Fisher, R. A., 370, 379 

Fitts, P. M., 661 

Fontaine, A. B., 661 

Ford, E. B., 609 

Förster, H. von, 551, 560 

Frank, J. D., 37, 43, 87 

Franklin, M., 34, 89 

Frankmann, J. P., 429, 488 

Franzisket, L., 601 

Fredericson, E., 608 

Free, J. B., 

Freides, D., 350 

French, J. R. P., Jr., 37, 42, 45, 46, 63, 65, 86, 87 

Frenkel-Brunswik, E., 153 

Freud, S., 20, 41, 160, 267, 273, 287, 378 

Freund, A., 42, 87 

Frick, F. C., 5, 387, 490, 611, 635 

Frisch, K. von, 601 

Fuller, J. L., 287 


Gabor, D. A., 635 
Gage, N. L., 12, 87 
Gagne, R. M., 488 
Galanter, E., 337, 350 
Galileo, 49, 50 

Gantt, W. H., 331, 353 
Garattini, S., 290 
Gardner, R. A., 412n. 
Garner, W. R., 633, 635 
Geier, F. M., 142n., 153 
Gelb, A., 80, 87 
Gellerman, L. W., 511, 534 
Gentry, G., 149, 350 
Gerstman, L. S., 635 
Ghetti, V., 290 
Gibson, E. J., 350 
Gilbarg, D., 648, 650, 660 
Ginsberg, A., 300n., 350 
Glanzer, M., 140, 153 
Glaze, J. A., 544, 559 
Gleitman, H., 350 
Goethe, F., 601 
Goldenweiser, A., 160 
Goldman, H. M., 351 
Goldstein, K., 80, 87 




Goodnow, J. J., 488 

Goodwin, Da Sor, 602 

Goss, A. E., 

Graham, C. H., vi, vii, 6, 488 

Grandene, L., 350, 534 

Grant, D. A., 350, 412, 486, 488 

Gray, F. E., 589, 660 

Green, E. J., 488 

Grice, G. R., 351, 557n. 

Griffin, D. R., 602 

Grinker, R. R., 230, 288 

Grose, R. F., 246, 288 

Grosser, D., 45, 87 

Gulliksen, H. A., 338, 351, 489 

Guthrie, E. R., 4, 132, 133, 149, 153, 
158, 195, 199, 260, 261, 288, 331, 
334, 351, 397, 399, 403, 459n., 480, 
481, 489, 530, 533, 534, 540 

Guthrie, H. M., 195 

Guttman, N., 388, 435, 465, 489 


Haagen, C. H., 559 

Haartman, L. von, 602 

Haire, M., 127n., 153 

Hake, H. W., 412, 488, 634, 635 

Haldane, J. B. S., 608 

Hall, C. S., 357 

Hall, M. F., 606 

Hall, V. P., 544, 559 

Hanratty, J. A., 154 

Harary, F., 61, 63, 64, 65, 86, 87, 88 

Harding, J., 47, 86 

Hardy, A. C., 609 

Harker, G. S., 321, 322, 351 

Harlow, H. F., 4, 101n., 153, 249, 252, 
260, 269, 288, 350, 351, 444, 489, 
492, 498, 503, 508, 509, 510, 
513, 515, 517, 523, 525, 534, 535, 
536, 537, 675 

Harriman, A. E., 288 

Hart, G., 290 

Hartley, P. H. T., 602 

Hartman, H., 199 

Hayes, C., 502, 503, 507, 508, 535 

Hayes, J. S., 607 

Hayes, K. J., 502, 503, 507, 508, 535 

Heath, P., 90 

Hebb, D. O., 243, 508, 530, 535, 602 

Heider, F., 44, 64, 65, 75, 88 

Heinroth, K., 602 

Heinroth, M., 602 

Heinroth, O., 563, 602 

Hellyer, S., 422, 434, 487, 489 

Helson, H., 153, 535 

Hempel, C. G., 288 


Henle, M., 40, 88 
Herb, F. H., 125n., 153 
Heron, W. T., 369, 370, 379 


Herrnstein, 373 
Heyman, M. N., 351 
Hick, W. E., 584 
Hicks, L. H., 511, 535 
Hickson, R., 416n. 




Hilgard, E. R., 294, 331, 334, 335, 351, 
485, 489 

Hill, C. W., 337, 351 

Hill, L. M., 535 

Hinde, R. A., 5, 561, 574, 602 

Hinman, 160 

Hobhouse, L. T., 171, 186, 190, 195 

Hoffmann, K., 603 

Holst, E. von, 568, 589, 603, 606 

Holz-Tucker, A. M., 254, 285, 599 

Holzapfel, M., 603 

Homme, L. E., 428, 489 

Honzik, C. H., 128n., 153, 154 

Hoppe, F., 43, 88 

Hornseth, J. P., 412, 488 

Horton, G. P., 132, 133, 153, 184, 190, 
195 


Horwitz, M., 36, 82, 88 

Hovland, C. I., 543, 546 

Howard, E. E., 563 

Howard, H. E., 603 

Hubert, F., 603 

Hughes, L. H., Jr., 434n. 

Hull, C. L., 4, 102, 103n., 133, 134, 
149, 151, 153, 154, 169, 171, 173, 
177, 186, 194, 199, 205, 233, 234, 
238, 248, 253, 255, 260, 274, 285, 
288, 293-348, 351, 352, 384, 386, 
387, 438, 441, 449, 459n., 469, 474, 
481, 489, 533, 539, 540, 551, 553, 
559, 560 

Humphreys, L. G., 299, 332, 351, 352, 
411, 436, 489 

Hunt, H. F., 373 

Hunt, J. McV., 89, 90, 289 

Hunter, I. M. L., 337, 352, 362 

Hurwitz, J. I., 37, 88 

Huxley, J. S., 563, 603, 609 

Hymovitch, 88 




Iersel, J. A. A. van, 592, 603, 609 
Irey, E., 155 

Irion, A. L., 4, 490, 538 

Irwin, F. W., 154 

Ittelson, W. H., 154 


Jackson, L. L., 154 

Jackson, W., 635 

Jacobs, W., 603 

James, H., 337, 352 

Janet, P., 169, 170, 173, 195 
Jarvik, M. E., 412, 489, 524, 535 
Jasper, H. H., 598 

Jenkins, J. J., 154 

Jenkins, W. O., 332, 352 

Johns, M., 413, 414, 436, 477n. 
Johnson, H. M., 170 

Jones, H., 354 

Jones, M. R., 153, 155, 156, 287, 288 
Joos, M., 600 

Jordan, C., 489 

Jucknat, M., 43, 88 




Kagan, J., 269, 288 

Kalhorn, J., 45, 88 

Kalish, D., 143, 157, 357 

Kalish, H. I., 465, 489 

Kallman, F. J., 675 

Kaplan, J. J., 350 

Karman, 650n. 

Karn, H. W., 336, 352, 511, 535 

Karoly, A. J., 157 

Karsten, A., 31, 41, 42, 88 

Katona, G., 82, 88 

Katz, E., 40, 88 

Kaufman, E. L., 212, 288 

Keenleyside, M. H. A., 603 

Keller, F. S., 346, 357, 362, 369, 373, 
489, 535 

Kelley, H. H., 44, 45, 46, 79, 87, 88 

Kellog, W. N., 352 

Kemeny, J. G., 489 

Kendler, H. H., vi, vii, 6, 331, 332, 335, 
337, 352 

Kendler, T. S., 535 

Kennedy, J. S., 582, 603 

Kessen, M. L., 152, 155, 286, 290 

Kimble, G. A., 350, 352 

Kincaid, W. D., 333, 353 

Klemmer, E. T., 635 

Kling, J., 557n. 

Klüver, H., 127, 154, 337, 352, 535 

Koch, S., vi, vii, 1-6, 214n., 309, 317, 

52 


Koenig, L., 603 

Koerner, L., 155 

Koffka, K., 68, 334, 352 

Köhler, O., 603 

Köhler, W., 67, 94, 95, 147, 154, 480, 
510, 535 

Kohn, M., 154, 288 

Koronakos, C., 505, 535 

Kortlandt, A., 579, 596n., 603, 604 

Kounin, J. S., 40, 42, 84, 88 

Kraeling, D., 220, 287, 290 

Kramer, G., 604 

Krauskopf, J., 332, 349 

Krech, D., 85 

Krechevsky, I., 128n., 154, 156, 339, 
352, 523, 535 

Krendel, E. S., 661

Kruse, H. D., 290 

Kruyt, W., 609 

Kubala, A. L., 349 

Kuenen, D. J., 609 

Kuenne, M. R., 535 

Kühn, A., 579


LaBerge, D. L., 489 

Lack, D., 584, 604 

Lambert, W. W., 129, 154 

Lamoreaux, R. R., 333, 354 

Lanier, L. H., vi, vii, 6 

Lashley, K. S., 133n., 149, 154, 248, 
288, 339, 352 

Laven, H., 604 




Laver, D. W., 380n., 402, 410, 411, 428, 
433, 437, 488, 489, 490

Lawrence, D. H., 246, 252, 288, 289, 
353, 490 


Leeper, R. W., 9n., 52, 63, 68, 69, 70n., 
78, 88, 154 

Lehrman, D. S., 563, 574, 585, 604 

Levin, M., 142n., 153 

Levine, M., 502, 518, 520

Lewin, K., 4, 7-84, 85, 88, 89, 136, 149,
154, 235, ie 289, 480, 540 

Lewis, D. Ja 

Lewis, H. A, Do, 290 

Lewis, H. B., 34, 89 

Libo, L., 36, 89

Lindemann, W., 604 

Lindsley, O. R., 373, 374 

Lindzey, G., 86, 88 

Lipman, E. A., 333, 349

Lippitt, R., 29, 45, 81, 32 89, 90, 356 

Lissman, H. W., 589, 604 

Lissner, K., 35, 40, 77, 89

Locke, J., 390 

Loeb, J., 362 

Logan, F. A., 4, 242, 260, 289, 293, 333, 
335, 348, 353, 490 

Lohr, T., 374 

Lohri, H., 602 

Looney, E., 153 

Lorenz, K., 563, 567, 572, 573, 574, 582, 
583, 584, 585, 586, 587, 604, 605 

Loucks, R. B., 178, 331, 353

Lowell, E. L., 154

Luce, E. D., 635 

Luchins, A. S., 353

Luchins, E. H., 353


Maatsch, J. L., 353

McClelland, D. C., 130n., 154 

MacColl, L. A., 661 

McCollum, ig Ñ., 661 

McConnel, D. G., 428, 490

MacCorquodale, K., 57, 89, 97, 98n.,
123n., 130n., 154, 155, 300n., 334,
336, 353, 570, 582, 583, 605, 606

McCulloch, Mi L., 353 

McCutchan, K., 337, 349 

McDowell, A. A., 503, 516, 517, 534

Macfarlane, D. A., 149, 154

McGeoch, J. A., 490, 541, 543, 544, 546,

0 


McGill, W. J., 481, 482, 484, 485, 486, 
490, 633, 634, 635 


Mach, E., 160

McLeod, R. R. B., vii, 6, 11, 288 
McNemar, A., 85 

Magnus, D., 362, 363, 605 
Mahler, V., 35, 41, 77, 89
Maier, N. R. F., 331, 353, 510, 535, 605
Makkink, G. F., 605 
Maltzman, ite 334, aoe 354 
Manning, A., 599, 605
Marler, P., 605
Marquis, D. G., 331, 334, 335, 351, 489




Marriott, F. H. C., 675 

Marrow, A. J., 33, 90 

Marshall, H. R., 180, 195, 373

Marx, M. H., 300n., 354 

Maslow, A., 131, 155 

Mason, W. A., 535

Masserman, J. H., 226, 235, 236, 289

Matheson, E., 536

Matthews, G. V. T., 605 

Maxwell, 201 

May, M., 199, 238 

Mayer, J., 374

Mayfield, y F., 154 

Mayne, R., 661 

Mayr, E., 606 

Maze, J. R., 300n., 354

Mead, A. P., 607 

Mead, G. H., 82, 90

Meehl, P. E., 57, 89, 90, 97, 98n., 123n.,
130n., 154, 155, 240, 289, 300n.,
334, 336, 353, 354, 570, 582, 583,
605, 606

Meeuse, B. B. D., 609 

Melton, A. W., 543, 544 

Merrill, M. A., 85 

Meyer, D. R., 288, 336, 355, 416n., 490, 
500, 504, 506, 509, 525, 536 

Meyer, M., 542 

Meyers, C. E., 90

Miles, R. C., 506 

Miles, W. R., 290 

Mill, J. S., 369, 541 

Miller, G. A., 387, 481, 482, 484, 485,
486, 490, 627, 630, 635

Miller, N. E., 4, 43, 52, 90, 104n., 111, 
116, 126, 149, 152, 153, 155, 196, 
206, 207, 208, 209, 212, 219, 220, 
221, 226, 239, 246, 249, 260, 286, 
287, 288, 289, 290, 291, 294, 306, 
332, 334, 336, 350, 354

Miller, R. E., 502, 536 

Milner, P., 155, 258, 291 

Minium, E., 141, 157 

Mittelstaedt, H., 603, 606

Montgomery, K. C., 101n., 155, 269, 291 

Moon, L. E., 523, 536 

Moore, O. K., 354 

Morgan, C. L., 171, 186, 190, 195, 579 

Morris, C. M., 90 

Morris, D., 599, 606 

Morse, W. H., 373 

Moss, E., 517, 536 

Moss, F. A., 156 

Mosteller, F., 380n., 387, 418, 430, 437n., 
477, 481, 482, 483, 484, 485, 486, 
487, 612, 613, 634

Mowrer, O. H., 126, 149, 155, 267, 291,
331, 332, 333, 334, 354 

Moynihan, M., 599, 606 

Mueller, C. G., vi, vii, 5, 6, 387, 490 

Muenzinger, K. F., 132n., 138, 149, 155 

Müller, G. E., 557, 560 

Munn, N. L., 511, 535 

Murchison, C., 288, 289 

Murdoch, G. P., 354


Murphy, J. V., 502, 536 

Murray, E. J., 219, 220, 221, 228, 291 
Murray, H. A., 155 

Myers, A. K., 235, 236, 269, 291 


Nachmias, J., 350 

Narasimhan, R., 660, 661 

Neet, C. C., 511, 536 

Neimark, E. D., 412, 414, 415, 418, 490 
Neisser, U., 350 

Newbold, 160 

Newcomb, T. M., 44, 79, 87, 90 
Newman, E. B., 635 

Newton, I., 50 

Nice, M. M., 589, 606 

Nicolai, J., 606 

Nissen, H. W., 536, 572, 606 
Noble, C. E., 354, 589 

Nolte, A., 606 

Norman, R. Z., 61, 63, 64, 87, 88 
North, A. J., 416n., 490 

North, J. D., 661 

Nowlis, V., 291 


Olds, J. A., 126n., 155, 258, 291 
Olmstead, D. L., 353 
Oppenheimer, R., 155 

Osgood, C. E., 136n., 155, 559, 560 
Ovsiankina, M., 32, 39, 40, 90 


Parker, C., 161 

Patton, R. A., 503, 516, 534 

Patton, R. M., 423, 488 

Pavlov, I. P., 149, 166, 174, 178, 179, 
185, 186, 189, 194, 195, 237, 274, 
ae 318, 362, 367, 494, 536, 554, 


Pelkwijk, J. J., 606 
Penfield, W., 536 
Pentrinovich, L., 153 
Pepitone, A., 37, 44, 73, 87, 90 
Perdeck, A. C., 580, 584, 609 
Perin, C. T., 354 

Perkins, C. C., Jr., 354 
Perkins, F. T., 558, 560 

Perry, R. B., 94, 149, 155 
Peterson, L. R., 333, 354, 423, 490 
Peterson, M. J., 490 

Pfaffman, C., 291 

Phillips, L. W., 140n. 

Picken, L. E. R., 606

Pilters, H., 606 

Pilzecker, A., 557, 560 
Pirenne, M. H., 675 

Polansky, N., 45, 87, 89 
Pollack, I., 635, 636

Porter, L. W., 152, 291 
Postman, L., 125, 156, 334, 355 
Poulsen, H., 607 

Pratt, J. C., 353 

Precht, N., 607 

Prechtel, H. F. R., 607 




Prentice, W. G. H., 355 
Pribram, K. H., 374 
Pubols, B. H., 522, 536 


Quastler, H., 634, 635 


Rabaioli, E. J., 350 
Räber, H., 607 
Ramond, C. K., 355 
Ramsay, A. O., 607 
Ratliff, F., 371, 379 
Raven, B., 37, 87 
Razran, G. H. S., 186
Redl, F., 45, 77, 89, 90 
Reed, P. C., 332, 349 
Reid, L. S., 522, 536 
Restle, F., 432, 481, 482, 485, 490,
503n., 504
Reymert, M. L., 85 
Richardson, C. H., 490 
Richter, C. P., 254, 291, 360, 379 
Riecken, H. W., 44, 73, 90 
Riopelle, A. J., 504, 506, 508, 515, 518, 
536 


Ritchie, B. F., 111, 143, 156, 157, 333, 
355, 357 

Roberts, K. E., 508, 536 

Roberts, W. W., 153, 287, 291 

Robinson, E. S., 543 

Roby, T. B., 104n., 156, 258, 292 

Rodnick, E. H., vii, 6 

Roe, A., 535 

Rohrer, J. H., 557, 560 

Rosen, S., 45, 89 

Rosner, B. S., 353 

Ross, A. S. C., 634

Rosvold, H. E., 153 

Roth, W., 355 

Rozeboom, W. W., 15, 57, 90

Ruiter, L. de, 607 

Russell, B., 161, 361 

Russell, E. S., 575, 581, 588, 607 

Russell, L., 661 

Russell, W. M. S., 607 




Sakoda, J. M., 418n., 487 
Sakuma, K., 89 

Sanford, N., 31, 90 

Sauer, E., 607 

Sauer, F., 607 

Schachter, S., 37, 45, 87 
Schaffner, B., 599, 604, 605 
Scheler, M. F., 12, 90 
Schenkel, R., 607 

Schiller, P. H., 510, 536, 607
Schipper, L. M., 350 
Schleidt, M., 608 
Schlosberg, H., 333, 355, 368 
Schmidt, R. S., 608 
Schneirla, T. C. A., 331, 353, 574, 589, 


Schoeffler, M. S., 423, 490 




Schoenfeld, W. N., 489

Schrier, A. M., 517, 518, 520, 525, 536 

Schwartz, R. D., 353, 355 

Schweers, R., 144n., ‘abe. 

Scott, J. P., 264, 291, 608 

Sears, P. S., 43, 82, 89, 291 

Sears, R. R., 291 

Seashore, H. E., 42, 90 

Segall, M., 155

Seibel, J. i 349 

Seitz, A., 608 

Sellars, W., 288 

Selltiz, C., 47, 91 

Settlage, P. H., 288 

Sevenster, P. A., 608 

Seward, J. P., 114n., 156, 317, 334, 337, 
355 

Shannon, C. E., 616, 621, 627, 628, 631, 
632, 634, 636 

Shaw, W. A., 337, 350 

Sheer, D. E., 290 


Sheffield, F. D., 104n., 156, 188, 195, 
255, 256, 258, 287, 291, 292, 333, 
336, 355


Sheffield, V. F., 355 

Sherrington, C. S., 362, 363, 572, 608 

Sidd, J. J., 292 

Sidman, M., 373 

Silver, C. A., 336, 355

Simmons, K. E. Te 608 

Simon, H. A., 490 

Simpson, G. G., 535

Singer, E. A., 160, 190 

Skinner, B. F., 4, 149, 171, 185, 292,
321n., 343, 346, 355, 356, 359, 379,
384, 386, 469, 474, 480, 490, 610, 
675 

Sliosberg, S., 5, 90 

Smedslund, J., 335, 356 

Smith, A., 489 

Smith, K. U., 511, 537 

Smith, M. B., vii, 6

Smith, S., 161 

Smith, W. A. S., 154, 195, 199 

Snell, J. L., 489 

Snyder, R., 46, 87

Snygg, D., 68, 532 

Solomon, H., 374 

Solomon, L. N., 332, 354 

Solomon, R. L., 129, 154, 292 

Spalding, D. A., 608

Spence, K. W., 4, 67, 68, 90, 124n., 
128n., 149, 156, 203, 260, 262, 284, 
286, 292, 293-348, 356, 441, 481,
490, 491, 512, 515, 533, 537 

Speigel, J. P., 230, 287, 288

Spieth, H. T., 608 

Spurway, H., 574, 608

Stanley, J. C., 332, 352, 416n.

Stellar, E., 3, 

Stevens, Cc. M., 

Stevens, S. S., è. 5389, 354, 356, 661 

Stevenson, S. S., 291 

Stone, C. P., 356, 535 

Stonehouse, B., 608



Stotland, E., 37, 46, 91 

Straughan, J. H., 408, 409, 412, 439, 
482, 483, 488, 491 

Stride, G. O., 608 

Strong, P. N ous 536 

Suci, G. J., 

Suppes, P., Jaon, 415, 473n., 486 


Tannenbaum, P. H., 155 

Taylor, J. S., 357 

Terman, L. M., 199 

Thibaut, J. W., 44, 45, 46, 73, 88, 90 

Thistlethwaite, D. A., 334, 357 

Thistlethwaite, E. C., 156 

Thompson, G. L., 489 

Thompson, J. P., 337, 357 

Thompson, M. E., 337, 357 

Thompson, R., 337, 357, 502, 503, 507, 
508, 535 

Thompson, W. F., 193, 194 

Thorndike, R. L., vii, 6, 95, 149, 167,
186, 190, 242, 274, 541

Thorpe, W. H., 577, 587, 595, 598,
602, 608

Thorson, N., 503, 516, 517, 534 

Thrall, R. M., 487 

Thune, L. E., 337, 351, 545, 548 

Thurston, L. L., 168, 195 

Tinbergen, N., 81, 90, 563, 567, 572,
573, 577, 578, 580, 582, 583, 584,
585, 586, 587, 591, 594, 596, 602,
605, 606, 609

Tinklepaugh, O. L., 265, 292 

Titchener, E. B., 560 

Tolman, E. C., 4, 43, 90, 92, 103n., 141, 
143, 153, 155, 156, 157, 167, 195, 
248, 262, 292, 318, 331, 334, 335, 
336, 337, 339, 353, 357, 480, 493, 
510, 537 

Toulmin, S. E., 565, 577, 609 

Towe, A., 182 

Trapold, 033 

Tryon, R. C., 150 

Tustin, A., 661 




Uexkull, J. von, 584, 610 
Underwood, B. J., 331, 352, 442n., 546 


van Beusekom, G., 599 

van Iersel, J. A. A., 592, 603, 609 

Varossieau, W. W., 609 

Veblen, O., 61, 90 

Verplanck, W. S., 491, 563, 581, 598,
610


Verwey, J., 563, 610 
Vince, M. A., 602 

Voeks, V. W., 422n., 491 
Voight, G., 74, 75, 90 

von Felsinger, J. M., 253, 292 
von Förster, H., 551, 560 
von Frisch, K., 601

von Haartman, L., 602 




von Holst, E., 568, 589, 603, 606 
von Uexkull, J., 584, 610
Vowles, D. M., 610 




Wade, M., 339, 352 


Walk, R. D., 357 
Walker, E. L., 143, 157
Walkley, R. P., 47, 91 


Walpole, H., "og 

Walston, C. E., 662 

Ward, L. B., 545, 560 

Warden, C. J., 157

Warren, C. E., 662 

Warren, J. M., 498, 504, 505, 509, 535, 
536 


Wasserman, H. N., 357 

Waterbolk, H. T., 599

Waters, R. H., 334, 357 

Watson, C. E., 662

Watson, G., 85 

Watson, J. B., 29, 90, 94, 95, 149, 157,
295, 362

Watson, P. D., 129, 154, 292 

Webb, W. B., 152, 253, 292 

Weber, E. H., 58

Wegner, N., 332, 358 

Weidmann, U., 610

Weinstein, B., 511, 537 

Weinstock, S., 357, 402, 416n., 437, 491 

Weise, P., 338, 357

Weiss, E. q 357, 589 

Weldon, R. J., 355 

Weyl, H., 48, 91 

Wheeler, TR ‘661 

Wheeler, R. H., 558, 560

White, R. K., 45, 81, e 91, 228 

Whitehead, A. N., 16 

Whiting, J. W. M., 201, 292

Whitman, C. O., 563, 610 




Wickens, D. D., 189, 336, 357 

Wickert, F., 292 

Wickler, W., 610

Wiener, N., 248 

Williams, C. M., 349 

Wilner, D. M., 47, 91 

Wilson, J. T., vii

Wilson, M. P., 346, 357

Windle, W. F., 535 

Wineman, D., 90

Witmer, L., 560

Wodinsky, J., 349

Wolfe, H. K., 159, 160 

Wolfle, D., vvii, 1, 6, 338, 351, 422n.,
491 

Wolpe, J., 334, 358

Woods, P. J., 418n., 487

Woolsey, C. N., 536

Wormser, M. H., 47, 91 

Wright, B. A., 43, 45, 91 

Wright, H. F., 12, 73, 74, 85, 152 

Wright, M. E., 37, 42, 91, 95n. 

Wundt, W., 159 

Wyckoff, L. B., Jr., 358, 432, 478, 481,
482, 491 


Yacorzynski, G. K., 189

Yerkes, R. M., 94
Yoshioka, J. G., 285, 292
Young, P. T., 104n., 157 


Zajonc, R. B., 83, 84, 91 

Zander, A., 36, 37, 46, 85, 86, 88, 89, 91 
Zeaman, D., 332, 358

Zeigarnik, B., 17, 19, 32, 33, 36, 91
Zener, K. E., vi, vii, 6

Zimbardo, P. G., 269, 292 

Zippelius, H. M., 610


SUBJECT INDEX 


Topics followed by an asterisk are those treated by all or most authors (often ex- 
tensively). These pertain mainly to the “crosscutting” systematic and methodic issues 
raised by the themes of analysis and related editorial proposals. In most instances, 
page references for such topics are given only to basic definitions or explanations. 
Individual author treatments of many asterisked topics can be located by reference 
to the tables of contents appearing with each article, in conjunction with the use of 
discussion topic index numbers (see Note on the Use of Discussion Topic Index
Numbers, pp. 674-675).


Abstraction, method of, 48-50 

Achievement, drive, 270 
and Zeigarnik effect, 34 

Action Potential, Specific, 586 

Action research, 47 

Activity, and drive, 255-256, 269 
Vacuum and Overflow, 592 

Acts, 133, 177, 183-184, 185, 189, 252 
consummatory, 572-577 
pure-stimulus, 302 
receptor-orienting, 316, 339 

Adaptation, 524, 525, 530 
(See also Practice) 

Advertising, 273 

Afferent stimulus interaction, 316 

Aggression, in contiguity theory, 170 
in Lewinian theory, 55 

Agonistic behavior, 577, 586-587, 590-

9 

Agreement, intersubjective, 164 

Alcohol, 225-226 

Algebra, of inequalities, 285, 306-308 
matrix, 280 

Alien facts of life space, 70-71 

All-or-nothing behavior, 232-233 

Alternation, spontaneous, 518 

anne in linear frequency theory, 


in nonspatial discrimination, 515-516 
scaled, 435-436 
Ambivalence, in conflict of forces, 27-28,
and redirection in behavior, 594-595 
American Psychological Association, 
Policy and Planning Board of, vi 
Amnesia, combat, 226-227 
Amplification ratio, 645, 646, 650, 654 
Amplifying systems, 639, 641-642, 644 
Amplitude of response, 313, 327 
Analog computer (see Servo theory) 


Analysis, human motor behavior, in 
linear frequency theory, 638 
level of (see Level, of analysis*) 
mathematical, 470 
(See also Algebra; Equations; Ex- 
perimental design; Graph theory; 
Learning curves; Method; Mod- 
els*; Prediction*; Probability; 
Quantification*; Scaling; Statis- 
tics) 
rational, 671 
(See also Equations; Learning 
curves ) 
themes of (see Themes of analysis) 
of variance, 444, 633 
Anchoring, concepts, 309-311 
variables, 554-555 
Animal stunt training, 370-371 
Animals, rationale of experimental use of, 


Anthropology, relation of psychology to, 
272, 303 
Anxiety, nomothetic vs. ideographic 
study of, 372, 375 
stimulus-response analysis of, 267-269 
studies showing effect of treatment of, 
373 
(See also Fear) 
Apparatus, tracking, 651-652 
Apparatus breakdown, 367 
Appetitive behavior, 569, 572-577 
(See also Orientation) 
Applied psychology, 371-372 
(See also Linear frequency theory) 
Approach-avoidance conflict, 199-238 
Approach gradient postulates, 205-206, 
228-230 
(See also Conflict, theory of) 
Approach training, Tolman’s conception 
of, 99-117 
Approximation, and correction, 249-250, 
469-472 




Approximation, of equal-interval scales, 
285-286 
successive, 56-58, 233-234, 285-286, 621-622
Aristotelian and Galileian thought, 48-50 
Art combined with theory, 203, 660 
Association, by contiguity, 158-195
afterthoughts on predictability, 193-195
background factors and orienting attitudes, 159-170
formal organization, 193
formulation of method, 191-192
general nature of theory, 170-172
initial evidential grounds, 189-191
introduction, 158-159
quantitative procedures, 192-193
scope of theory, 172-178
suggestions for systematization of, 178-189
laws of, 168-169, 171, 185-189, 262, 541


(See also Law, of Contiguity, of Effect, 
of Exercise; Laws, of learning) 
Associationism, 242, 541-542 
(See also Association, by contiguity ; 
Laws, of association, of learning; 
Rote learning) 
Associative bonds, 39 
Associative learning, 460 
(See also Association, by contiguity ; 
Laws, of association, of learning; 
Rote learning) 
Attention, 165-166, 176, 179-183, 186-
8 


in delayed response, 529 
focusing of, 242, 246, 252 
Attitudes, orienting* (see Themes of 
analysis) 
Authority and social influence, 45-46 
Autocorrelation, 652, 658 
Automatic pilot, 175-176 
Avoidance gradient (see Conflict, theory of)


Avoidance training, 333 
probability theory of, 438-440
Tolman’s conception of, 119-120, 126
Axioms,* 671-672 
as definitions, 173, 200 
interrelations of, in theory, 470, 473n. 
(See also Hypothetico-deductive meth- 
od*; Postulates*; Theorems) 


Background factors and orienting atti- 
tudes* (see Themes of analysis) 
Barbiturates, 225-226 
Barriers, 23, 28, 78-79 
(See also Force) 
Behavior,* agonistic, 577, 586-587, 590-


all-or-nothing, 232-233 
ambivalent and redirection, 594, 595 
appetitive, 569, 572-577 




Behavior,* autochthonous and alloch- 
thonous, 594 
consummatory, 572-577 
defective, 373-374 
definition of, 68-70, 192 
description and classification of, 568-580


displacement activities, 586, 590, 593- 
595 
evolution and function of, 564, 567, 
577-578, 595-596 
flexible, 248-252 
general studies of, 590 
human motor, in linear frequency
theory, 638, 646-651
innate, 574-575, 584-585, 589 
instinctive, and reflexes, 576-577, 579, 
586n. 
intelligent and stupid, 262 
lawful (see Lawful behavior) 
learned, 574, 594-595 
(See also Learning*) 
of machines applied to man (see Linear frequency theory; Man-machine systems)
maternal and sexual, 571, 591-592 
practical control of, 371-372, 375 
predictability, 163, 193-195, 564-565
(See also Prediction*)
purposive (see Principles of purposive behavior)
random, 519-520, 523
of scientists, 359-361, 378, 382, 539
in space, 233-234
(See also Life space)
speech, 638-639 
superstitious, 321n. 
threat and courtship, in birds, 571, 
a 586-587, 590-591, 594, 
595
Behavior of Organisms, The, 369, 372 
Behavior objects, 73-74
Behavior sciences, 303
Behavior setting, 73-74
Behavior-support, 103
Behaviorism, 165, 173, 174, 190-191, 295,
361-362, 385, 480 
Dee oe means-ends readinesses, 105- 
1 
Binary computer, 639, 640 
Biological een 3, 175, 383, 565-566 
Bit, 615, 619, 621-622, 639-640 
(See also Information) 
Black box, 643 
Blank trial, hypotheses of effects of, 418 
(See also Nonreinforcement) 
Boredom, 260 
Boundaries (see Life space; Path;
Psychological environment; Region)
Brain, cortical localization and learning
sets, 508-509 
electrical stimulation of, 258, 260, 
263n., 280 
filing system of, 247-248 
Bridging problems, 3 




Calibration of learning material, 544-545, 547-549
(See also Scaling) 
Capacity, intellectual, 247-248 
(See also Intelligence) 
Cardinal scales (see Scaling) 
Causation, 164, 183-184, 257, 492-495 
commensurability and, 52-54 
contemporaneous and genetic, 17-21 
ethologists’ use of causal, functional, 
and historical, 563-564 

(See also Description; Explanation; 
Language, of description*; Level 
of analysis*; Reductionism* ) 

Cell assembly, 243, 493 

Centers, nervous, hierarchical system of,
567-568, 582-583, 586, 596-597

Channel capacity, 623, 624, 628-629,

6 


Channel theory of food habits, 47, 72-73


Child development, 276 
Choice-point learning, Tolman’s con- 
ception, 120-122 
(See also Discrimination learning; 
Maze learning) 
Chronaxie of subordination, 368 
Classical conditioning, 127, 166-167, 
237, 312-313 
and habituatory decrement, 432-433, 
444 
and instrumental conditioning (see 
Conditioning) 
reinforcement in, 331-334 
Classification, 565-566, 579-580 
causal, 577 
functional, 577-578 
historical, 578-579 
Clinical derivations of S-R theory, 218- 
219, 228-230, 267-269, 274 
(See also Psychotherapy) 
Coding, 616-631 
binary numbers, 616 
ensemble of messages, 617, 622 
(See also Information; Information 
theory) 
Coding theorem, 611, 613, 621-622 
Cognition, 385 
Cognitive dissonance, 79 
Cognitive maps, 128, 336-337 
Cognitive structure of psychological 
environment, 25-26, 53, 83-84 
Cognitive theory, 243, 262, 339 
(See also Principles of purposive be- 
havior) 
Cognitive tuning, 84 
Combat neurosis, 230, 267-269, 275-276 
Commission on Community Interrelations 
of the American Jewish Congress, 
46-47 
Communication, social, in animals, 595-596
(See also Information theory; Linear 
frequency theory) 




Communication systems, discrete and 
continuous variable, 639-641 
Comparable-trial error ratio, 515-516 
Comparative psychology, 445, 493-494, 
504-508, 566-567 
(See also Ethology) 
Compensatory tracking, 651-652 
Competition and cooperation, 46 
Components, in ethology, 571, 572-573,
575, 580, 590-591
in linear frequency theory, 640, 642-645, 647


harmonic, 649 
Computer, binary, 639, 640 
man’s capacity as, 250-251 
(See also Cybernetics; Feedback; 
Linear frequency theory; Servo 
theory) 
Concept formation, 510-511 
Concepts,* anchoring of, 309-311 
commensurability and, 52-54 
comparison of Hull’s and Tolman’s, 
134-135 
dynamic, 23 
maturation of, 252 
methods of abstraction and construc- 
tion in development of, 48-58 
objectives of Lewinian compared with 
Hullian, 13 
operational, 384-385 
(See also Operational definition*) 
rote-learning and S-R, 551-552 
set-theoretical, 447-450, 456-461 
thing-concepts and relation-concepts, 
48 


(See also Constructs*) 
Conditioned differentiation, 371 
Conditioned Reflexes, 361 
Conditioned response, increase of certainty of, 167, 171, 195, 261, 426-
42 
Conditioning, classical and instrumental, 
comparison of, 127, 181, 185, 
261-262, 312-313, 317 
as experimental paradigms, 295-296 
verbal, 411-415, 422-423, 482-484 
(See also Classical conditioning; In- 
strumental conditioning) 
Conditions vs. principles, 203 
Conflict, as affected by drugs, 225-226 
approach-avoidance, 199-238 
in ethological terms, 594 
of forces, 27-28, 54 
(See also Decision; Equilibrium; 
Force; Life space; Person; Psy- 
chological environment) 
laws of, vs. experimental neurosis, 205 
plan of work for studying, 203-204 
as source of drive, 236-237 
(See also theory of, below) 
theory of, 204-238 
algebraic summation vs. dynamo- 
genesis, 225 
avoidance gradient’s greater steep- 
ness deduced, 212-213, 221-222 


Conflict, theory of, deductions, 206-209, 
213 


verifications of, 209-212, 218-222, 228-230
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590-591, 594, 595 
Creativity, 244-247 
Critical frequency, 645, 646, 659 
(See also Frequency analysis; Linear
frequency theory)

Oitia] RAED, ae PAMER > 
tophenomenology, 94, 113, 14 
Cue, 179-180, 182-183, 185, 186, 190-191, 455-456
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words as, 302 
Culture, 274, 303 
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dicating next action, 174 
Deprivation, degree of hunger, and re- 
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tations, 116 

(See also Zeigarnik effect)
Distributed practice, 408-410, 444, 549-
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goal-directed, 271 

from ideas, 270-271 

induced by conflict, 236-237 

interaction between hunger and thirst, 
223-225, 253, 278-279, 317 

as intervening variable, 277-280 

irrelevant, 253-254 

learned, innate factors in, 264-265 

learned channeling vs. elicitation of, 
263-266 

and learning, 259-260, 492-493, 531-532
mediated, 271 
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second- and third-order, 131-132 
social, 266 
as stimulus, 253-256 
strength of, and theory of conflict, 
205-209 
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unitary vs. heterogeneous cluster, 280 
as used by ethologists, 585-586 
(See also Deprivation; Drive-stimulations; Hunger; Motivation*; Motives;
Need-reduction; Reinforcement*; Reward)
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Drive-reduction hypothesis (see Need- 
reduction) 
Drive-stimulations, curiosity, 101n., 102 
discriminated, energizing, and value- 
giving properties of, 100-102 
drive for little effort and/or certainty, 
102-103, 117 
Driving force, 26, 36 
Drugs, effect of, on conflict, 225-226 
on drives, 280 
need to control stimulus change, 236 
techniques for evaluation of, 373 
Dynamism, stimulus intensity, 316 


Early learning, 122-123 
Ecology, 47, 72-80 
Education, learning from motion pictures 
in, 273 
Effect, Law of (see Law) 
Electrical stimulation of brain, 258, 260, 
263n., 280 
Emotion, as drive, 266-267 
as indicating next action, 174 
as reinforcement, 170, 267 
in thinking, 246-247 
(See also Drive*; Fear) 
Empathic ability, 12 
Empirical definitions,* 669, 670 
(See also Definitions) 
Empirical variables,* 668-669 
vs. systematic, 668, 669 
(See also Variables* ) 
Empiricism, 12, 97, 227-228, 542 
and method of abstraction, 48-50 
and relation to formal theorizing, 38— 
47, 80-81, 148-149, 200-204, 
280-281, 295, 313-314, 360-361, 
369, 384, 423-447, 469-472, 539-540, 612, 657-658
in Hullian approach, 301, 303-306, 
330-347 
(See also Classification; Observation) 
Empty organism, 643 
Encephalization, 280 
Engineering psychology, 631 
(See also Linear frequency theory;
Man-machine systems; Servo theory)
Environment, relations between person 
and, 31-35 
(See also Life space; Person; Psycho- 
logical environment) 
Equations, difference, 400-402, 467 
rational and empirical, 200-201, 389- 
392, 671 
(See also Hypothetico-deductive method*; Learning curves; Method;
Models*; Statistics)
Equilibrium, 214 
in correlated reinforcement situations, 343-347
in tension systems, 33, 54, 79 
Equivocation, 623, 625-626 
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Ergodic assumption, 619-620 
Error factor analysis, 494, 500, 512-526 
differential-cue and, 513, 515-516 
error factor producer and, 518-520 
position habit and, 513, 518 
response shift and, 513, 516-518, 525, 
527 


stimulus perseveration and, 513-515, 
524-525 
and uniprocess theory, 526-531, 533 
Errors, 163, 189, 194-195, 478-479, 518, 
555-556, 641-642, 646 
human tracking, 651, 654-656 
integrated, score, 647 
Escape training, movement patterns in, 
probability theory of, 438-440
Tolman’s conception of, 117-119, 126
Ethology, as distinct from ecology, 562 
European, 5, 563 
some recent trends in, 561-598
background factors and orienting 
attitudes, 562-568 
description and classification of be- 
havior, 568-580 
history in mediating research, 589-597
methods valuable outside ethology, 597-598
nature of system, 580-588 
construction of function forms, 
587 


dependent variables, 580-581 
independent variables, 581-582 
intervening variables, 582-583 
mensurational and quantification 
procedures, 587-588 
other hypothetical constructs, 
583-587 
organization of, 588-597 
scope and application, 597 
zoological bias of, 563-564 
Evidence for system (see Themes of
analysis)
Evidential grounds for system, initial 
(see Themes of analysis) 
Evolution, of behavior, 564, 567, 577- 
578, 595-596 
of learning, 531 
Excitatory potential, 325, 526 
as pivotal construct in Spence’s theory, 312-314
Excitement and depression as states in- 
dicating next action, 174 
Exercise, Law of (see Law) 
Expectancy, in means-end readinesses, 
105-107, 113-117
measured by disruption experiments, 116


symbolized as intervening variable, 
107-108 
use of, in contiguity theory, 168, 173 
Expectation as related to time perspec- 
tive, 54 
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Experimental design, 276-280, 360, 369— 
370, 375-376, 544-546, 549 
critical, 673 
(See also Equations; Method; Models*; Prediction*; Probability;
Quantification*; Scaling; Statistics)
Experimental neurosis, 205 
Explanation, and description (see De- 
scription) 
and prediction, 202 
scientific, 162-164, 295 
(See also Causation; Language of description*; Level of analysis*;
Reductionism*)
Exploratory drive, 101n.-102n., 130n., 
493, 517 
and latent learning, 125 
(See also Curiosity) 
Expression of the Emotions, 160 
Extinction, in probability terms, 399- 
400, 417 
resistance to, 128-129, 313, 327, 428 
and partial reinforcement, 436-438
rigid cognitive maps and, 128-129 
spontaneous recovery from, in statis- 
tical theory, 463-464 
use of interfering responses in, 190, 
318, 400, 437-438 
Extinction curve, 367 


Factorial designs, 305-306 
(See also Experimental design) 
Facts, 161-162, 165, 369, 384, 446 
Failure and success, and level of aspira- 
tion, 43 
and Zeigarnik effect, 33-34 
Fatigue, 169-170, 174 
theory of distributed practice, 553 
Fear, as basis for variety of motives, 266 
conditioning of, to new cues, 263 
control of, 268-269 
and gradient of reinforcement, 213, 
225-226, 228-230 
measures of, 235 
new view of, 266-269 
as related to time perspective, 54 
(See also Anxiety) 
Fechner’s law, 165, 307-308 
(See also Weber’s law) 
Feedback, 568 
circuit, 641-642, 647 
negative, 216, 232, 641-642 
positive, 232-233 
Proprioceptive, 250-251, 314 
stimuli, 103, 115, 188, 214 
(See also Linear frequency theory;
Servo theory)
Field theory, 9-10 
characterized as method, 9 
future developments in, 82-84
(See also Gestalt psychology; Lewinian theory)
Fields of psychology, 17-18 
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Filter, low-pass, 646 
Fixation, 188 
as law of learning, 128-129 
(See also Rigidity) 
Fixed action patterns, 564, 573, 574, 
578-579, 583-584, 593, 596 
Food habits, 47, 72-73 
Force, conflict of, 27-28, 54 
direction of, in Lewinian and in pur- 
posive behavior theory, 136-137 
as interrelation of geometrical and dy- 
namic constructs, 61 
own, induced, and impersonal, 27-28, 


resultant, driving, and restraining, 26- 
(See also Decision; Equilibrium; Life 
space; Person; Psychological en- 
vironment) 
Force field, 26-27, 32, 53 
goal in, 53-54, 83 
Force mentale, 169-170 
Foreign hull of life space, 70-72 
(See also Life space) 
Foresight, 217, 246 
Forgetting, differential, 555-556 
Form discrimination, 493 
Formal organization of system (see Axioms*; Hypothetico-deductive method*;
Postulates*; Themes of analysis; Theorems)
Fourier integral theorem, 649 
(See also Linear frequency theory) 
Fractional anticipatory goal response, 335 
as mechanism to relate variables, 311 
Frequency in learning, 125, 186 
Frequency analysis, 642, 645-651 
of speech, 638-639 
(See also Linear frequency theory) 
Frequency response curve, 645 
Frustration, in contiguity theory, 170 
in learning set theory, 511-512, 522 
in Lewinian theory, 28, 42, 55-56 
Function of behavior, 564 
Function-form specifications,* 670-671 
Function forms, construction of* (see 
Themes of analysis) 
Functional autonomy, 130, 263-264 
Functionalism, 543-544 


Galileian and Aristotelian thought, 48- 
50 


Game theory (see Probability learning) 
Gatekeepers, 47, 72 
Generalization, 133, 248-250, 253-254 
in discrimination learning, 127-128, 
337-339, 504-505 
language as cue for, 246 
as law of learning, 127-128 
of satiation, 141 
and Zeigarnik effect, 34-35, 40-41 
(See also Learning set and error fac- 
tor theory; Transfer; Transposi- 
tion) 
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Gestalt psychology, 95, 179, 186, 269-270 Hunger, degree of deprivation and re- 


(See also Field theory; Lewinian theory)

Goal in force field, 53-54, 83 
Goal-directed drive, 271 
Goal gradient, 205 

(See also Conflict, theory of) 
Goal orientation, 244-246, 249-250 
Graph theory, 26n., 61-65 
Gravitation, law of, 168-169, 202 
Group dynamics, 35-37, 42, 79 
Guessing (see Probability learning) 
Guilt as related to time perspective, 54 
Gustatory and nongustatory stimuli, 104-

10 


Habit-family hierarchy, 248, 251-252 
Habit strength, 169, 260, 300, 312, 313,
317-318, 319, 320n., 386-387, 428 
as example of relationship between 
postulates and data, 304-306 
Habits, previous, effect of, on learning, 
112-113, 524, 525, 530, 545-546 
(See also Learning set and error factor 
theory; Learning sets) 
Hallucinatory behavior, 218 
Helplessness and neurosis, 275-276 
Heterogeneous summation, 589, 592 
Hierarchical system of nervous centers, 
567-568, 582-583, 586, 596-597 
History of system in mediating research* 
(see Themes of analysis) 
Hodology as extension of topology, 61 
(See also Lewinian theory) 
Homeostasis, 175-176 
Hope as related to time perspective, 54 
Hull, foreign, of life space, 70-72 
Hull-Spence approach, 293-348 
and empirical research, 330-347 
absolute vs. relational discrimination 
learning, 337-339 
continuity vs. noncontinuity in dis- 
crimination learning, 339 
correlated reinforcement, 339-347 
reinforcement vs. contiguity in 
conditioning, 331-335 
response vs. stimulus learning in 
conditioning, 335-337 
Hullian approach to theory, 298-309 
derivation of theorems, 306-308 
method of postulate construction, 
303-306 
nature of, 299-301 
quantification, 308-309 
relation to other areas, 301-303 
introduction, 294-298 
summary and conclusions, 347-348 
theoretical models, 309-330 
formalization, degree of, 328-330 
input intervening variables and relationships, 314-325
output variables, 325-328 
Spence’s theory, 311-314 
Hunger, definitions of, 310
degree of deprivation and response probability, 441-442
as drive indicating next action, 174 
interaction with thirst, 223-225, 253, 
278-279, 317 
modified by learning, 263-264 
Hydraulic models, 567, 582-583, 585 
Hypotheses in discrimination learning,
339, 523-524 
Hypothetical constructs, 15-16, 57-58, 
97-98, 180, 582-587 
Hypothetico-deductive method,* 97-98, 
150, 173, 234, 295, 303-306, 368, 
369, 375, 539-540, 663, 667, 671- 
672 


(See also Axioms*; Deductions; Equa-
tions; Method; Models*; Postu- 
lates*; Theorems) 


Ideas as drive, 270-271 
Ideographic approach, 51, 96, 163, 177- 
178, 309, 369-373, 376-377, 480 
(See also Prediction*) 
Images, 243 
Imprinting, 265, 594-595
(See also Innate Releasing Mechanism)
Incentive, 284-285, 313, 318-323 
(See also Reinforcement*; Reward) 
Incidental learning, 122-123 
Independent-intervening-dependent variable schema,* 667-668, 670
Independent probability, 167, 484
Independent variables* (see Variables*) 
Index numbers, explanation of, 674-675 
Individual differences, 372, 374-375 
Individual style, 177-178
Inertia, 643
Information, statistics and probability 
theory of, 629-630 
transmission of, 622-631 
channel capacity, 623, 624, 628-629,
631
continuous, 628-629
equivocation in, 623, 625-626 
multivariate and correlational 
analysis in, 633-634 
noise in, 623-625, 628 
rate, 628 
theorem for, 626-628 
unit of, 615 
(See also Bit; Information theory) 
Information theory, 611-634
application in behavioral studies, 629- 
63 


basic, 614-629 
introduction, 611-614 
and servo theory compared, 639-641 
Inhibition, 318, 386-387, 518, 526, 530- 
531 


(See also Reactive inhibition; Retro- 
active inhibition; Work inhibition) 
Initial evidential grounds for system
(see Empiricism; Themes of analy-
sis)
Innate behavior, 574-575, 584-585, 589
Innate factors, in learned drives, 264-265
in neurosis, 275 
Innate Releasing Mechanism, 584-585, 
595-597 
Inner experience, 174 
Input, comparator, 642 
forms used experimentally, 652 
Input-output system, 643-645, 648, 650- 
651, 653-657 
Input variables, 314-323, 639, 650 
(See also Information theory; Linear 
frequency theory; Variables*) 
Insight, 496-497, 510 
Instinctive behavior, 576-577, 579, 586n. 
Instrumental conditioning, 127, 146, 
150-151, 313, 388 
approach, 99-117 
avoidance, 119-120, 126, 333, 438-440 
escape, 117-119, 126 
movement patterns, 184-185 
reinforcement in, 334-335, 403-405 
token, 129-130 
(See also Conditioning; Learning*)
Integration of behavior sciences, 303 
Intelligence, 247-248, 262, 567 
as capacity for minimal rehearsal, 168 
estimated by learning sets in animals, 
507-508 
Intention movement, 577, 579, 595 
Interaction (see Analysis of variance; 
Experimental design) 
Interdependence, 18-19, 31-32 
and causation, 17 
simple and organizational, 29-30 
Interference, clinical use of, 190 
contiguity principles, 459-460 
in extinction, 190, 318, 400, 437-438 
retroactive and proactive, 428-429, 
545-546, 556 
Intergroup relations, 46-47 
Interproblem learning (see Learning set
and error factor theory; Learning
sets)
Interrupted task (see Disruption experi- 
ments; Zeigarnik effect) 
Intervening variables* (see Variables*) 
Interviews, content analysis of, 228 
Intraproblem learning curve, 495-497, 
501, 511-512 
(See also Learning set and error factor 
theory; Learning sets) 
Irrelevant drive, 253-254 


Jordan curves, 59 
Knowledge by acquaintance, 163-164 


Language, data and construct,* 38-47, 
149, 384-385, 668-670 
of description,* data language* (see 
data and construct, above) 
Language, of description,* as factual
and public, 161-170, 173, 540 
generality of, 299-301, 400 
life-space system vs. stimulus-re- 
sponse, 233-234 
psychological vs. physical or physio- 
logical, 14-15, 96, 170, 174- 
178, 568-580, 583-587 
(See also Causation; Concepts*; 
Constructs*; Definitions; De- 
scription; Explanation; Mod- 
els*; Reductionism*)
functions of, 246, 273 
learning of, 273 
linear frequency theory applied to, 
638-640 
and mediation, 246 
as variable affecting behavior, 302 
Latency, laws relating to, 462 
as output variable, 326 
(See also Speed) 
Latent learning, 122-123, 125-126, 149, 
246, 259, 334-335 
Law, of Contiguity, 168-169, 171, 185- 
190, 262, 317-318 
of Effect, 95, 106n., 124-131, 171, 
173, 186, 262, 334 
of Exercise, 125-126, 128 
of gravitation, 168-169, 202 
of parsimony, 200, 262, 303, 311, 405 
Lawful behavior, 203, 362, 369 
contiguity theory and, 162-170 
Hull-Spence theory and, 295 
Lewinian theory and, 51-52 
in linear frequency theory, 657-660 
in Miller’s liberalized S-R theory, 238- 
252, 261-262 
in purposive behavior theory, 96 
Laws, of association, 168-169, 171, 185-189
(See also Law, of Contiguity, of
Effect, of Exercise, learning, below)
ranei 295, 296-297, 299-301 
level of generality of, 299-300 
of conflict, 205 
(See also Conflict, theory of) 
of learning, 95, 163, 168-169, 261-262 
Contiguity (see Law) 
derivation of, 461-469 
Effect (see Law) 
Exercise (see Law) 
fixation, 128-129 
generalization, 127-128, 302, 404 
need-reduction, 126, 128, 130-132, 
240, 256-261, 265, 267-269, 331- 
332, 492-493 
quantitative, 129 
in response probability terms, 393- 
395 
Tolman’s, 124-130 
(See also association, above) 
of logic, 161, 162, 193, 200, 306, 461 
of nature, 162, 217 
Leadership, styles of, 45-46
Learning,* all-or-none, 403
approach, 99-117 
as association by contiguity, 171-172, 
190-191, 193 
(See also Association by contiguity; 
Contiguity) 
associative, 460 
avoidance, 119-120, 126, 333, 438-440


classical conditioning, 127, 166-167, 
237, 312-313, 432-433, 444 
(See also Conditioning) 
cognitive theories of, 262 
(See also Principles of purposive 
behavior) 
comparative studies of, 445, 493-494, 
504-508 
(See also Ethology, some recent 
trends) 
conditions and principles, 272 
contiguity theory, 261 
(See also Association, by contiguity) 
definitions of, 395-396, 399, 453 
discrimination (see Discrimination 
learning) 
distributed practice and (see Dis- 
tributed practice) 
effect of drive on, 259-260, 492-493, 
531-532 
and effort involved in response, 246 
a 126, 184-185, 438- 


evolution of, 531 
frequency and, 125, 186 
imprinting, 265, 594-595 
(See also Innate Releasing Mecha- 
nism) 
incidental, 122-123 
as roh ung inhibition process, 530— 


of language, 273 

latent, 122-123, 125-126, 149, 246, 
259, 334-335 

laws of (see Laws) 

levels of, 244, 262 

maturation and, 595 

maze (see Maze learning)

oddity problem, 498, 503, 508, 518, 520,


ontogenetic development of, 493, 594-


perceptual, 269-270 
place vs. response, 335-337 
practice and, 112-113, 524, 525, 530, 
545-546, 557-558, 575 
(See also Learning set and error
factor theory)
probability, 410-415 
(See also Statistical learning theory) 
problem difficulty and, 520 
rates and asymptotes, 420-421 
recency in, 125 
response vs. stimulus, 335-337 
piep l 339, 410, 498, 500, 515, 521- 




Learning,* rote (see Rote learning)
scanning as part of, 168, 169, 181, 
182, 188 
secondary reward, 129-130 
sensory preconditioning, 336 
sets (see Learning set and error factor 
theory; Learning sets) 
simple, 465-467, 494 
social, 272-276 
statistical systematization of, 4 
(See also Statistical learning theory) 
S-R (see Hull-Spence approach; 
Liberalization of basic S-R con- 
cepts) 
theoretical representation of, 453-454 
(See also Learning theory*; Mod- 
els*; Theory) 
Tolman’s definition of, 124 
Tolman’s laws of, 124-130 
Tolman’s paradigms, summary of, 123 
and transfer, 502-503, 504 
trial-and-error, 244-246, 328, 496-497 
verbal, paired-associate, and serial, 
442-443
(See also Verbal conditioning) 
(See also Conditioning) 

Learning curves, 129, 190-191, 200-201, 
301, 366, 367, 372-375, 386-389, 
397-398, 407-408, 545, 671 

(See also Equations) 
intraproblem and interproblem, 495- 
497, 501, 512-513 

Learning to learn (see Learning set and 

error factor theory) 

Learning set and error factor theory, 

492-533 
concluding comments, 531-533 
error factor theory, 512-526 
application, 521-526 
nature of, 512-520 
intraproblem and interproblem learn- 
ing curves, 495-497 
introduction, 492-495 
learning set, 497-512 
cortical localization, 508-509 
curve, form of, 497-500 
and efficiency of learning, 501-503 
mathematical formulation of, 503-504


ontogenetic and phylogenetic de- 
velopment and, 504-508 
significance of the theory, 509-512 
learning set, error factors, and uni- 
process theory, 526-531 
Learning sets, 4-5, 252, 410
discrimination, discrimination-reversal, 
and oddity, 498, 503-508, 518 
formation of, 494-495
with limited trials in discrimination 
problems, 496-497 
primate and subprimate, 504-508 
and probability, 503-504 
(See also Learning set and error factor
theory; Practice)
Learning theory,* 5
applied to advertising and education, 


classical, 480-481 
deterministic compared with proba- 
bilistic, 386-387 
inhibition as fundamental process in, 
530-531 
statistical (see Statistical learning 
theory) 
status of contemporary, 158-159, 193, 
200-203, 261-262, 386-388, 493 
quantitative, 481-486
uniprocess, 526-531, 533 
(See also Systematic formulations; 
Theory) 
Level, of analysis,* 666 
behavioral vs. neurological, 14-15, 
96, 170, 199-203, 238-242, 295, 
299-303, 363, 383-386, 564-565 
from complex to simple, 494 
in correlated reinforcement, 341-343
in general vs. particular terms, 400, 
58 
and logic of science, 50-51, 539-540
ontogenetic emphasis of, 493 
from qualitative to quantitative, 566. 
567 
(See also Causation; Description; 
Explanation; Language of de- 
scription*; Method; Models*; 
Quantification*; Reductionism*)
of aspiration, 42-43 
of reality, as property of life space, 
24 


Lewinian theory, 7-84 
background factors and orienting atti- 
tudes, 10-21 
concreteness, 16-21 
description and explanation, 13-16 
open-minded observation, 11-13 
task of psychology, 10-11 
conclusions, 80-84 
formal organization of system, 65-80 
initial evidential grounds for assump- 
tions of, 38-47 
mensurational and quantification pro- 
cedures, 58-65 
method of construction, 48-58 
miniature systems in, 65 
reflexes in, 80 
structure of system, 21-38 
concept of person, 29-31 
life space, 22-24 
psychological environment, 24-29 
relations between person and envi- 
ronment, 31-35 
social relations, 35-37 
summary, 37-38 
Liberalization of basic S-R concepts, 196- 
286 
approach-avoidance conflict behavior, 
199-238 
background factors, 199-204 
Liberalization of basic S-R concepts,
approach-avoidance conflict be- 
havior, extensions to psycho- 
therapy, 226-230 

general evaluation, 230-234 
inclusion of stimulus similarity and 
displacement, 217-226 
intermediate and long-range strategy, 
234-238 
postulates, deductions, and variables 
in first stages, 204-217 
introduction, 198-199 
and liberalization of S-R theory, 238-
basic methodological questions, 276- 
5 


experimental design required, 
276-280 
implications for psychopharma- 
cology and central stimula- 
tion, 280 
impure measures vs. specific 
factors, 280
quantification problems, 280-285 
effect of carrying variables 
through entire range, 284- 
285 
effects of inequalities in scale 
units, 281-283 
gradual approximation of exact 
form of functions, 285 
no type of operation assures
equal-interval scale, 283-284 
strategy of using qualitative 
tests first, 281 
changes in basic concepts, 238-252 
new trails for research on motiva- 
tion, 262-272 
selected controversial issues, 253-272
social learning, personality, and
psychotherapy, 272-276
Life space, Brunswik’s characterization, 
definition of, 66 
dimensions of, 23 
ecology and, 72-80 
essential features of, 22-24 
force in, 26-28, 32, 36-37, 54 
foreign hull of, 70-72 
formulation of construct of, 65-72 
the person and, 22 
and phase space, 18-19 
psychological environment and (see 
Psychological environment) 
structural and dynamic properties of, 
23 
time perspective as property of, 23-24
topology applied to, 59-65 
(See also Person) 
Limits of Prediction,* 666
(See also Prediction*) 
Linear frequency theory, 637-660
applications of, 646
experimental, to human tracking be- 
havior, 651-652 
Linear frequency theory, feedback cir-
cuit, 641-642 
frequency analysis, 642-645 
and human behavior, 647-651 
introduction, 637-639 
linearity and, 645-646 
linearity of human response, 652-657 
man-machine system performance, 
639-641 
methodological commentary, 657-660 
Linear graphs, and directed graph, 63 
and irrevocable path, 63 
and planar maps, 61-63 
and signed graphs, 64 
and strength, 64-65 
Linearity, 645-646, 648-651 
of human response, 652-657 
Locomotion, coordinated to behavior, 
69 
in psychological environment, 25, 53 
Logic, laws of, as conventions, 161, 162, 
193, 200, 306, 461 
of science, 50-51, 539-540 
(See also Axioms*; Hypothetico-de- 
ductive method*; Postulates* ; 
Theorems) 
Loop circuit (see Feedback; Servo 
theory) 


Machines (see Linear frequency theory; 
Man-machine systems)
Man-machine systems, 637-641, 643-
64. 


(See also Linear frequency theory; 
Operator; Servo theory) 
Markov processes, 430, 483-484, 611, 
620-622 

(See also Information theory)
Massed practice, 408-410, 438, 444 

(See also Distributed practice) 
Mathematical variables,* 669 
Mathematico-deductive theory, 539-540 

(See also Hypothetico-deductive 

method*) 

Mathematics* (see Algebra; Analysis;
Equations; Experimental design;
Graph theory; Learning curves;
Measurement; Method; Models*;
Prediction*; Probability; Quantifica-
tion*; Scaling; Statistics)

Matrix algebra, 280 

Maturation, of concepts, 252 

of form discrimination, 493 
and learning, 595 
Maze learning, labyrinthian and straight
runway, 203, 532 
probability approach and, 410-411, 
457-458 


Tolman’s conception of, 120-129,
142-144, 336-337 
Means-end readinesses, 105-107, 112-


in escape training, 118-119 
magnitudes of, 107 
Means-end readinesses, measured by
transfer experiments, 116 
pointer-readings for, 111-112, 113n.,
114n. 
telescoping of, 106 
two types of, 112n. 
Measurement,* psychological, 473-474,
671 


(See also Algebra; Analysis; Equations; 
Experimental design; Graph the- 
ory; Learning curves; Method;
Models*; Prediction*; Probabil-
ity; Quantification*; Scaling; Sta- 
tistics) 

Measures, of drive, need for, 260, 278-280 

nonequality of units, 211 

unit of measurement, 308-309

(See also Method; Quantification*; 
Scaling) 

Mechanism, Innate Releasing, 584-585, 
595-597 

motor coordination, 584

to relate independent and intervening 
variables, 311, 317, 327 

Mediation, language and, 246 
of learned drives, 271 
Memory, amount, as function of reward,
31 


dynamic studies of, 39
(See also Extinction; Forgetting; Re- 
call; Retention; Retroactive in- 
hibition; Zeigarnik effect) 
Mensurational and quantificational pro- 
cedures* (see Mathematics*; Meas- 
urement*; Themes of analysis) 
Mental hygiene, 276 
(See also Clinical derivations of S-R 
theory; Psychotherapy) 
Message, 611 
channel theorem, 611, 613
Coding Theorem, 611, 613, 621-622 
short, applications of information 
theory in, 621-622 
(See also Information) 
Metatheory, 9
Method, of abstraction and construction, 
48-58 
hypothetico-deductive,* 97-98, 150, 
173, 234, 295, 303-306, 369, 375, 
539-540, 663, 667, 671-672
of naturalistic observation, 267-268, 
271, 566 
scientific, prestige of, 360-361 
of statistical learning theory construc- 
tion, 469-474 
of successive approximation, 56-58, 
233-234, 285-286, 621-622 
variations in procedure, 546-549 
(See also Experimental design; Graph 
theory; Models*; Quantification*; 
Scaling) 
Methodology,* 663-673 
Methods, concepts, principles, valuable 
outside context of system* (see 
Themes of analysis) 
Mind as ability to learn, 174
Miniature systems, 217, 485-486, 539 
in Lewinian theory, 65 
Model and real organisms, 300-301, 612 
Models,* building of, as part of training
in psychology, 360 
deterministic growth, 386-387 
frequency analysis, 648-651, 657-660 
general mathematical, 5, 639, 647, 657 
hydraulic, 567, 582-583, 585 
information theory, 387, 551, 611, 613, 
629, 639 
for learning set data, 503-504 
Poisson, 387 
psychological and physiological, 15, 
369, 551 
in purposive behavior theory, 96 
and relation to empiricism, 80, 375 
(See also Empiricism) 
and relation to theory, 172, 202, 663 
response, 450-451
probabilistic, 392-395 
in servo theory, 639 
statistical, 387, 423-430, 446-447, 458 
stimulus and stimulus fluctuation, 
451-452 
stochastic, 387, 429-430, 481-486 
theoretical, in Hull-Spence approach, 
309-330 
topological, 59-65 
utility and role of, 666 
(See also Method; Quantification*; 
Theory) 
Molar vs. molecular interpretations, 300- 
302, 391, 397, 562 
Mood, 587 
Motivation,* 385 
exploratory-curiosity, 493 
(See also Curiosity) 
functional autonomy of, 130, 263-264 
internal, overemphasized, 493 
Lewinian treatment of, 77 
need-reduction (see Need-reduction) 
new trails for research in, 262-272 
ontogenetic development of, 493, 511-512
secondary reinforcement, 129-130 
summation of, 263 
theory of distributed practice, 553-554
Tolman’s conception of, 130-132 
(See also Deprivation; Drive*; Drive- 
stimulations; Hunger; Motives; 
Need-reduction; Reinforcement*; 
Reward) 
Motives, effect on perception of, 108n., 
130-132 
external vs. physiological, 227, 493,
531-532 
fear as basis for variety of, 266
(See also Deprivation; Drive*; Drive- 
stimulations; Hunger; Motiva- 
tion*; Need-reduction; Rein- 
forcement*; Reward) 
Motor behavior, analysis of, in linear 
frequency theory, 638 
Motor pattern as characteristic of appeti-
tive behavior, 572-577
Motor skills, intersubstitutability of, 132- 
133, 149 
unconscious acquisition of, 262 
Motorium, 70-72, 74-77 
Movement, intention, 577, 579, 595 
response as, 183-190 
(See also Acts) 
Movement-produced stimuli, 180, 181, 
188-189 


National Science Foundation, vi, vii, 6 

National Training Laboratories in Group 
Development, 45 

Natural science, 50, 183, 541, 664-665 

Naturalistic observation, 267-268, 271, 
566 


Need-reduction, 126, 128, 130-132, 240, 
256-261, 265, 267-269, 331-332, 
492-493 

(See also Deprivation; Drive*; Drive- 
stimulations; Law, of Effect; 
Motivation*; Motives; Reinforce- 
ment*; Reward) 

Needs, 312, 316-317 

quasi, 32-33, 39 

(See also Deprivation; Drive*; Drive- 
stimulations; Motivation*; Mo- 
tives; Need-reduction; Reinforce- 
ment*) 

Negative valence (see Valence) 

Neurophysiology, 565, 583-584, 596-597 

Neurosis, combat, 230, 267-269, 275-276 

experimental, 205 

innate factors in, 275 

learning of, 274 

Newtonian mechanics, 664 

Noise, random, 654 

transmission, 623-625, 628 

white, 652

Nomothetic approach (see Ideographic 
approach; Prediction*) 

Noncontinuity vs. continuity, 339, 497- 


Nonlinearity, 652-657 
Nonreaders, 180 
Nonreinforcement, effects of, 415-420,
437-438 
Nonsense syllables, 541 
(See also Calibration) 


Obesity, methods of studying, 376-377 
Objectivity, 164-170, 173, 174, 328-329, 
375 


Observation, concreteness of, 16-21 
as influenced by formal theory, 12, 
377-378 
(See also Empiricism) 
natural-history type, 236, 267-268, 
271, 566 
Observer, role of, minimized, 164-165,
169
(See also Objectivity) 
Oddity problem, 249, 498, 503, 508,
518, 520, 523 
(See also Learning set and error factor 
theory) 

Operant conditioning (see Conditioning; 
Instrumental conditioning) 

Operational definition,* 50-51, 201, 658, 
669, 670 

Operationism, 384, 396 

(See also Empiricism) 

Operator, in input-output relationships, 

648-657 
(See also Man-machine systems)

Operators, trial, 467 

Ordinal scales (see Scaling) 

Organization of system, formal* (see 
Themes of analysis) 

Orientation, as characteristic of appeti- 
tive behavior, 572-575, 576n., 592, 
593, 594 

goal, 249-250, 271 

Orienting attitudes* (see Themes of 
analysis) 

Origin of Species, The, 160 

Oscillation, and conflict theory, 232-233,
235 


inhibiting excitatory potential, 312, 
325 


in linear frequency theory, 642-645 
as treated by Guthrie, 194 
Output variables, 325-328, 650 
(See also Input-output system; Linear 
frequency theory; Variables*) 
Overflow activities, 592 
Overlapping situations, influence of, on
behavior, 28-29, 54
and potency, 29
(See also Equilibrium; Life space;
Psychological environment)


Pain, 279 
Pain-avoidance, 131, 317, 373 

(See also Drive*; Motivation*; Need- 

reduction; Reinforcement*) 

Paired-associate learning, 442 
Parameters, invariance of, 405-410 

of reinforcement, 433-436 
Parsimony, 200, 262, 303, 311, 405 
Partial reinforcement, 332-334, 345, 368,


and expectancy theory, 124n. 
fixed-interval, 368-369 
fixed-ratio, 368-369 
multiple fixed-interval fixed-ratio, 374 
and resistance to extinction, 436-438 
Path, irrevocable, 63 
between regions of psychological en-
vironment, 26 
as treated in graph theory, 62 
(See also Psychological environment) 
Pattern, fixed action (see Fixed action 
Patterns) 
of movement response, 184-185, 242 
Pattern, of stimuli, 179-180, 187, 192,
193, 254, 316, 338-339 
(See also Transposition) 
Perception,* 385 
and central responses, 242-244 
and drive elicitation, 269-270 
as influenced by motivated organism, 
130-132 
as intervening variable, 107-108, 116- 
117 


ontogenetic development of, 493 
social, 44, 73 
(See also Attention) 
Perceptual definition of stimuli, 165-166 
Perceptual discrimination spaces, 138- 
142 
Performance, consummatory, 137 
defined, 147 
as distinct from learning, 148-149
exploratory scanning or identification, 
137-138, 140, 142, 146 
pragmatic, 137-138, 140-146 
principles of, 132-146, 337
interaction between performance 
vectors, 135-137 
for motor skills, 132-133 
perceptual discrimination spaces,
138-140, 142 
performance vector, 134-135 
prediction of performance, 133-135
repression, 146 
vs. response, 95 
Perseveration theories of distributed 
practice, 550-551, 556-558 
(See also Error factor analysis)
Person, as energy system, 170 
as treated by Lewin, 29-35, 66-67, 
70-72
(See also Life space; Psychological 
environment) 
Perspective, 19-20 
(See also Time perspective) 
Phase sequence, 243, 270
Phase space, 18-19
(See also Life space) 
Phenomena, time-dependent, 463-464
Phenomenal field, 67-68
(See also Life space; Psychological 
environment) 
Physicalism, 383 
(See also Reductionism*) 
Physics, development of, as science, 50-
51, 164, 173, 174, 191, 202, 203 
and relation of psychology, 383, 565 
Physiology and relation to psychology, 
303, 565 
Physiology of the Brain, 361 
Pigeons as psychophysical observers, 371, 376


Pill machine, 367-368
Place learning vs. response learning,
335-337 
Planar maps and linear graphs, 61-63 
Pointer readings, of independent var- 
iables, 109-111, 119
Pointer readings, of intervening varia-
bles, 111-116, 119 
summary, 116-117 
Position as represented in psychological 
concepts, 53 
Position habit, 513, 518 
Positive reinforcement (see Reinforce- 
ment*) 
Positive valence (see Valence) 
Postulates,* 295, 300-302 
of conflict theory, 205-206 
and deductions, 206-213, 218-219, 
228-230 
differences between Hullian 1943 and 
1949 sets of, 319-320 
method of construction, 303-306 
physiological interpretations of, 301-302


quantification of, 308-309 
(See also Axioms*; Hypothetico-de- 
ductive method*; Theorems) 
Power, determinants of, 37 
Power field, 36-37, 54 
(See also Force) 
Power spectra, 652, 658 
Practice, influence of, on learning, 112- 
113, 524, 525, 530, 545-546, 557- 
558, 575 
(See also Learning set and error fac- 
tor theory) 
Preconditioning, sensory, 336 
Prediction,* 202, 295, 327, 564-565, 
665, 666 
absolute, 176-177, 189, 193-195, 480, 
550-551 
clinical and statistical, 51, 96, 370 
through extrapolation of variable to 
limits, 284-285 
as limited to classes of events, 162- 
163, 203 
macromolar vs. micromolar, 341-343 
and parameter estimation, 407-410 
as principle of performance, 133-135 
from single construct, 297-298 
for substantive vs. formal theory, 612 
with unequal scale units, 281-285, 
308-309 
of values of intervening variables, 148 
of weather, 176-177 
(See also Ideographic approach; 
Level of analysis*; Models*; 
Probability) 
Prejudice, 46-47 
Primate and subprimate learning sets,
504-508 
Principles, vs. conditions, 203 
contiguity-interference, 459-460 
of performance (see Performance) 
of purposive behavior, 92-152
background factors and orienting 
attitudes, 94-98 
analysis level, 96 
empirical reference, comprehen- 
siveness of, 97 
formal organization, type of, 97-98 
Principles, of purposive behavior, back-


ground factors and orienting 
attitudes, models, utility and 
role of, 96 

psychological prediction, nature 
and limits of, 96 

quantitative and mensurational 
specificity, degree and mode of, 


conclusion, 152 
methodological considerations, 146- 
1 
construction of function forms, 
149-150 
evidence, 151 
formal organization, 150 
history of research mediation, 151 
initial evidential grounds for as- 
sumptions, 148-149 
mensurational and quantifica- 
tional procedures, 150 
methods, concepts, principles, val- 
uable outside system, 151 
programmaticity, 151 
scope or range of application, 
150-151 
strategy for development, 151-152 
variables, systematic and empiri- 
cal, definitions of, 147-148 
and probability, 96-97 
structure of, 98-146 
approach training, 99-117 
avoidance training, 119-120 
choice-point learning, 120-122 
early in learning and latent and 
incidental learning, 122-123 
escape training, 117-119 
laws of learning, 124-130 
motivation, 130-132 
performance principles, 132-146 
summary of the five learning 
paradigms, 123 
Probability, contiguity theory and, 163 
of CR occurrence, 313, 327 
dependent, 430, 483-484 
(See also Markov processes)
independent, 167, 484 
in information theory, 617-630, 632- 
634 
learning sets and, 503-504
Lewinian theory and, 51 
purposive behavior theory and, 96-97 
of response (see Response) 
of stimuli, 167, 169 
(See also Models*; Prediction*; 
Quantification*; Response Cer- 
tainty; Statistical learning theory; 
Statistics) 
Probability learning, 410-415 
Probability theory of drive, 440-442
(See also Models*; Statistical learning 
theory) 
Programmaticity,* 151, 231, 234-238,
297, 330, 446-447, 558-559 
(See also Themes of analysis) 
Project A (see Study of psychology)
Projective tests and conflict theory, 220-221
Psychical reality, 20, 23-24 
Psychoanalysis, 199, 274 
(See also Psychotherapy) 
Psychoanalytic causation, 20-21 
Psychological environment, cognitive 
structure of, 25-26, 53, 83-84 
definition of, 67-68 
force and conflict in, 26-28, 32, 36- 
37, 54 
locomotion in, 25, 53 
overlapping situations in, 28-29, 54 
regions in, differentiation of, 24-25, 
31, 61-64 
paths between, 26, 62 
relations between person and, 31-35, 


(See also Life space; Person) 
Psychological force (see Force; Psycho- 
logical environment) 
Psychological space, 59-64 
(See also Life space; Psychological 
environment) 
Psychology, relation of, to sociology, 272,
3 


of science, 539 
Psychopharmacology, 280, 373 
(See also Drugs) 
Psychophysical observers, pigeons as, 
371, 376 
Psychophysics as method for studying 
reinforcement, 258-259 
Psychotherapy, and conflict theory, 226-


counterconditioning in, 25 
fear-reduction and, 267-269 

as learning, 273-276 

need to Be stimulus change in, 


paradoxical negative effect in, 229- 
230 


as profession for psychologists, 275 
scaling of, 227-228 
(See also Clinical derivations of S-R 
theory) 
Psychotic behavior, quantitative study of,


Punishment, as beneficial to learning, 
125-126 
and effect on recall, 144-145 
reduction as reward, 256-261 
and reward in Lewinian theory, 36 
Pure-stimulus act, 302 
Purpose, 174-176 
(See also Principles, of Purposive be- 
havior) 
Purposive behavior theory (see Princi- 
ples of purposive behavior) 
Pursuit tracking, 651-652 
Puzzle box, 184, 190, 203 
Pythagorean theorem, 200, 202n. 
Qualitative functions,* 670-671
Quantification,* difference equations, 
use of, 400-402, 467 
in ethology, 567, 587-588 
experimental designs and quantifica- 
tion problems in Miller’s liberal- 
ized S-R theory, 276-286 
approximation of equal-interval 
scales, 285-286 
effects of unequal units, 281-285 
in learning set theory, 503-504, 512-
520


number of subjects, 369-370, 546 
(See also Ideographic approach) 
ordinal vs. cardinal, 150, 233-234, 
281, 587-588
place of mathematics, in contiguity 
theory, 169, 192-193 
in Lewinian theory, 58-65 
in purposive behavior theory, 96- 
97, 129, 150 
in statistical learning theory, 473-474
of postulates in Hull-Spence approach, 
308-309 
in psychotherapy, 227-228 
(See also Algebra; Analysis; Equa- 
tions; Experimental design; 
Learning curves; Method; Mod- 
els*; Prediction*; Probability;
Scaling; Statistics) 
Quantificational and mensurational pro- 
cedures* (see Mathematics*; Meas- 
urement*; Quantification*; Themes 
of analysis)
Quantitative techniques and specificity,
567, 671 
(See also Level, of analysis*; Models*;
Quantification*; Statistics) 
Quasi need, 32-33, 39 


Random behavior, 519-520, 523 
Range of application of system* (see 
Themes of analysis) 
Rate of response, 327-328, 366, 373-375 
Rational equations, 200-201, 671 
Reaction potential, 177, 319, 323, 324, 
326, 428, 449 
compared with Tolman’s performance 
vector, 134-135 
Reaction Specific Energy, 586 
Reactive inhibition, 312, 324, 526 
compared with Tolman’s drive for 
little effort and/or certainty, 102- 
103 


(See also Inhibition; Work inhibition) 
Reafferenz model, 568 
Recall, delayed and immediate, 144-146 

tension and, 33, 39 

(See also Zeigarnik effect) 
(See also Forgetting; Memory; Reten- 
tion; Retroactive inhibition) 

Recency in learning, 125 
Receptor-orienting acts, 316, 339 
Reduction sentences, 201
Reductionism*, 96, 164-167, 171, 174, 
238-242, 295-296, 301-303, 362, 
383, 385, 564-565 
(See also Language, of description*; 
Level, of analysis*) 
Redundancy, 632 
Reflex reserve, 369, 428, 582 
Reflexes, 178, 576-577 
in Lewinian theory, 80 
Region, central-peripheral and inner- 
outer, 31 
and linear graphs, 61-64 
operational definition of, 24-25 
paths between, 26 
(See also Life space; Psychological 
environment) 
Regression to more primitive error pat- 
terns, 523-524 
Rehearsal theory, conscious and inten- 
tional, 557-558 
Reinforcement,* 256-262, 311, 386-387, 
403-405, 460 
amount of, 318-323, 434-436 
in approach-avoidance conflict, 205-
9


brain stimulation as, 258, 260, 263n., 
280 
in classical conditioning, 331-334 
cognitive theories of, 262, 339 
in contiguity theory, 171, 183, 193, 
261, 403-405 
correlated, 298, 318n., 339-347 
delay of, 318-323 
determinate and nondeterminate, 415- 
420, 466-467 
drive-stimulus reduction hypothesis of, 
256-261 
(See also Need-reduction) 
emotion as, 170, 267 
(See also Fear) 
experiments involving unusual, 257- 


hedonistic, 257 

in instrumental conditioning, 334-335, 

403-405

in learning set theory, 533 

and AR a 415-420, 437- 

8 

operationally defined, 171, 403 

parameters, 433-436 

partial (see Partial reinforcement)

positive, 378

psychophysical studies of, 258-259 

punishment as, 125-126 

random, 410-415 

related to satiation, 257, 269 

secondary, 129-130, 254, 261, 320 

in statistical learning theory, 458-460 

two-factor theories of, 261-262, 317, 
403-405 

(See also Drive*; Law of Effect; Mo- 
tivation*; Motives; Need-reduc- 
tion; Reward) 

Relativity theory, 664
Releasers, 493, 564, 581, 584n., 595
(See also Innate Releasing Mecha-
nism)
Reliability, 640 
Reminiscence, 549 
(See also Distributed practice) 
Representations as intervening variables, 
107-108, 116 
Repression, 226-227, 247, 274-275 
due to approach and avoidance vec- 
tors, 146, 152 
Research Center for Group Dynamics, 
38n., 39, 61 
Research Center for Human Relations, 
47 


Research instigated by system (see Em- 
piricism; Themes of analysis) 
Reservoir model, 567, 582-583 
Resistance, to extinction, 128-129, 313, 
327, 428, 436-438 
in psychotherapy, 275 
Resonance, 644, 646 
Respondent conditioning (see Classical 
conditioning; Conditioning) 
Response,* central cue-producing, 242-


definition of, 251-252 

description of classes of, 569-571 

drive-producing, 242-248 

in ethology, 593-594 

flexible cybernetic, 248-250 

hierarchy of, 241, 251-252 

generalization of (see Generalization; 
Transfer; Transposition) 

eomp pibo 313, 318, 320-322, 324- 
39 


latency of, 326 
macromolar vs. micromolar, distinc- 
tion, 341-343 
probability of, 313, 327 
(See also Response certainty; Sta- 
tistical learning theory) 
degree of deprivation and, 441-442 
and proprioceptive feedback, 250-251 
rate of, 327-328, 366, 373-375 
and reflex, 576 
resistance to extinction of, 128-129, 
313, 327, 428, 436-438 
serial, 180-181 
(See also Maze learning) 
speed of, 326 
as viewed, by Estes, 392-395, 399, 
457-458, 462 
by Guthrie, 166-167, 179, 183-185, 
190 
by Hull and Spence, 297-298 
by Miller, 239-242 
by Tolman, 95, 100, 147 
(See also Acts; Behavior*; Fixed ac- 
tion patterns; Models*) 
Response amplitude, prediction of, 313, 


Response certainty, 167, 171, 195, 261, 
426-427
Response generalization (see Generaliza-
tion)
Response learning vs. stimulus learning, 
335-337 
Response model (see Models*)
Response strength, 166-167 
and relevance of drive (see Conflict, 
theory of; Drive*) 
Restraining force, 26-28, 36-37 
Resultant force, 26, 68-69, 143-144 
Retention, 426-429 
(See also Forgetting; Interference; 
Memory; Recall; Retroactive in- 
hibition) 
Retroactive facilitation, 545 
(See also Transfer) 
Retroactive inhibition, 428-429, 545-
546, 556
Reward, brain stimulation as, 258, 260, 
263n., 280 
in contiguity theory, 171, 183, 433-
436


as defined by Miller, 240, 256-261 

delay of, 318-323 

and hours of deprivation, 305-306 

learnable, 263-266 

magnitude of, 433-436 

operationally defined, 171 

and punishment in Lewinian theory, 
36 


punishment reduction as, 256-261 
token, 129-130 
(See also Need-reduction; Reinforce- 
ment*) 
Rigidity, 42 
and extinction, 128-129 
(See also Satiation) 
Ritualization, 584, 595 
Rote learning, 5, 538-559 
background factors and orienting atti- 
tudes, 540-544 
final word, 558-559 
introduction, 538-540 
some characteristic problems and the- 
ories, 544-549 
standardizing conditions, 546-549 
unit of material, 544-545 
well-practiced subject, 545-546 
typical problem and related theories, 
549-558 
classification of theories, 550-553 
differential forgetting theory, 555-


perseveration theories, 556-558 

work theories, 553-555 
Russell’s paradoxes, 160-161 
Russian device, 370-371 


Saccharine, reducing hunger by, 258 
Salmon runs, 193-194 
Satiation, 30, 41-42, 130n., 140-141, 
257, 269 
and “co-satiation,” 42
Scaling, inequalities in scale units, 281-
285, 308-309, 588
of material to be learned, 544-545, 
547-549 
ordinal and cardinal, 150, 233-234, 
281, 587-588 
of psychotherapy, 227-228 
of social competence in brain-injured, 
164-165 
speed, time, strength-of-pull, 282-283 
Scanning, 168, 169, 181, 182, 188 
Schedules of reinforcement (see Partial 
reinforcement) 
“Schematic sowbug,” 138n., 149 
Science, analysis in, 4 
of behavior, v 
biological and social, 3 
development of physics as, 50-51, 164, 
173, 174, 191, 202, 203 
history of, v 
integration of behavioral, 303 
logic of, 4, 539 
of man, v 
methodology of, 665-666 
(See also Method; Models*; The-
ory)
nature of, 161-170, 173, 200-201 
perspective of Study on, 4
psychological (see Study of psychol-
ogy, Project A)
of science, 4 
Scientific dicta, 11, 93-94 
Scientific method, case history in, 359-
379


specific, 361-379 
introduction, 359-361 
Scope of system*, 81-82, 172-178, 193, 
aa 471-472, 474-475, 612- 
14 


(See also Themes of analysis) 
Searching behavior (see Appetitive be- 
havior) 
Secondary reinforcement (see Reinforce- 
ment*; Reward) 
Selective frequency amplification, 641- 
642 


Sensitivity to signals, 641, 646 
Sensorium, 70-72, 75 
Sensory preconditioning, 336 
Serendipity, 369 
Serial position effects, 442-443 
Serial response, 180-181 
(See also Maze learning) 
Serial verbal learning, 442-443 
Servo theory, 639-649
feedback circuit and selective ampli- 
fication, 641-642 
(See also Linear frequency theory; 
Man-machine systems) 
Sex, as dependent drive, 264 
as drive indicating next action, 174 
increase of, as affecting gradient of 
stimulus generalization, 254-255 
as irrelevant drive, 253
Sign-gestalten, 95
Signal, 179-180, 182-183, 185, 186,
190-191 
input and output, 640, 641-642 
tracing of, 639 
Signal amplifier, 642 
(See also Amplifying systems) 
Similarity in Zeigarnik effect experi- 
ments, 34-35, 40-41 
Sine waves, 639, 644-645, 649-651, 652 
Skills, acquisition of, 180, 183 
motor (see Motor skills) 
Skinner box, 95, 110-111, 116, 185, 203, 
532-533 
Social drive, 266 
Social perception in Lewinian theory, 


Social relations, emotional security and, 


group membership valences in, 35-36 
intergroup, 46-47 
power in, 36 
(See also Group dynamics) 
Sociology and relation to psychology, 
272, 303 
Song in birds, 595 
Space of free movement in Lewinian 
analysis, 28 
(See also Life space; Psychological 
environment; Psychological space) 
Spatial learning, 336-337 
Specific Action Potential, 586 
Speech, linear frequency theory applied 
to, 638-640 
as selection process, 614 
(See also Information theory; Lan- 
guage) 
Speed scores, compared with time and 
strength-of-pull scores, 282-283, 298 
related to reaction potential, 326 
Spence’s theory, summary of, 311-314 
(See also Hull-Spence approach) 
Spontaneity, 582-583 
Spontaneous alternation, 518 
Spontaneous recovery, 428, 463-464 
Standard-defining experiments, 147-148, 
456, 546-549 
Static, 646 
(See also Noise) 
Statistical learning theory, 380-486 
development of, 386-445 
growth since 1950, 405-445 
extension to new empirical areas, 
430-445 
generalizations of original model, 
423-430 


tests of theoretical assumptions, 
405-423 
origins and initial formulation, 386- 
405 
orienting attitudes and systematic 
viewpoint, 382-386 
reinforcement in, 458—460 
systematic and comparative analysis, 
445-486 
evaluation of system, 476-480 


Statistical learning theory, systematic 
and comparative analysis, 
formal structure of system, 
445-469 

interpretation in terms of empiri- 
cal variables, 456-469 
itemization, of empirical depend- 
ent and independent vari- 
ables, 454-456 
and mode of definition of sys- 
tematic constructs, 447-450 
major interrelations among con- 
structs, 450-454 
interrelations with other systems, 
480-486 
methodology, 469-474 
range of application, 474-476 
Statistics, critical ratio and analysis of 
variance, 444 
to determine probability of correct re- 
sponse, 328 
for information measure, 629-630 
as treated, by Guthrie, 167, 169, 177 
by Lewin, 58-65 
by oe 360-361, 369-373, 375- 
376 


by Tolman, 96-97, 129, 150 
(See also Equations; Experimental de- 
sign; Learning curves; Method; 
Models*; Prediction*; Probabil- 
ity; Quantification*; Scaling; 
Statistical learning theory) 
Step functions, 649-650 
Stickleback, 591, 593 
Stimulus,* ambiguity of, scaling, 435-436
compounding and transfer, 423, 457, 
464-465 
consummatory, 573-574, 576, 592, 595 
CS, 166 
definition, by Estes, 455-457 
by Guthrie, 178-179 
by Miller, 239-242 
distal and proximal, 314-316 
drive as, 253-256 
effects, 592 
eliciting, 576, 584-585 
in ethology, 581-582, 591-592 
functional, 179 
internal, 583 
meaning of, 165-166 
movement-produced, 180, 181, 188- 
189 
patterning (see Pattern) 
physical, 166-167, 178 
sampling, 424, 451 
sign, 584-585, 589, 591-592 
as signal in information theory, 630 
situational and effective, 314-316, 
591-592 
threshold, 592 
(See also Threshold) 
variability, 421-429, 452 
variables, 399 
Stimulus generalization, 127, 218, 221-
222, 253-254, 465, 511
Stimulus generalization, gradients
plotted in time, speed, and strength- 
of-pull scores, 282-283 

(See also Learning set and error fac- 
tor theory; Transposition) 

Stimulus interaction, afferent, 316 

Stimulus-maturation hypothesis of dis-
tributed practice, 558

Stimulus preference, 513-515, 524-595 

(See also Error factor analysis) 

Stimulus-response relations, 452-453 

S-R theory (see Association by contigu- 
ity; Hull-Spence approach; Liberal- 
ization of basic S-R concepts; Scien- 
tific method; Statistical learning 
theory) 

Stimulus-saturation theory of distributed
practice, 554

Stimulus similarity, 217-225, 240 

(See also Stimulus generalization) 

Stimulus and stimulus-fluctuation model, 
451-452 

Stochastic process, 612-614 

Strain toward symmetry, 79 

Strategy for development of system* (see 
Themes of analysis) 

Structure of system* (see Themes of 
analysis) 

Study of Psychology, Project A, v-vii,
1-6, 663-675 

conceptual and systematic (Study I), 
v-vi, 1-2, 663-675 
epilogue to, 5 
grounds for selection of systematic 
formulations, 2 
trends, for sensory psychology, sup- 
plement on, 5 
design of, 1-3 
empirical substructure and relations 
with other sciences (Study II),
vi, 2-3
panel of consultants, vi, vii, 6
postscript volume, vi, 3, 5
special features of, 3-4 
steering committee, vi, vii, 6 
(See also Themes of analysis) 

Subgoals, 129-130 

Subresponse, 185 

Substitute valence, 34-35, 77-78 

(See also Stimulus similarity; Valence) 

Substitute value, 35, 40-41, 77-78 

Success and failure, and level of aspira- 
tion, 43 

and Zeigarnik effect, 33-34

Successive approximation, 56-58, 233-
234, 285-286, 621-622

Summation, heterogeneous, 589, 592 

Superstitious behavior, 321n.

Symbolic logic, 672 


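*For a detailed statement of aims, design, working methods, history, etc., of
Psychology: A Study of a Science, see Vol. 1, pp. 1-40. Pages 1-18 comprise the
“General Introduction to the Series”; pp. 19-40, the “Introduction to Study I:
Conceptual and Systematic.”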


Symmetry, strain toward, in social inter- 
action, 79 
Synapse, 362 
System, formal organization of* (see 
Themes of analysis)
System equation, 642-643 
Systematic formulations, v-vi, 663-673
behavioral, 5 
definition of, 1, 663 
mathematical, 5 
(See also Models*; Quantification*)
substantive vs. methodic, 5
(See also Themes of analysis; Theory) 
Systematic variables,* 668-669 
vs. empirical, 668, 669 
(See also Variables*)
Systematization, positivistic, 667, 670
(See also Language, of description*;
Level, of analysis*; Reduction-
ism*; Systematic formulations;
Theory)
Systems, amplifying, 639, 641-642, 644 
discrete variable and continuous vari-
able, 639-641
man-machine, 637-641, 643-645 
(See also Linear frequency theory) 
Systems engineering (see Engineering
psychology; Frequency analysis;
Linear frequency theory; Man-
machine systems; Servo theory)


Tendency, 586-588, 590 

Tension, 23, 29, 32-35 
and associative bonds, 39
deprivation and satiation as affecting,


inner-personal systems of, 30, 76 
Dil of boundaries as affecting, 
88 
systems, equilibrium in, 33, 54, 79 
(See also Force; Zeigarnik effect) 
Themes of analysis, 2, 4, 5, 663-673 
(See also table of contents for each 
article)
background factors and orienting atti-
tudes,* 666-667
bracketed numbers in authors’ tables 
of contents (see index numbers, 
below) 
construction of function forms,* 670-


discussion topics suggested, 663-673 

evidence for system,* 672-673 

formal organization of system,* 671-
672


history of system in mediating re-
search,* 672
index numbers, use of, 5, 674-675


Themes of analysis, initial evidential
grounds for system,* 669-670 

introduction to, 663-664 

mensurational and quantificational 
procedures,* 671 

methods, concepts, or principles be- 
lieved valuable outside context of 
system,* 673 

programmaticity,* 673 

rationale of, 664-666 

scope of system,* 672 

strategy for development of system,*

3 


structure of system,* 667-669 
Study II, 3 
(See also Study of psychology, Project
A)

Theorems, 369, 461, 611 
coding, 621-622
derivation of, 306-308, 404 
Fourier integral, 649 
for information transmission, 626-628 
Pythagorean, 200, 202n. 
topological, 60-65 
(See also Axioms*; Information
theory; Postulates*)

Theory, and art combined, 203, 660 
association by contiguity, 158-195 
cognitive, 243, 262, 339 
construction as part of training in 

psychology, 360 
contiguity, 415-416, 433-436 
(See also Association by contiguity) 
deterministic compared with proba- 
bilistic, 386-387 
ethology, some recent trends, 561-598 
field, 9-10, 82-84
(See also Lewinian theory; Princi- 
ples of purposive behavior) 


formal opposed to substantive, 5, 611-
614, 629-631
graph, 61-65 
Hull-Spence approach, 293-348 
hypothetico-deductive (see Hypothet- 
ico-deductive method*) 
information, 611-634 
of interference, 437-438 
learning (see Learning theory*) 
learning set and error factor, 492-533 
Lewinian, 7-84 
liberalization of basic S-R concepts, 
196-286 


limited vs. general, 202-203, 233-234,
241-242, 400, 485-486
linear frequency, 637-660 
methodological requirements of, 638 
micromolar, 242, 260, 297-298, 300, 
341-343 
nature of, 1, 170-172, 200-201, 383- 
386, 396-403, 565, 663-664 
need for, v, 215 
normative, 613, 629-630 
“positivistic,” 667, 670 
premature, 539-540 
purposive behavior, 92-152 


Theory, qualitative vs. quantitative, 280-
286, 443-445, 472-493, 550-552
reductionist (see Reductionism*) 
in relation to empirical research (see 
Empiricism)
rote-learning, 538-559 
scientific method, case history in, 359-
sophistication of, and simplification of 
experiments, 203 
statistical learning, 380-486 
technique of analyzing real example 
of, 273 
tests of assumptions of, 405-423 
two-factor, 317, 403-405, 526-531, 
533 
uniprocess, 526-531, 533 
(See also S-R theory; Systematic form- 
ulations; Themes of analysis) 
Thinking, 242-248, 275 
creative, 244-246 
emotional factors in, 246-247 
Thirst, as drive indicating next action, 


interaction with hunger, 223-225, 
253, 278-279, 317 
as intervening variable, 277-278 
Threat posture of birds, 571, 577-578, 
586-587, 590-591 

Threshold, 312, 592 
reduced by drive, 254-256, 325 

Time, 19-20 
scores compared with speed and 

strength-of-pull, 282-283 

Time-dependent phenomena, 463-464 

Time function, 647-648, 652, 657-658 

Time perspective, fear, hope, and guilt 

related to, 54 
as property of life space, 23-24 
Time-on-target error score, 647 
Topological concepts, as aids to descrip- 
tion, 13 
compared with Hullian, 13 
and hodological, 61 
theorems of topology and psycholog- 
ical events, 60-65 

Topological psychology, 9 
(See also Field theory; Lewinian 

theory) 

Tracking, 647-648, 650, 652-657 
pursuit and compensatory, 651-652 
(See also Motor behavior) 

Training scientists, 359-361 

Transfer, 246, 494, 502-503 
and learning sets, 504-505, 521-522 
to measure means-end readiness, 116 
Osgood, surface of, 559 
and stimulus compounding, 423, 458 
(See also Generalization; Retroactive 

inhibition; Stimulus similarity; 
Transposition)

Transference, 20 

Transform T, 650-651, 653-654 

Transitional states, 640 




Transmission (see Information) 
Transposition, 127, 249, 316, 337-339, 
511 


Tremor, characteristic, frequency, 654 

Trial-and-error learning, 328, 496-497 
and goal-directed thinking, 244-246 
Trial operators, 467


Unconscious, 173 
Uniprocess learning theory, 526-531, 533 
Unit, of material learned, 544-545 
of measurement, 308-309 
(See also Measurement*; Method; 
Quantification* ; Scaling) 
Unlearning, 171-172 


Vacuum activities, 592 
Valence, in group members, 35-36 
as intervening variable, 107-109, 129 
23 


positive and negative, 27-28, 32, 34 

substitute, 34-35, 77-78 

and tension, 32, 34, 55n. 

vs. value, 108-109 

Values, group norms and, 45

in Lewinian theory, 44-45, 54, 83-84 

in Dead theory, 102, 106n., 108- 
0 


Variable systems, 639-641
(See also Linear frequency theory) 
Variables,* 667-671 
anchoring of, 554-555 
dependent, 100, 119, 183-185, 580-
581, 668, 671
definition of, 668 
empirical vs. systematic, definition 
of, 668-669, 671 
mathematical, definition of, 669 
response probability, 392-395, 454-
5 


time, speed, and strength-of-pull 
scores compared, 282-283 
ical output, 325-328, 639-640, 


empirical vs. systematic distinction,
applications of, 147-148, 214-
215, 223-225, 309-310, 314-315,
447-450, 454-456, 671 
independent, 667-669, 671 
in contiguity theory, 178-180 
definition of, 667-668 
empirical vs. systematic, definition
of, 668-669, 671
Variables,* independent, in ethological
studies, 581-582 
mathematical, definition of, 669 
past, 99-105
pointer readings for measuring, 109- 
111 


present, 99, 107, 118 
in statistical learning theory, 455- 
456 


independent-intervening-dependent 
schema, 667-668, 670 
intervening, 57-58, 94-95, 97-98, 100, 
105-109, 111-119, 668, 670
becoming dependent, 542 
comparison of Hullian and Tolman,
134-135
in contiguity theory, 180-183 
definition of, 668 
in ethology, 582-583
experimental design to justify unity 
of, 276-280 
input, 314-323, 639, 650 
pointer readings for, 111-114 
systematic vs. empirical, definition
of, 668-669, 671 
use of mechanisms to relate, 311, 
444 


Variance, analysis of, 444 
Vector psychology (see Field theory;
Lewinian theory)

Vectors, performance, 134-146, 149
(See also Topological concepts)

Verbal conditioning, 411-415, 422-423, 
482-484 

VTE (vicarious trial and error), 97, 
138, 140-143, 167-168 


Walden Two, 378 
Waning of response, internal and ex- 
ternal factors in, 593 
Weber’s law, 259, 307-308 
(See also Fechner’s law)
White noise, 652
Wisconsin General Test Apparatus, 495
Work inhibition, 518, 526
(See also Reactive inhibition)
Work theories, fatigue, motivation, and
stimulus saturation, 553-555 


Zeigarnik effect, 17-18, 19, 32-34 
and resumption of activity, 39-40 
substitution and, 40-41 
(See also Disruption experiments;
Tension)

The extensive investigations reported in
Psychology: A Study of a Science were
sponsored by the American Psychological
Association and subsidized by the National
Science Foundation. The Study has been
in progress since fall, 1952, under the
somewhat cryptic designation of “Project
A.” It has profited from the counsel of a
distinguished Advisory Committee under
the chairmanship of Dael Wolfle, and the
dedicated help of a Panel of Consultants
to the Director, comprising Howard H.
Kendler, Lyle H. Lanier, Conrad G.
Mueller, and Karl E. Zener.

Sigmund Koch received his B.A. from
New York University, 1938; his M.A.
from State University of Iowa in 1939;
and his Ph.D. from Duke University,
1942. Since 1942 his academic career has
been centered mainly at Duke University,
where he is Professor of Psychology.
During 1947-1948 he was visiting Associ-
ate Professor of Psychology, Clark Uni-
versity. In 1952-1953 he was Fulbright
Professor at the University of London
(University College).

It was during his Fulbright year that Pro-
fessor Koch began, on a part-time basis,
to work on what was then known as Proj-
ect A of the American Psychological As-
sociation Study of the Status and Devel-
opment of Psychology in the U.S., and
which eventuated in Psychology: A Study
of a Science.

During 1953-1955 he administered part
of the project that resulted in Psychology:
A Study of a Science at the Institute of
Human Relations of Yale University,
where he was a guest of the Psychology